A Production-Ready, Scalable RAG-powered LLM-based Context-Aware QA App
-
Custom AI Generator: prime your LLM with this automated embedding generator and model Q&A interface. Uses Retrieval-Augmented Generation (RAG) to reduce hallucinations and ground the LLM in a source of truth.
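As a rough sketch of the retrieval step such an app might use (not the repo's actual code): the embedding model name, the toy document store, and `generate_answer` are all illustrative assumptions.

```python
# Minimal RAG retrieval sketch. Assumes `pip install sentence-transformers numpy`;
# model name, documents, and generate_answer() are placeholders, not from the repo.
import numpy as np
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")  # assumed embedding model

documents = [
    "Ray Serve scales Python model-serving applications.",
    "RAG grounds LLM answers in retrieved source documents.",
]
doc_embeddings = model.encode(documents, normalize_embeddings=True)

def retrieve(question: str, k: int = 1) -> list[str]:
    """Return the k documents most similar to the question."""
    q = model.encode([question], normalize_embeddings=True)[0]
    scores = doc_embeddings @ q  # cosine similarity (embeddings are normalized)
    return [documents[i] for i in np.argsort(scores)[::-1][:k]]

def generate_answer(question: str, context: list[str]) -> str:
    # Hypothetical LLM call: prepend retrieved context to ground the answer.
    prompt = "Context:\n" + "\n".join(context) + f"\n\nQuestion: {question}"
    return prompt  # replace with a real LLM client in practice

print(generate_answer("What does RAG do?", retrieve("What does RAG do?")))
```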
A drop-in replacement for FastAPI that enables scalable, fault-tolerant deployments with Ray Serve, as sketched below.
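A minimal sketch of the standard Ray Serve + FastAPI integration that pattern builds on; the route, replica count, and handler body are illustrative assumptions, not taken from the repo.

```python
# Sketch of wrapping a FastAPI app in a Ray Serve deployment.
# Requires `pip install "ray[serve]" fastapi`; route and replica count are assumed.
from fastapi import FastAPI
from ray import serve

app = FastAPI()

@serve.deployment(num_replicas=2)  # Serve replicates the app for scale and fault tolerance
@serve.ingress(app)                # routes FastAPI traffic through Serve
class QAService:
    @app.get("/answer")
    def answer(self, question: str) -> dict:
        # Placeholder logic; a real app would invoke the RAG pipeline here.
        return {"answer": f"You asked: {question}"}

serve.run(QAService.bind())  # starts Serve locally and deploys the app
```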
Contains the basic structure that a model-serving application should have. This implementation is based on the Ray Serve framework.
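For reference, a minimal version of that structure in plain Ray Serve (no FastAPI): load the model once per replica in `__init__`, handle HTTP requests in `__call__`. The toy model here is a placeholder assumption.

```python
# Minimal Ray Serve deployment structure; the lambda stands in for a real model.
from ray import serve
from starlette.requests import Request

@serve.deployment
class ModelServer:
    def __init__(self):
        # Expensive setup (e.g. model loading) runs once per replica.
        self.model = lambda text: text.upper()  # stand-in for a real model

    async def __call__(self, request: Request) -> dict:
        payload = await request.json()
        return {"prediction": self.model(payload["text"])}

serve.run(ModelServer.bind())
```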