Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

VLLM Serverless Endpoint Deployment using Python #392

Open
arianyambao opened this issue Jan 3, 2025 · 2 comments
Open

VLLM Serverless Endpoint Deployment using Python #392

arianyambao opened this issue Jan 3, 2025 · 2 comments
Labels
enhancement New feature or request

Comments

@arianyambao
Copy link

Hello How do we deploy a VLLM Serverless endpoint using Python?

Do we explicitly have to create templates first? Not straight to the point of using the vLLM ready template?

@arianyambao arianyambao added the enhancement New feature or request label Jan 3, 2025
@nerdylive123
Copy link

nerdylive123 commented Feb 8, 2025

I don't get what you mean, maybe you mean wanna add something to the vllm for your own use?
maybe you can try making a dockerfile but with the base image of VLLM-worker, and modify the entrypoint or CMD

@deanq
Copy link
Collaborator

deanq commented Feb 9, 2025

@arianyambao Have you looked at this article? How to run vLLM with RunPod Serverless

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants