GitHub - smc40/llm2go: Open Source Large Language Models as an API service

Goal: API Service for interchangeable, locally hosted LLMs

User Story: As a Data Scientist I would like to quickly use and test newly released LLMs so I can compare thir performances.

Links

FastAPI: https://fastapi.tiangolo.com/
LLM Leaderboard: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard
Inspiration or even solution?!: https://github.com/huggingface/text-generation-inference

Use Huggingface Models

Install transformers

pip install transformers, torch, SentencePiece, accelerate

download and use a model

from transformers import T5Tokenizer, T5ForConditionalGeneration

tokenizer = T5Tokenizer.from_pretrained("google/flan-t5-small")
model = T5ForConditionalGeneration.from_pretrained("google/flan-t5-small")

prompt = "In the following sentence, what is the drugname: Ibuprofen is well known to cause diarrhia."
input_ids = tokenizer(prompt, return_tensors="pt").input_ids

outputs = model.generate(input_ids, max_length = 512)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
.gitignore		.gitignore
100sentences.csv		100sentences.csv
LICENSE		LICENSE
README.md		README.md
api.py		api.py
base.py		base.py
logo.png		logo.png
modeling.py		modeling.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Links

Use Huggingface Models

About

Releases

Packages

Contributors 2

Languages

License

smc40/llm2go

Folders and files

Latest commit

History

Repository files navigation

Links

Use Huggingface Models

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages