Mesolitica
We develop Multimodality Artificial Intelligence for South East Asia.
Pinned Loading
Repositories
Showing 10 of 45 repositories
- Chunk-loss-LoRA Public
Fused kernel chunk loss to include LoRA to reduce memory, support DeepSpeed ZeRO3.
mesolitica/Chunk-loss-LoRA’s past year of commit activity - initial-paged-flash-attention Public Forked from huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
mesolitica/initial-paged-flash-attention’s past year of commit activity - transformers-openai-api Public
Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.
mesolitica/transformers-openai-api’s past year of commit activity - accelerate-torch-compile-speechlm Public Forked from huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
mesolitica/accelerate-torch-compile-speechlm’s past year of commit activity - dynamic-batch-TTS-pipeline Public
Dynamic batching for Speech Enhancement, Speech Tokenizer and TTS.
mesolitica/dynamic-batch-TTS-pipeline’s past year of commit activity - picotron-zero1 Public Forked from huggingface/picotron
Minimalistic 4D-parallelism distributed training framework for education purpose
mesolitica/picotron-zero1’s past year of commit activity