Skip to content
@ModelTC

ModelTC

Model Infra

Pinned Loading

  1. MQBench Public

    Model Quantization Benchmark

    Python 800 142

  2. United-Perception Public

    United Perception

    Python 432 67

  3. Dipoorlet Public

    Offline Quantization Tools for Deploy.

    Python 127 17

  4. lightllm Public

    LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

    Python 3.2k 249

  5. llmc Public

    [EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".

    Python 462 54

  6. OmniBal Public

    Python 20 3

Repositories

Showing 10 of 48 repositories
  • lightllm Public

    LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

    Python 3,159 Apache-2.0 249 76 9 Updated Apr 26, 2025
  • lightx2v Public
    Python 13 6 0 2 Updated Apr 25, 2025
  • 0 0 0 0 Updated Apr 25, 2025
  • Dockerfile 0 0 0 0 Updated Apr 24, 2025
  • llmc Public

    [EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".

    Python 462 Apache-2.0 54 29 0 Updated Apr 23, 2025
  • general-sam-py Public

    Python bindings for general-sam and some utilities

    Python 3 Apache-2.0 0 0 1 Updated Apr 22, 2025
  • MQBench Public

    Model Quantization Benchmark

    Python 800 Apache-2.0 142 8 5 Updated Apr 20, 2025
  • flash-attention Public Forked from Dao-AILab/flash-attention

    Fast and memory-efficient exact attention

    Python 0 BSD-3-Clause 1,647 0 0 Updated Apr 17, 2025
  • greedy-tokenizer Public

    Greedily tokenize strings with the longest tokens iteratively.

    Python 0 Apache-2.0 0 0 1 Updated Mar 24, 2025
  • mtc-token-healing Public

    Token healing implementation in Rust

    Rust 4 Apache-2.0 0 0 0 Updated Mar 22, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Most used topics

Loading…