    Repositories list

    • Fast and memory-efficient exact attention
      Python
      BSD 3-Clause "New" or "Revised" License
      1.7k17k72862 Updated May 16, 2025
    • Causal depthwise conv1d in CUDA, with a PyTorch interface
      Cuda
      BSD 3-Clause "New" or "Revised" License
      96454228 Updated May 9, 2025
    • Python
      Apache License 2.0
      01900 Updated May 5, 2025
    • cutlass
      CUDA Templates for Linear Algebra Subroutines
      C++
      Other
      1.2k100 Updated Apr 4, 2025
    • Fast Hadamard transform in CUDA, with a PyTorch interface
      C
      BSD 3-Clause "New" or "Revised" License
      2318662 Updated May 24, 2024