Skip to content

CUDA: optimize MMQ int8 tensor core performance#8062

Merged
JohannesGaessler merged 3 commits intoggml-org:masterfrom JohannesGaessler:cuda-mmq-2xa-3Jun 24, 2024

Commits