Skip to content

Allow setting PagedAttention KV cache allocation from context size#640

Merged
EricLBuehler merged 4 commits intomasterfrom pa_context_sizeJul 28, 2024

Commits

Commits on Jul 28, 2024