Actions: ggml-org/llama.cpp

Server

10,607 workflow runs

vulkan: matmul dequantization improvements
Server #11214: Pull request #12015 synchronize by netrunnereve
February 26, 2025 01:38 8m 4s netrunnereve:vulkan_mm
tool-call: add support for tool-calls using Model Context Protocol
Server #11212: Pull request #11556 synchronize by bandoti
February 25, 2025 18:55 7m 38s bandoti:llamacli-tools
docs: add docs/function-calling.md to lighten server/README.md's plig…
Server #11211: Commit d7cfe1f pushed by ochafik
February 25, 2025 18:52 8m 46s master
tool-call: add support for tool-calls using Model Context Protocol
Server #11210: Pull request #11556 synchronize by bandoti
February 25, 2025 18:27 8m 12s bandoti:llamacli-tools
llama : add xcframework build script
Server #11209: Pull request #11996 synchronize by danbev
February 25, 2025 17:06 8m 57s danbev:xcframework-build-10747
docs: add docs/function-calling.md to lighten server/README.md's plight
Server #11207: Pull request #12069 synchronize by ochafik
February 25, 2025 15:56 8m 26s ochafik:tool-docs
Add GGML_HIP_ROCWMMA_FATTN to enable rocWMMA for FlashAttention
Server #11206: Pull request #12032 synchronize by hjc4869
February 25, 2025 15:54 Action required hjc4869:pr
docs: add docs/function-calling.md to lighten server/README.md's plight
Server #11205: Pull request #12069 synchronize by ochafik
February 25, 2025 15:53 3m 54s ochafik:tool-docs
docs: add docs/function-calling.md to lighten server/README.md's plight
Server #11204: Pull request #12069 opened by ochafik
February 25, 2025 15:48 4m 49s ochafik:tool-docs
vulkan: fix assertion when qy_needs_dequant (#12068)
Server #11203: Commit a82c9e7 pushed by 0cc4m
February 25, 2025 15:30 9m 28s master
Add GGML_HIP_ROCWMMA_FATTN to enable rocWMMA for FlashAttention
Server #11202: Pull request #12032 synchronize by hjc4869
February 25, 2025 15:12 Action required hjc4869:pr
llama : add xcframework build script
Server #11201: Pull request #11996 synchronize by danbev
February 25, 2025 15:06 8m 10s danbev:xcframework-build-10747
Add GGML_HIP_ROCWMMA_FATTN to enable rocWMMA for FlashAttention
Server #11200: Pull request #12032 synchronize by hjc4869
February 25, 2025 15:00 Action required hjc4869:pr
llama : add xcframework build script
Server #11199: Pull request #11996 synchronize by danbev
February 25, 2025 14:42 25m 1s danbev:xcframework-build-10747
tool-call: add support for tool-calls using Model Context Protocol
Server #11198: Pull request #11556 synchronize by bandoti
February 25, 2025 14:41 19m 39s bandoti:llamacli-tools
tool-call: add support for tool-calls using Model Context Protocol
Server #11197: Pull request #11556 synchronize by bandoti
February 25, 2025 14:15 26m 46s bandoti:llamacli-tools
vulkan: fix assertion when qy_needs_dequant
Server #11196: Pull request #12068 opened by jeffbolznv
February 25, 2025 14:14 33m 5s jeffbolznv:qy_dequant_assert
llama : refactor llama_kv_cache, llama_context and llm_build_context
Server #11194: Pull request #11213 synchronize by ggerganov
February 25, 2025 14:11 8m 33s gg/llama-kv-cache
ggml: aarch64: implement SVE kernels for q2_k_q8_k vector dot
Server #11192: Pull request #12064 synchronize by Vithulep
February 25, 2025 13:33 7m 47s Vithulep:Q2_k_SVE_Kernel
Cache based tokenization for the server input prompts
Server #11191: Pull request #12067 opened by vnicolici
February 25, 2025 13:08 Action required vnicolici:cache-based-tokenization
llama : add xcframework build script
Server #11190: Pull request #11996 synchronize by danbev
February 25, 2025 12:55 27m 58s danbev:xcframework-build-10747