koboldcpp/ggml
Concedo 3aec5ed0fd Kcpp triage for rowsplit: revert https://github.com/ggml-org/llama.cpp/pull/16715 until https://github.com/ggml-org/llama.cpp/issues/16799 is resolved
revert https://github.com/ggml-org/llama.cpp/pull/16715 (+2 squashed commit)

Squashed commit:

[289af2ee2] Revert "Hide latency of bias and gate-loading (#16847)"

This reverts commit 8b11deea46.

[a3e5c1e95] Revert "CUDA: add unused vars to mmvf and mmvq (#16807)"

This reverts commit 463bbf20bf.
2025-11-02 09:58:41 +08:00
..
include Merge branch 'upstream' into concedo_experimental 2025-10-31 10:52:57 +08:00
src Kcpp triage for rowsplit: revert https://github.com/ggml-org/llama.cpp/pull/16715 until https://github.com/ggml-org/llama.cpp/issues/16799 is resolved 2025-11-02 09:58:41 +08:00