koboldcpp/.github/workflows
Concedo f3bb947a13 cuda use wmma flash attention for turing (+1 squashed commits)
Squashed commits:

[3c5112398] 117 (+10 squashed commit)

Squashed commit:

[4f01bb2d4] 117 graphs 80v

[7549034ea] 117 graphs

[dabf9cb99] checking if cuda 11.5.2 works

[ba7ccdb7a] another try cu11.7 only

[752cf2ae5] increase aria2c download log rate

[dc4f198fd] test send turing to wmma flash attention

[496a22e83] temp build test cu11.7.0

[ca759c424] temp build test cu11.7

[c46ada17c] test build: enable virtual80 for oldcpu

[3ccfd939a] test build: with cuda graphs for all
2025-06-01 11:41:45 +08:00
File                                     Last commit                                                     Date
kcpp-build-release-arm64.yaml            added a simple cross platform launch script for unpacked dirs   2025-05-22 22:09:46 +08:00
kcpp-build-release-linux-cuda12.yaml     tryout smaller binaries                                         2025-05-07 14:56:34 +08:00
kcpp-build-release-linux-rocm.yaml       KoboldCpp.sh updates (#1562)                                    2025-05-26 15:24:49 +08:00
kcpp-build-release-linux.yaml            KoboldCpp.sh updates (#1562)                                    2025-05-26 15:24:49 +08:00
kcpp-build-release-osx.yaml              added a simple cross platform launch script for unpacked dirs   2025-05-22 22:09:46 +08:00
kcpp-build-release-win-full-cu12.yaml    cuda use wmma flash attention for turing (+1 squashed commits)  2025-06-01 11:41:45 +08:00
kcpp-build-release-win-full.yaml         improved comfyui compatibility, tweaked hf search               2025-05-02 16:18:31 +08:00
kcpp-build-release-win-oldcpu-full.yaml  improved comfyui compatibility, tweaked hf search               2025-05-02 16:18:31 +08:00