koboldcpp

mirror of https://github.com/LostRuins/koboldcpp.git synced 2025-09-13 02:19:41 +00:00

History

Concedo f3bb947a13 cuda use wmma flash attention for turing (+1 squashed commits) Squashed commits: [3c5112398] 117 (+10 squashed commit) Squashed commit: [4f01bb2d4] 117 graphs 80v [7549034ea] 117 graphs [dabf9cb99] checking if cuda 11.5.2 works [ba7ccdb7a] another try cu11.7 only [752cf2ae5] increase aria2c download log rate [dc4f198fd] test send turing to wmma flash attention [496a22e83] temp build test cu11.7.0 [ca759c424] temp build test cu11.7 [c46ada17c] test build: enable virtual80 for oldcpu [3ccfd939a] test build: with cuda graphs for all	2025-06-01 11:41:45 +08:00
..
ISSUE_TEMPLATE	repo : update links to new url (#11886 )	2025-02-15 16:40:57 +02:00
workflows	cuda use wmma flash attention for turing (+1 squashed commits)	2025-06-01 11:41:45 +08:00

Concedo f3bb947a13 cuda use wmma flash attention for turing (+1 squashed commits)

Squashed commits:

[3c5112398] 117 (+10 squashed commit)

Squashed commit:

[4f01bb2d4] 117 graphs 80v

[7549034ea] 117 graphs

[dabf9cb99] checking if cuda 11.5.2 works

[ba7ccdb7a] another try cu11.7 only

[752cf2ae5] increase aria2c download log rate

[dc4f198fd] test send turing to wmma flash attention

[496a22e83] temp build test cu11.7.0

[ca759c424] temp build test cu11.7

[c46ada17c] test build: enable virtual80 for oldcpu

[3ccfd939a] test build: with cuda graphs for all

2025-06-01 11:41:45 +08:00

ISSUE_TEMPLATE

repo : update links to new url (#11886 )

2025-02-15 16:40:57 +02:00

workflows

cuda use wmma flash attention for turing (+1 squashed commits)

2025-06-01 11:41:45 +08:00