koboldcpp/ggml/src/ggml-vulkan
Jeff Bolz 0c74b04376
vulkan: fix NaN issue in flash attention shader (#12776)
Use -FLT_MAX/2 rather than -inf as the initial value for computing the maximum.
2025-04-06 11:03:47 +02:00
..
cmake cmake: fix ggml-shaders-gen compiler paths containing spaces (#12747) 2025-04-04 10:12:40 -03:00
vulkan-shaders vulkan: fix NaN issue in flash attention shader (#12776) 2025-04-06 11:03:47 +02:00
CMakeLists.txt vulkan: Fix missing cmake logic for dot product extension (#12721) 2025-04-03 10:08:26 -05:00
ggml-vulkan.cpp vulkan: Use unclamped loads for flash attention mask (#12720) 2025-04-06 10:47:13 +02:00