Mirror of https://github.com/LostRuins/koboldcpp.git (synced 2026-05-17 04:09:19 +00:00)
* SYCL: reduce allocation overhead during flash attention
* tidy up whitespace
* add a note about the flag
* move ggml_sycl_fattn_* into fattn-buffers.hpp
* refactor implementation into fattn-buffers.cpp
* move new_fattn_kv_buffers back into ggml-sycl.cpp

| Name | ||
|---|---|---|
| snapdragon | ||
| VirtGPU | ||
| BLIS.md | ||
| CANN.md | ||
| CUDA-FEDORA.md | ||
| OPENCL.md | ||
| OPENVINO.md | ||
| SYCL.md | ||
| VirtGPU.md | ||
| zDNN.md | ||
| ZenDNN.md | ||