koboldcpp

mirror of https://github.com/LostRuins/koboldcpp.git synced 2026-05-27 17:13:41 +00:00

History

Georgi Gerganov 3600cc2886 llama : use n_swa + n_ubatch cells for SWA cache (#13833 ) * llama : use n_swa + n_ubatch cells for SWA cache ggml-ci * llama : add warning about multi-sqeuence SWA contexts		2025-05-31 15:57:44 +03:00
..
llama-cpp.h	llama : add `llama_vocab`, functions -> methods, naming (#11110 )	2025-01-12 11:32:42 +02:00
llama.h	llama : use n_swa + n_ubatch cells for SWA cache (#13833 )	2025-05-31 15:57:44 +03:00