koboldcpp

mirror of https://github.com/LostRuins/koboldcpp.git synced 2026-04-28 03:30:20 +00:00

Wagner Bruna eed5577aaa fix unintended sd model quantization (#1672)	2025-08-08 10:19:58 +08:00

The recent ggml update added another quant type, GGML_TYPE_MXFP4, which got the same value as SD_TYPE_COUNT. That made the embedded sd.cpp quantize to GGML_TYPE_MXFP4 by default. Photomaker in particular ends up crashing due to "Missing CPY op for types: f32 mxfp4".
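
The collision described in this commit message can be sketched in a few lines. This is only an illustration under stated assumptions: the enum names, their values, and the two helper functions below are hypothetical stand-ins, not the real ggml or sd.cpp definitions, and the idea that the count sentinel means "keep the model's own type" is an assumption about how the default is interpreted.

```cpp
#include <cassert>

// Hypothetical stand-ins for illustration; these are NOT the real ggml or sd.cpp enums.
// Backend enum after an update appended a new type, which shifted its COUNT sentinel.
enum backend_type { BACKEND_F32 = 0, BACKEND_F16 = 1, BACKEND_MXFP4 = 2, BACKEND_COUNT = 3 };

// Wrapper enum that was not updated: its COUNT sentinel keeps the old value,
// which now equals BACKEND_MXFP4.
enum wrapper_type { WRAPPER_F32 = 0, WRAPPER_F16 = 1, WRAPPER_COUNT = 2 };

// Assumed convention: the sentinel means "no override, keep the model's own type".
// Buggy check: compares a wrapper value against the backend's sentinel.
constexpr bool requests_quantization_buggy(int wtype) {
    return wtype != BACKEND_COUNT;
}

// Safer check: compares against the sentinel of the enum the value came from.
constexpr bool requests_quantization_fixed(int wtype) {
    return wtype != WRAPPER_COUNT;
}

// Walks through the default case, where the caller asked for no override.
inline void check_default() {
    const int default_wtype = WRAPPER_COUNT;
    assert(default_wtype == BACKEND_MXFP4);               // the value collision
    assert(requests_quantization_buggy(default_wtype));   // wrongly treated as a real type
    assert(!requests_quantization_fixed(default_wtype));  // treated as "no override"
}
```

Calling `check_default()` exercises both checks for the default value. The general point is that a count sentinel is only safe to compare within the enum that defines it; once two enums drift apart, the old sentinel can alias a real type.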
sdcpp	fix unintended sd model quantization (#1672)	2025-08-08 10:19:58 +08:00
tools	add old convert tool	2025-06-21 08:40:04 +08:00
whispercpp	switch to miniaudio, support mp3 for whisper	2025-07-13 23:24:07 +08:00
embeddings_adapter.cpp	default kv_unified to true, handle LLAMA_SET_ROWS.	2025-07-21 16:13:20 +08:00
ggml_v1.c
ggml_v1.h
ggml_v2-cuda-legacy.cu	attempts a backflip, but does he stick the landing?	2024-11-16 17:05:45 +08:00
ggml_v2-cuda-legacy.h
ggml_v2-cuda.cu	attempts a backflip, but does he stick the landing?	2024-11-16 17:05:45 +08:00
ggml_v2-cuda.h
ggml_v2-opencl-legacy.c
ggml_v2-opencl-legacy.h
ggml_v2-opencl.cpp
ggml_v2-opencl.h
ggml_v2.c
ggml_v2.h
ggml_v3-cuda.cu	cleaned up unused flags from makefile, updated lite	2025-01-30 19:34:55 +08:00
ggml_v3-cuda.h	attempts a backflip, but does he stick the landing?	2024-11-16 17:05:45 +08:00
ggml_v3-opencl.cpp
ggml_v3-opencl.h
ggml_v3.c	try fix compile issues	2024-09-19 13:56:19 +08:00
ggml_v3.h	Fixed some GGUFv1 loading bugs, long overdue cleanup for compiling, integrated TTS	2025-01-13 14:23:25 +08:00
ggml_v3b-opencl.cpp	better warning message	2025-05-21 21:47:40 +08:00
ggml_v3b-opencl.h	clean and rename old clblast files in preparation for merge	2024-12-15 15:29:02 +08:00
gpt2_v1.cpp
gpt2_v2.cpp
gpt2_v3.cpp
gptj_v1.cpp
gptj_v2.cpp
gptj_v3.cpp
llama-util.h
llama_v2-util.h
llama_v2.cpp	added a nicer built in voice	2025-01-13 23:26:54 +08:00
llama_v2.h	Fixed some GGUFv1 loading bugs, long overdue cleanup for compiling, integrated TTS	2025-01-13 14:23:25 +08:00
llama_v3.cpp	added a nicer built in voice	2025-01-13 23:26:54 +08:00
llama_v3.h
mpt_v3.cpp
neox_v2.cpp
neox_v3.cpp
otherarch.h	updated lite, added better separators for multimodal chunks (universal)	2025-07-17 00:11:08 +08:00
rwkv_v2.cpp	Fixed some GGUFv1 loading bugs, long overdue cleanup for compiling, integrated TTS	2025-01-13 14:23:25 +08:00
rwkv_v2.h
rwkv_v3.cpp	added llava letterboxing feature	2024-08-25 23:15:38 +08:00
rwkv_v3.h
rwkv_vocab.cpp
tts_adapter.cpp	default kv_unified to true, handle LLAMA_SET_ROWS.	2025-07-21 16:13:20 +08:00
utils.cpp	fixed swa pp bug by retrying smaller batches	2025-07-21 23:34:22 +08:00
utils.h	fixed swa pp bug by retrying smaller batches	2025-07-21 23:34:22 +08:00