koboldcpp

mirror of https://github.com/LostRuins/koboldcpp.git synced 2026-05-10 04:00:53 +00:00

History

Wagner Bruna 5de7ed3d56 WIP: update stable-diffusion.cpp to 5900ef6605c6 (new API) (#1669 ) * Update stable-diffusion.cpp to 5900ef6605c6 (new API) * Clean up pending LoRA code and simplify LoRA changes to upstream * Move VAE tiling disabling for TAESD to sdtype_adapter.cpp * Move auxiliary ctx functions to sdtype_adapter.cpp * Use ref_images parameter for Kontext images * Drop clip skip workaround (fixed upstream) * Workaround for flash attention with img2img leejet/stable-diffusion.cpp#756 * Workaround for Chroma with flash attention, debug prints * Disable forcing CLIP weights to F32 for reduced memory usage		2025-08-12 23:25:02 +08:00
..
sdcpp	WIP: update stable-diffusion.cpp to 5900ef6605c6 (new API) (#1669 )	2025-08-12 23:25:02 +08:00
tools	add old convert tool	2025-06-21 08:40:04 +08:00
whispercpp	switch to miniaudio, support mp3 for whisper	2025-07-13 23:24:07 +08:00
embeddings_adapter.cpp	default kv_unified to true, handle LLAMA_SET_ROWS.	2025-07-21 16:13:20 +08:00
ggml_v1.c
ggml_v1.h
ggml_v2-cuda-legacy.cu	attempts a backflip, but does he stick the landing?	2024-11-16 17:05:45 +08:00
ggml_v2-cuda-legacy.h
ggml_v2-cuda.cu	attempts a backflip, but does he stick the landing?	2024-11-16 17:05:45 +08:00
ggml_v2-cuda.h
ggml_v2-opencl-legacy.c
ggml_v2-opencl-legacy.h
ggml_v2-opencl.cpp
ggml_v2-opencl.h
ggml_v2.c	Merge commit '`280345968d`' into concedo_experimental	2024-04-07 20:27:17 +08:00
ggml_v2.h
ggml_v3-cuda.cu	cleaned up unused flags from makefile, updated lite	2025-01-30 19:34:55 +08:00
ggml_v3-cuda.h	attempts a backflip, but does he stick the landing?	2024-11-16 17:05:45 +08:00
ggml_v3-opencl.cpp
ggml_v3-opencl.h
ggml_v3.c	try fix compile issues	2024-09-19 13:56:19 +08:00
ggml_v3.h	Fixed some GGUFv1 loading bugs, long overdue cleanup for compiling, integrated TTS	2025-01-13 14:23:25 +08:00
ggml_v3b-opencl.cpp	better warning message	2025-05-21 21:47:40 +08:00
ggml_v3b-opencl.h	clean and rename old clblast files in preparation for merge	2024-12-15 15:29:02 +08:00
gpt2_v1.cpp
gpt2_v2.cpp
gpt2_v3.cpp	Merge commit '`280345968d`' into concedo_experimental	2024-04-07 20:27:17 +08:00
gptj_v1.cpp
gptj_v2.cpp
gptj_v3.cpp	Merge commit '`280345968d`' into concedo_experimental	2024-04-07 20:27:17 +08:00
llama-util.h	Merge commit '`280345968d`' into concedo_experimental	2024-04-07 20:27:17 +08:00
llama_v2-util.h	Merge commit '`280345968d`' into concedo_experimental	2024-04-07 20:27:17 +08:00
llama_v2.cpp	added a nicer built in voice	2025-01-13 23:26:54 +08:00
llama_v2.h	Fixed some GGUFv1 loading bugs, long overdue cleanup for compiling, integrated TTS	2025-01-13 14:23:25 +08:00
llama_v3.cpp	added a nicer built in voice	2025-01-13 23:26:54 +08:00
llama_v3.h	Merge commit '`280345968d`' into concedo_experimental	2024-04-07 20:27:17 +08:00
mpt_v3.cpp	Merge commit '`280345968d`' into concedo_experimental	2024-04-07 20:27:17 +08:00
neox_v2.cpp
neox_v3.cpp	Merge commit '`280345968d`' into concedo_experimental	2024-04-07 20:27:17 +08:00
otherarch.h	updated lite, added better separators for multimodal chunks (universal)	2025-07-17 00:11:08 +08:00
rwkv_v2.cpp	Fixed some GGUFv1 loading bugs, long overdue cleanup for compiling, integrated TTS	2025-01-13 14:23:25 +08:00
rwkv_v2.h
rwkv_v3.cpp	added llava letterboxing feature	2024-08-25 23:15:38 +08:00
rwkv_v3.h
rwkv_vocab.cpp	vae test	2024-05-14 19:17:01 +08:00
tts_adapter.cpp	default kv_unified to true, handle LLAMA_SET_ROWS.	2025-07-21 16:13:20 +08:00
utils.cpp	fixed swa pp bug by retrying smaller batches	2025-07-21 23:34:22 +08:00
utils.h	fixed swa pp bug by retrying smaller batches	2025-07-21 23:34:22 +08:00