koboldcpp/otherarch
Wagner Bruna b437d18319
add support for cache modes to accelerate image generation (#2021)
* sd: sync to master-525-d6dd6d7

* sd: add support for cache modes for inference acceleration

* keep gendefaults as a JSON object inside the config file

* covered more invalid cases on gendefaults parsing
2026-03-15 15:27:14 +08:00
..
acestep claude fixes for ace step, idk man who am i to argue with an agi 2026-03-14 12:27:26 +08:00
qwen3tts fix q3tts compile, update docs and lite 2026-03-14 23:33:18 +08:00
sdcpp add support for cache modes to accelerate image generation (#2021) 2026-03-15 15:27:14 +08:00
tools add old convert tool 2025-06-21 08:40:04 +08:00
ttscpp Adjust int types in format strings (#2009) 2026-03-06 19:06:18 +08:00
whispercpp Adjust int types in format strings (#2009) 2026-03-06 19:06:18 +08:00
embeddings_adapter.cpp Adjust int types in format strings (#2009) 2026-03-06 19:06:18 +08:00
ggml_v1.c OpenBSD also needs alloca.h (#2012) 2026-03-05 12:32:31 +08:00
ggml_v1.h
ggml_v2-cuda-legacy.cu try fix compile issues rocm (+4 squashed commit) 2025-09-21 22:31:09 +08:00
ggml_v2-cuda-legacy.h
ggml_v2-cuda.cu try fix compile issues rocm (+4 squashed commit) 2025-09-21 22:31:09 +08:00
ggml_v2-cuda.h
ggml_v2.c remove clblast, part 2 2026-01-23 14:09:46 +08:00
ggml_v2.h
ggml_v3-cuda.cu try fix compile issues rocm (+4 squashed commit) 2025-09-21 22:31:09 +08:00
ggml_v3-cuda.h attempts a backflip, but does he stick the landing? 2024-11-16 17:05:45 +08:00
ggml_v3.c try fix compile issues 2024-09-19 13:56:19 +08:00
ggml_v3.h Fixed some GGUFv1 loading bugs, long overdue cleanup for compiling, integrated TTS 2025-01-13 14:23:25 +08:00
gpt2_v1.cpp
gpt2_v2.cpp Adjust int types in format strings (#2009) 2026-03-06 19:06:18 +08:00
gpt2_v3.cpp
gptj_v1.cpp
gptj_v2.cpp Adjust int types in format strings (#2009) 2026-03-06 19:06:18 +08:00
gptj_v3.cpp Adjust int types in format strings (#2009) 2026-03-06 19:06:18 +08:00
llama-util.h
llama_v2-util.h
llama_v2.cpp added a nicer built in voice 2025-01-13 23:26:54 +08:00
llama_v2.h Fixed some GGUFv1 loading bugs, long overdue cleanup for compiling, integrated TTS 2025-01-13 14:23:25 +08:00
llama_v3.cpp added a nicer built in voice 2025-01-13 23:26:54 +08:00
llama_v3.h
mpt_v3.cpp
neox_v2.cpp
neox_v3.cpp
otherarch.h kv snapshots save and load last logits for correctness. added some text for musicui, updated docs 2026-03-04 21:57:28 +08:00
rwkv_v2.cpp Adjust int types in format strings (#2009) 2026-03-06 19:06:18 +08:00
rwkv_v2.h
rwkv_v3.cpp
rwkv_v3.h
rwkv_vocab.cpp move embeds into their own dir, detach sd vocab into separate files 2025-10-03 14:21:09 +08:00
tts_adapter.cpp fix q3tts compile, update docs and lite 2026-03-14 23:33:18 +08:00
utils.cpp cover mode is now working 2026-03-13 14:55:39 +08:00
utils.h cover mode is now working 2026-03-13 14:55:39 +08:00