koboldcpp/otherarch
Wagner Bruna 5de7ed3d56
WIP: update stable-diffusion.cpp to 5900ef6605c6 (new API) (#1669)
* Update stable-diffusion.cpp to 5900ef6605c6 (new API)

* Clean up pending LoRA code and simplify LoRA changes to upstream

* Move VAE tiling disabling for TAESD to sdtype_adapter.cpp

* Move auxiliary ctx functions to sdtype_adapter.cpp

* Use ref_images parameter for Kontext images

* Drop clip skip workaround (fixed upstream)

* Workaround for flash attention with img2img

leejet/stable-diffusion.cpp#756

* Workaround for Chroma with flash attention, debug prints

* Disable forcing CLIP weights to F32 for reduced memory usage
2025-08-12 23:25:02 +08:00
..
sdcpp WIP: update stable-diffusion.cpp to 5900ef6605c6 (new API) (#1669) 2025-08-12 23:25:02 +08:00
tools add old convert tool 2025-06-21 08:40:04 +08:00
whispercpp switch to miniaudio, support mp3 for whisper 2025-07-13 23:24:07 +08:00
embeddings_adapter.cpp default kv_unified to true, handle LLAMA_SET_ROWS. 2025-07-21 16:13:20 +08:00
ggml_v1.c
ggml_v1.h
ggml_v2-cuda-legacy.cu attempts a backflip, but does he stick the landing? 2024-11-16 17:05:45 +08:00
ggml_v2-cuda-legacy.h
ggml_v2-cuda.cu attempts a backflip, but does he stick the landing? 2024-11-16 17:05:45 +08:00
ggml_v2-cuda.h
ggml_v2-opencl-legacy.c
ggml_v2-opencl-legacy.h
ggml_v2-opencl.cpp
ggml_v2-opencl.h
ggml_v2.c Merge commit '280345968d' into concedo_experimental 2024-04-07 20:27:17 +08:00
ggml_v2.h
ggml_v3-cuda.cu cleaned up unused flags from makefile, updated lite 2025-01-30 19:34:55 +08:00
ggml_v3-cuda.h attempts a backflip, but does he stick the landing? 2024-11-16 17:05:45 +08:00
ggml_v3-opencl.cpp
ggml_v3-opencl.h
ggml_v3.c try fix compile issues 2024-09-19 13:56:19 +08:00
ggml_v3.h Fixed some GGUFv1 loading bugs, long overdue cleanup for compiling, integrated TTS 2025-01-13 14:23:25 +08:00
ggml_v3b-opencl.cpp better warning message 2025-05-21 21:47:40 +08:00
ggml_v3b-opencl.h clean and rename old clblast files in preparation for merge 2024-12-15 15:29:02 +08:00
gpt2_v1.cpp
gpt2_v2.cpp
gpt2_v3.cpp Merge commit '280345968d' into concedo_experimental 2024-04-07 20:27:17 +08:00
gptj_v1.cpp
gptj_v2.cpp
gptj_v3.cpp Merge commit '280345968d' into concedo_experimental 2024-04-07 20:27:17 +08:00
llama-util.h Merge commit '280345968d' into concedo_experimental 2024-04-07 20:27:17 +08:00
llama_v2-util.h Merge commit '280345968d' into concedo_experimental 2024-04-07 20:27:17 +08:00
llama_v2.cpp added a nicer built in voice 2025-01-13 23:26:54 +08:00
llama_v2.h Fixed some GGUFv1 loading bugs, long overdue cleanup for compiling, integrated TTS 2025-01-13 14:23:25 +08:00
llama_v3.cpp added a nicer built in voice 2025-01-13 23:26:54 +08:00
llama_v3.h Merge commit '280345968d' into concedo_experimental 2024-04-07 20:27:17 +08:00
mpt_v3.cpp Merge commit '280345968d' into concedo_experimental 2024-04-07 20:27:17 +08:00
neox_v2.cpp
neox_v3.cpp Merge commit '280345968d' into concedo_experimental 2024-04-07 20:27:17 +08:00
otherarch.h updated lite, added better separators for multimodal chunks (universal) 2025-07-17 00:11:08 +08:00
rwkv_v2.cpp Fixed some GGUFv1 loading bugs, long overdue cleanup for compiling, integrated TTS 2025-01-13 14:23:25 +08:00
rwkv_v2.h
rwkv_v3.cpp added llava letterboxing feature 2024-08-25 23:15:38 +08:00
rwkv_v3.h
rwkv_vocab.cpp vae test 2024-05-14 19:17:01 +08:00
tts_adapter.cpp default kv_unified to true, handle LLAMA_SET_ROWS. 2025-07-21 16:13:20 +08:00
utils.cpp fixed swa pp bug by retrying smaller batches 2025-07-21 23:34:22 +08:00
utils.h fixed swa pp bug by retrying smaller batches 2025-07-21 23:34:22 +08:00