koboldcpp/otherarch
Concedo 7e35954695 Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	docs/build.md
#	docs/function-calling.md
#	examples/eval-callback/eval-callback.cpp
#	ggml/CMakeLists.txt
#	ggml/src/ggml-cann/ggml-cann.cpp
#	ggml/src/ggml-cpu/CMakeLists.txt
#	ggml/src/ggml-cpu/kleidiai/kernels.cpp
#	ggml/src/ggml-cpu/kleidiai/kernels.h
#	ggml/src/ggml-cpu/kleidiai/kleidiai.cpp
#	scripts/compare-llama-bench.py
#	scripts/server-bench.py
#	scripts/tool_bench.py
#	tests/test-chat.cpp
#	tools/batched-bench/batched-bench.cpp
#	tools/llama-bench/llama-bench.cpp
#	tools/server/README.md
2025-08-31 23:33:36 +08:00
..
sdcpp minor adjustments for sdquant: allow backend to do the translation for the type more defensively, adjust the UI dropdown for clarity. 2025-08-22 23:23:32 +08:00
tools add old convert tool 2025-06-21 08:40:04 +08:00
ttscpp can't resolve the clicking 2025-08-26 17:55:03 +08:00
whispercpp improve whisper transcribe apt regex 2025-08-22 17:13:51 +08:00
embeddings_adapter.cpp Merge branch 'upstream' into concedo_experimental 2025-08-31 23:33:36 +08:00
ggml_v1.c
ggml_v1.h
ggml_v2-cuda-legacy.cu attempts a backflip, but does he stick the landing? 2024-11-16 17:05:45 +08:00
ggml_v2-cuda-legacy.h
ggml_v2-cuda.cu attempts a backflip, but does he stick the landing? 2024-11-16 17:05:45 +08:00
ggml_v2-cuda.h
ggml_v2-opencl-legacy.c
ggml_v2-opencl-legacy.h
ggml_v2-opencl.cpp
ggml_v2-opencl.h
ggml_v2.c
ggml_v2.h
ggml_v3-cuda.cu cleaned up unused flags from makefile, updated lite 2025-01-30 19:34:55 +08:00
ggml_v3-cuda.h attempts a backflip, but does he stick the landing? 2024-11-16 17:05:45 +08:00
ggml_v3-opencl.cpp
ggml_v3-opencl.h
ggml_v3.c try fix compile issues 2024-09-19 13:56:19 +08:00
ggml_v3.h Fixed some GGUFv1 loading bugs, long overdue cleanup for compiling, integrated TTS 2025-01-13 14:23:25 +08:00
ggml_v3b-opencl.cpp better warning message 2025-05-21 21:47:40 +08:00
ggml_v3b-opencl.h clean and rename old clblast files in preparation for merge 2024-12-15 15:29:02 +08:00
gpt2_v1.cpp
gpt2_v2.cpp
gpt2_v3.cpp
gptj_v1.cpp
gptj_v2.cpp
gptj_v3.cpp
llama-util.h
llama_v2-util.h
llama_v2.cpp added a nicer built in voice 2025-01-13 23:26:54 +08:00
llama_v2.h Fixed some GGUFv1 loading bugs, long overdue cleanup for compiling, integrated TTS 2025-01-13 14:23:25 +08:00
llama_v3.cpp added a nicer built in voice 2025-01-13 23:26:54 +08:00
llama_v3.h
mpt_v3.cpp
neox_v2.cpp
neox_v3.cpp
otherarch.h updated lite, added better separators for multimodal chunks (universal) 2025-07-17 00:11:08 +08:00
rwkv_v2.cpp Fixed some GGUFv1 loading bugs, long overdue cleanup for compiling, integrated TTS 2025-01-13 14:23:25 +08:00
rwkv_v2.h
rwkv_v3.cpp added llava letterboxing feature 2024-08-25 23:15:38 +08:00
rwkv_v3.h
rwkv_vocab.cpp
tts_adapter.cpp Merge branch 'upstream' into concedo_experimental 2025-08-31 23:33:36 +08:00
utils.cpp fixed swa pp bug by retrying smaller batches 2025-07-21 23:34:22 +08:00
utils.h fixed swa pp bug by retrying smaller batches 2025-07-21 23:34:22 +08:00