..
sdcpp
fix unintended sd model quantization ( #1672 )
2025-08-08 10:19:58 +08:00
tools
add old convert tool
2025-06-21 08:40:04 +08:00
whispercpp
switch to miniaudio, support mp3 for whisper
2025-07-13 23:24:07 +08:00
embeddings_adapter.cpp
default kv_unified to true, handle LLAMA_SET_ROWS.
2025-07-21 16:13:20 +08:00
ggml_v1.c
ggml_v1.h
ggml_v2-cuda-legacy.cu
attempts a backflip, but does he stick the landing?
2024-11-16 17:05:45 +08:00
ggml_v2-cuda-legacy.h
ggml_v2-cuda.cu
attempts a backflip, but does he stick the landing?
2024-11-16 17:05:45 +08:00
ggml_v2-cuda.h
ggml_v2-opencl-legacy.c
ggml_v2-opencl-legacy.h
ggml_v2-opencl.cpp
ggml_v2-opencl.h
ggml_v2.c
ggml_v2.h
ggml_v3-cuda.cu
cleaned up unused flags from makefile, updated lite
2025-01-30 19:34:55 +08:00
ggml_v3-cuda.h
attempts a backflip, but does he stick the landing?
2024-11-16 17:05:45 +08:00
ggml_v3-opencl.cpp
ggml_v3-opencl.h
ggml_v3.c
try fix compile issues
2024-09-19 13:56:19 +08:00
ggml_v3.h
Fixed some GGUFv1 loading bugs, long overdue cleanup for compiling, integrated TTS
2025-01-13 14:23:25 +08:00
ggml_v3b-opencl.cpp
better warning message
2025-05-21 21:47:40 +08:00
ggml_v3b-opencl.h
clean and rename old clblast files in preparation for merge
2024-12-15 15:29:02 +08:00
gpt2_v1.cpp
gpt2_v2.cpp
gpt2_v3.cpp
gptj_v1.cpp
gptj_v2.cpp
gptj_v3.cpp
llama-util.h
llama_v2-util.h
llama_v2.cpp
added a nicer built in voice
2025-01-13 23:26:54 +08:00
llama_v2.h
Fixed some GGUFv1 loading bugs, long overdue cleanup for compiling, integrated TTS
2025-01-13 14:23:25 +08:00
llama_v3.cpp
added a nicer built in voice
2025-01-13 23:26:54 +08:00
llama_v3.h
mpt_v3.cpp
neox_v2.cpp
neox_v3.cpp
otherarch.h
updated lite, added better separators for multimodal chunks (universal)
2025-07-17 00:11:08 +08:00
rwkv_v2.cpp
Fixed some GGUFv1 loading bugs, long overdue cleanup for compiling, integrated TTS
2025-01-13 14:23:25 +08:00
rwkv_v2.h
rwkv_v3.cpp
added llava letterboxing feature
2024-08-25 23:15:38 +08:00
rwkv_v3.h
rwkv_vocab.cpp
tts_adapter.cpp
default kv_unified to true, handle LLAMA_SET_ROWS.
2025-07-21 16:13:20 +08:00
utils.cpp
fixed swa pp bug by retrying smaller batches
2025-07-21 23:34:22 +08:00
utils.h
fixed swa pp bug by retrying smaller batches
2025-07-21 23:34:22 +08:00