..
batched-bench
app : add batched-bench, fit-params, quantize & perplexity ( #23459 )
2026-05-21 10:29:44 +03:00
cli
mtmd, model : merge HunyuanOCR into HunyuanVL and fix OCR vision precision ( #23329 )
2026-05-21 00:35:37 +02:00
completion
mtmd, model : merge HunyuanOCR into HunyuanVL and fix OCR vision precision ( #23329 )
2026-05-21 00:35:37 +02:00
cvector-generator
libs : rename libcommon -> libllama-common ( #21936 )
2026-04-17 11:11:46 +03:00
export-lora
libs : rename libcommon -> libllama-common ( #21936 )
2026-04-17 11:11:46 +03:00
fit-params
app : add batched-bench, fit-params, quantize & perplexity ( #23459 )
2026-05-21 10:29:44 +03:00
gguf-split
libs : rename libcommon -> libllama-common ( #21936 )
2026-04-17 11:11:46 +03:00
imatrix
libs : rename libcommon -> libllama-common ( #21936 )
2026-04-17 11:11:46 +03:00
llama-bench
app : introduce the llama unified executable ( #23296 )
2026-05-20 13:22:22 +02:00
mtmd
mtmd, model : merge HunyuanOCR into HunyuanVL and fix OCR vision precision ( #23329 )
2026-05-21 00:35:37 +02:00
parser
libs : rename libcommon -> libllama-common ( #21936 )
2026-04-17 11:11:46 +03:00
perplexity
app : add batched-bench, fit-params, quantize & perplexity ( #23459 )
2026-05-21 10:29:44 +03:00
quantize
app : add batched-bench, fit-params, quantize & perplexity ( #23459 )
2026-05-21 10:29:44 +03:00
results
libs : rename libcommon -> libllama-common ( #21936 )
2026-04-17 11:11:46 +03:00
rpc
fix: rpc-server cache may not work in Windows environments ( #22394 )
2026-04-27 17:25:09 +03:00
server
server: expose prompt token counts in /slots endpoint ( #23454 )
2026-05-21 13:29:13 +02:00
tokenize
libs : rename libcommon -> libllama-common ( #21936 )
2026-04-17 11:11:46 +03:00
tts
logs : reduce ( #23021 )
2026-05-14 13:05:52 +03:00
ui
ui: Improve Git Hooks for UI development ( #23403 )
2026-05-21 08:27:50 +02:00
CMakeLists.txt
ui: Restructure repo to use tools/ui folder and ui / UI / llama-ui / LLAMA_UI naming ( #23064 )
2026-05-16 02:02:40 +02:00