koboldcpp

mirror of https://github.com/LostRuins/koboldcpp.git synced 2026-05-22 11:16:08 +00:00

ScrewTSW b65bb4baae server: expose prompt token counts in /slots endpoint (#23454)	2026-05-21 13:29:13 +02:00

Add n_prompt_tokens, n_prompt_tokens_processed, and n_prompt_tokens_cache to the /slots JSON response. These fields are already tracked internally but were not exposed, making it impossible for clients to monitor prompt evaluation progress during processing.
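
As a rough illustration of how a client might consume the three new fields, here is a minimal Python sketch. Only the field names come from the commit message. The response shape (a JSON array of slot objects with an `id` key), the sample payload, the helper names `prompt_counts` and `prompt_progress`, and the reading of progress as processed tokens divided by total prompt tokens are all assumptions made for this example. In particular, the commit message does not say whether the processed count includes cached tokens.

```python
import json

# Field names taken from the commit message for #23454.
FIELDS = ("n_prompt_tokens", "n_prompt_tokens_processed", "n_prompt_tokens_cache")


def prompt_counts(slot):
    """Return the three prompt token counters of one slot, using 0 for any missing field."""
    return {name: int(slot.get(name, 0)) for name in FIELDS}


def prompt_progress(slot):
    """Estimate prompt evaluation progress as a fraction in [0, 1].

    Assumption: progress = n_prompt_tokens_processed / n_prompt_tokens.
    The real relationship between the counters may differ.
    """
    counts = prompt_counts(slot)
    total = counts["n_prompt_tokens"]
    if total <= 0:
        return 0.0
    ratio = counts["n_prompt_tokens_processed"] / total
    return min(1.0, max(0.0, ratio))


# Hypothetical payload: a stand-in for the body returned by the /slots endpoint.
SAMPLE = (
    '[{"id": 0, "n_prompt_tokens": 200, '
    '"n_prompt_tokens_processed": 50, "n_prompt_tokens_cache": 20}]'
)

slots = json.loads(SAMPLE)
for slot in slots:
    print(slot["id"], prompt_counts(slot), f"{prompt_progress(slot):.0%}")
```

A client would replace `SAMPLE` with the body of a real request to /slots and poll it while a prompt is being evaluated. Missing fields default to 0, so the sketch also tolerates servers built before this commit.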
batched-bench	app : add batched-bench, fit-params, quantize & perplexity (#23459)	2026-05-21 10:29:44 +03:00
cli	mtmd, model : merge HunyuanOCR into HunyuanVL and fix OCR vision precision (#23329)	2026-05-21 00:35:37 +02:00
completion	mtmd, model : merge HunyuanOCR into HunyuanVL and fix OCR vision precision (#23329)	2026-05-21 00:35:37 +02:00
cvector-generator	libs : rename libcommon -> libllama-common (#21936)	2026-04-17 11:11:46 +03:00
export-lora	libs : rename libcommon -> libllama-common (#21936)	2026-04-17 11:11:46 +03:00
fit-params	app : add batched-bench, fit-params, quantize & perplexity (#23459)	2026-05-21 10:29:44 +03:00
gguf-split	libs : rename libcommon -> libllama-common (#21936)	2026-04-17 11:11:46 +03:00
imatrix	libs : rename libcommon -> libllama-common (#21936)	2026-04-17 11:11:46 +03:00
llama-bench	app : introduce the llama unified executable (#23296)	2026-05-20 13:22:22 +02:00
mtmd	mtmd, model : merge HunyuanOCR into HunyuanVL and fix OCR vision precision (#23329)	2026-05-21 00:35:37 +02:00
parser	libs : rename libcommon -> libllama-common (#21936)	2026-04-17 11:11:46 +03:00
perplexity	app : add batched-bench, fit-params, quantize & perplexity (#23459)	2026-05-21 10:29:44 +03:00
quantize	app : add batched-bench, fit-params, quantize & perplexity (#23459)	2026-05-21 10:29:44 +03:00
results	libs : rename libcommon -> libllama-common (#21936)	2026-04-17 11:11:46 +03:00
rpc	fix: rpc-server cache may not work in Windows environments (#22394)	2026-04-27 17:25:09 +03:00
server	server: expose prompt token counts in /slots endpoint (#23454)	2026-05-21 13:29:13 +02:00
tokenize	libs : rename libcommon -> libllama-common (#21936)	2026-04-17 11:11:46 +03:00
tts	logs : reduce (#23021)	2026-05-14 13:05:52 +03:00
ui	ui: Improve Git Hooks for UI development (#23403)	2026-05-21 08:27:50 +02:00
CMakeLists.txt	ui: Restructure repo to use `tools/ui` folder and `ui` / `UI` / `llama-ui` / `LLAMA_UI` naming (#23064)	2026-05-16 02:02:40 +02:00