koboldcpp

mirror of https://github.com/LostRuins/koboldcpp.git synced 2026-07-09 17:08:33 +00:00


Concedo e944cca86f Merge branch 'upstream' into concedo_experimental (conflicts: scripts/sync_vendor.py, src/llama-model-loader.cpp, tests/test-backend-ops.cpp, tests/test-chat-auto-parser.cpp, tests/test-chat.cpp, tools/cli/cli.cpp, tools/server/README.md)		2026-07-05 11:30:12 +08:00
jinja	Merge commit '8c146a8366' into concedo_experimental	2026-06-30 22:00:03 +08:00
arg.cpp	Merge branch 'upstream' into concedo_experimental	2026-07-02 21:42:36 +08:00
arg.h	arg: fix handling --spec-draft-hf and --hf-repo-v (#25043)	2026-06-26 14:36:03 +02:00
base64.hpp	llava : expose as a shared library for downstream projects (#3613)	2023-11-07 00:36:23 +03:00
build-info.cpp.in	libs : rename libcommon -> libllama-common (#21936)	2026-04-17 11:11:46 +03:00
build-info.h	Merge branch 'upstream' into concedo_experimental	2026-04-17 22:37:37 +08:00
chat-auto-parser-generator.cpp	common/peg : implement ac parser for stricter grammar generation (#24869)	2026-06-21 16:20:58 -05:00
chat-auto-parser-helpers.cpp	server: fix checkpoints creation (#22929)	2026-05-25 08:56:18 +03:00
chat-auto-parser-helpers.h	chat : avoid including json in chat.h (#21306)	2026-04-03 09:07:59 +03:00
chat-auto-parser.h	chat: harden peg-native tool call parsing (#24329)	2026-06-15 15:37:04 +02:00
chat-diff-analyzer.cpp	chat: harden peg-native tool call parsing (#24329)	2026-06-15 15:37:04 +02:00
chat-peg-parser.cpp	chat : fix LFM2 tool-call parsing double-escaping (#24667)	2026-06-15 22:10:09 +02:00
chat-peg-parser.h	chat: harden peg-native tool call parsing (#24329)	2026-06-15 15:37:04 +02:00
chat.cpp	Merge branch 'upstream' into concedo_experimental	2026-07-05 11:30:12 +08:00
chat.h	server: improve user message detection and create checkpoints at every user message (#24176)	2026-06-23 08:27:28 +03:00
common.cpp	Merge branch 'upstream' into concedo_experimental	2026-06-29 16:43:44 +08:00
common.h	Merge branch 'upstream' into concedo_experimental	2026-06-29 16:43:44 +08:00
console.cpp	cli: fix stripping of \n in multiline input (#21485)	2026-04-06 20:54:06 +02:00
console.h	cli : add command and file auto-completion (#19985)	2026-03-05 10:47:28 +01:00
debug.cpp	common: fix missing exports in llama-common (#22340)	2026-04-27 08:06:39 +03:00
debug.h	common: fix missing exports in llama-common (#22340)	2026-04-27 08:06:39 +03:00
download.cpp	fix builds	2026-06-27 18:50:05 +08:00
download.h	Merge commit '9d5d882d8c' into concedo_experimental	2026-06-27 10:18:52 +08:00
fit.cpp	logs : reduce v2 (#25078)	2026-06-28 08:52:15 +03:00
fit.h	fit : wrap llama_device_memory_data (#24522)	2026-06-13 08:09:52 +03:00
hf-cache.cpp	server: (router) add model management API (#23976)	2026-06-17 18:04:58 +02:00
hf-cache.h	server: (router) add model management API (#23976)	2026-06-17 18:04:58 +02:00
http.h	common,server: handle bracketed IPv6 literals in URL authority (#25140)	2026-06-30 16:16:44 +02:00
imatrix-loader.cpp	Move duplicated imatrix code into single common imatrix-loader.cpp (#22445)	2026-06-04 17:45:40 +02:00
imatrix-loader.h	Move duplicated imatrix code into single common imatrix-loader.cpp (#22445)	2026-06-04 17:45:40 +02:00
json-schema-to-grammar.cpp	common/json-schema-to-grammar : align spacing rules with parsers (#24835)	2026-06-20 17:43:04 -05:00
json-schema-to-grammar.h	common : add nemotron 3 parsing (#18077)	2025-12-16 04:05:23 -06:00
llguidance.cpp	sampling : add support for backend sampling (#17004)	2026-01-04 22:22:16 +02:00
log.cpp	common: update logging to enforce max_capacity and optimize queue resizing (#24490)	2026-06-17 09:19:11 +03:00
log.h	logs : reduce (#23021)	2026-05-14 13:05:52 +03:00
ngram-cache.cpp	spec : add self‑speculative decoding (no draft model required) + refactor (#18471)	2026-01-28 19:42:42 +02:00
ngram-cache.h	spec : add self‑speculative decoding (no draft model required) + refactor (#18471)	2026-01-28 19:42:42 +02:00
ngram-map.cpp	llama : MTP clean-up (#23269)	2026-05-19 15:32:58 +03:00
ngram-map.h	fix: correct misspellings in code comments (#21217)	2026-03-31 13:50:51 +02:00
ngram-mod.cpp	ngram-mod : Add missing include (#23857)	2026-05-29 09:21:37 +03:00
ngram-mod.h	ngram-mod : fix build [no ci] (#19216)	2026-01-30 21:27:27 +02:00
peg-parser.cpp	common/peg : implement ac parser for stricter grammar generation (#24869)	2026-06-21 16:20:58 -05:00
peg-parser.h	common/peg : implement ac parser for stricter grammar generation (#24869)	2026-06-21 16:20:58 -05:00
preset.cpp	common : dedup preset and cached model entries in /v1/models (#25131)	2026-06-29 17:37:23 +02:00
preset.h	server: (router) rework -hf preset repo (#24739)	2026-06-18 12:45:23 +02:00
reasoning-budget.cpp	logs : reduce v2 (#25078)	2026-06-28 08:52:15 +03:00
reasoning-budget.h	common : support manually triggering the reasoning budget end sequence (#23949)	2026-06-01 11:37:11 +02:00
sampling.cpp	server : return HTTP 400 on invalid grammar (#24144) (#24154)	2026-06-18 12:49:14 +02:00
sampling.h	common : relax sampler name matching (#23744)	2026-06-07 22:48:11 +02:00
speculative.cpp	Merge branch 'upstream' into concedo_experimental	2026-07-05 11:30:12 +08:00
speculative.h	spec: support eagle3 for qwen3.5 & 3.6 (#24593)	2026-06-19 13:08:50 +03:00
unicode.cpp	common/parser: handle reasoning budget (#20297)	2026-03-11 10:26:12 +01:00
unicode.h	common/parser: handle reasoning budget (#20297)	2026-03-11 10:26:12 +01:00