koboldcpp

mirror of https://github.com/LostRuins/koboldcpp.git synced 2025-09-10 09:04:36 +00:00

Author	SHA1	Message	Date
Concedo	fda682fa12	updated lite	2025-04-30 19:49:54 +08:00
Concedo	e561eeb5e5	qwen3 no think template fix	2025-04-29 21:06:27 +08:00
Concedo	c2802af9e8	fix qwen3, fixed sd, fixed glm4	2025-04-29 20:50:46 +08:00
Concedo	4d8a7a6594	fix occasional clip segfault, fix glm4 (+1 squashed commits) Squashed commits: [bd71cd688] GLM4 fix wip	2025-04-29 01:42:50 +08:00
kallewoof	7cb815b727	AutoGuess: GLM-4 (#1502 ) * AutoGuess: GLM-4 * add 'chat_start' field to adapters * GLM-4 fix	2025-04-26 08:47:42 +08:00
Concedo	d32d0b382a	glm4 template	2025-04-25 00:41:15 +08:00
Concedo	a6149ad0fc	fixed g3 adapter back	2025-04-12 23:17:54 +08:00
Concedo	c94aec1930	update workflows, update gemma default adapter sysprompt	2025-04-12 18:38:23 +08:00
Concedo	efef14bb82	added llama4 tags	2025-04-12 08:58:04 +08:00
Concedo	93a226d9e4	added prefix for llava, reverted system role in template as it degreaded gemma3. truncated debug logs	2025-04-05 18:06:41 +08:00
Concedo	669311365c	fixed gemma system prompt	2025-04-02 13:58:51 +08:00
Concedo	0ed95fcccc	fixed l3 template, add index	2025-03-31 23:59:06 +08:00
Concedo	6a1dd57435	gemma3 template, updated lite, fixed tool calling, reenable ctx shift for gemma3	2025-03-14 17:47:01 +08:00
Concedo	6b7d2349a7	Rewrite history to fix bad vulkan shader commits without increasing repo size added dpe colab (+8 squashed commit) Squashed commit: [b8362da4] updated lite [ed6c037d] move nsigma into the regular sampler stack [ac5f61c6] relative filepath fixed [05fe96ab] export template [ed0a5a3e] nix_example.md: refactor (#1401) * nix_example.md: add override example * nix_example.md: drop graphics example, already basic nixos knowledge * nix_example.md: format * nix_example.md: Vulkan is disabled on macOS Disabled in: `1ccd253acc` * nix_examples.md: nixpkgs.config.cuda{Arches -> Capabilities} Fixes: https://github.com/LostRuins/koboldcpp/issues/1367 [675c62f7] AutoGuess: Phi 4 (mini) (#1402) [`4bf56982`] phrasing [`b8c0df04`] Add Rep Pen to Top N Sigma sampler chain (#1397) - place after nsigma and before xtc (+3 squashed commit) Squashed commit: [`87c52b97`] disable VMM from HIP [`ee8906f3`] edit description [`e85c0e69`] Remove Unnecessary Rep Counting (#1394) * stop counting reps * fix range-based initializer * strike that - reverse it	2025-03-05 00:02:20 +08:00
Concedo	cca4a934dd	fix for chat templates and drafting	2025-01-23 11:49:40 +08:00
kallewoof	1cb9805024	add autoguess adapter for DeepSeek V2.5/R1 (#1329 )	2025-01-22 20:39:04 +08:00
kallewoof	23ec550835	PoC: add chat template heuristics (#1283 ) * PoC: add chat template heuristics The fallback chat template adapter of Vicuna is not ideal in some cases (e.g. a test against a sub-portion of the BBC news classification task on Kaggle gave an 82% accuracy with Vicuna and 88% with the official ChatML format for a q4_k_m Qwen 2.5 3B-Instruct gguf). This PR adds a proof of concept simple heuristic which looks at the chat template and upgrades the adapter when it is able to. * gemma 2 heuristic * Phi 4, Llama 3.x heuristics * better qwen vs generic heuristic * cleanup * mistral (generic) heuristic * fix sys msg for mistral * phi 3.5 * mistral v3 * cohere (aya expanse 32b based) * only derive from chat template if AutoGuess * add notes about alpaca fallbacks * added AutoGuess.json dummy * add mistral v7 * switch to using a json list with search strings	2024-12-28 12:15:23 +08:00
Concedo	a577015425	consolidate mistral templates (too many)	2024-12-15 00:03:34 +08:00
pandora	a548108dd2	Create Mistral-V7.json (#1224 )	2024-11-19 10:45:50 +08:00
Concedo	c38d1ecc8d	update templates, fix rwkv	2024-09-22 01:32:12 +08:00
pandora	d4d19f6dbe	Fixing Mistral Templates (#1131 ) * Update and rename Mistral.json to Mistral V1.json * Create Mistral V2 & V3 * Rename Mistral V2 & V3 to Mistral V2 & V3.json * Create Mistral V3-Tekken.json	2024-09-22 01:30:17 +08:00
xzuyn	add0a88111	Update kcpp_adapters (#985 ) * Update kcpp_adapters * Update kcpp_adapters	2024-07-13 19:06:15 +08:00
Nexesenex	c9c098dab2	Add mistral chat adapter (#983 )	2024-07-09 19:38:35 +08:00
Concedo	5e458f42da	update gemma format	2024-07-06 18:54:20 +08:00
Concedo	82202aebda	updated lite, add gemma 2 template	2024-07-02 21:02:52 +08:00
henk717	8421243c6d	Chat Adapters (#956 ) * Give the CI builds a recognizable AVX1 name * Chat Adapters	2024-06-30 10:28:43 +08:00
Concedo	73b99a7266	add premade chat completions adapter	2024-06-27 00:13:06 +08:00

27 commits