koboldcpp

mirror of https://github.com/LostRuins/koboldcpp.git synced 2025-09-11 17:44:38 +00:00

Author	SHA1	Message	Date
kallewoof	23ec550835	PoC: add chat template heuristics (#1283 ) * PoC: add chat template heuristics The fallback chat template adapter of Vicuna is not ideal in some cases (e.g. a test against a sub-portion of the BBC news classification task on Kaggle gave an 82% accuracy with Vicuna and 88% with the official ChatML format for a q4_k_m Qwen 2.5 3B-Instruct gguf). This PR adds a proof of concept simple heuristic which looks at the chat template and upgrades the adapter when it is able to. * gemma 2 heuristic * Phi 4, Llama 3.x heuristics * better qwen vs generic heuristic * cleanup * mistral (generic) heuristic * fix sys msg for mistral * phi 3.5 * mistral v3 * cohere (aya expanse 32b based) * only derive from chat template if AutoGuess * add notes about alpaca fallbacks * added AutoGuess.json dummy * add mistral v7 * switch to using a json list with search strings	2024-12-28 12:15:23 +08:00
Concedo	a577015425	consolidate mistral templates (too many)	2024-12-15 00:03:34 +08:00
pandora	a548108dd2	Create Mistral-V7.json (#1224 )	2024-11-19 10:45:50 +08:00
Concedo	c38d1ecc8d	update templates, fix rwkv	2024-09-22 01:32:12 +08:00
pandora	d4d19f6dbe	Fixing Mistral Templates (#1131 ) * Update and rename Mistral.json to Mistral V1.json * Create Mistral V2 & V3 * Rename Mistral V2 & V3 to Mistral V2 & V3.json * Create Mistral V3-Tekken.json	2024-09-22 01:30:17 +08:00
xzuyn	add0a88111	Update kcpp_adapters (#985 ) * Update kcpp_adapters * Update kcpp_adapters	2024-07-13 19:06:15 +08:00
Nexesenex	c9c098dab2	Add mistral chat adapter (#983 )	2024-07-09 19:38:35 +08:00
Concedo	5e458f42da	update gemma format	2024-07-06 18:54:20 +08:00
Concedo	82202aebda	updated lite, add gemma 2 template	2024-07-02 21:02:52 +08:00
henk717	8421243c6d	Chat Adapters (#956 ) * Give the CI builds a recognizable AVX1 name * Chat Adapters	2024-06-30 10:28:43 +08:00
Concedo	73b99a7266	add premade chat completions adapter	2024-06-27 00:13:06 +08:00

11 commits