Commit graph

11 commits

Author SHA1 Message Date
kallewoof
23ec550835
PoC: add chat template heuristics (#1283)
* PoC: add chat template heuristics

The fallback chat template adapter of Vicuna is not ideal in some cases (e.g. a test against a sub-portion of the BBC news classification task on Kaggle gave an 82% accuracy with Vicuna and 88% with the official ChatML format for a q4_k_m Qwen 2.5 3B-Instruct gguf).

This PR adds a proof of concept simple heuristic which looks at the chat template and upgrades the adapter when it is able to.

* gemma 2 heuristic

* Phi 4, Llama 3.x heuristics

* better qwen vs generic heuristic

* cleanup

* mistral (generic) heuristic

* fix sys msg for mistral

* phi 3.5

* mistral v3

* cohere (aya expanse 32b based)

* only derive from chat template if AutoGuess

* add notes about alpaca fallbacks

* added AutoGuess.json dummy

* add mistral v7

* switch to using a json list with search strings
2024-12-28 12:15:23 +08:00
Concedo
a577015425 consolidate mistral templates (too many) 2024-12-15 00:03:34 +08:00
pandora
a548108dd2
Create Mistral-V7.json (#1224) 2024-11-19 10:45:50 +08:00
Concedo
c38d1ecc8d update templates, fix rwkv 2024-09-22 01:32:12 +08:00
pandora
d4d19f6dbe
Fixing Mistral Templates (#1131)
* Update and rename Mistral.json to Mistral V1.json

* Create Mistral V2 & V3

* Rename Mistral V2 & V3 to Mistral V2 & V3.json

* Create Mistral V3-Tekken.json
2024-09-22 01:30:17 +08:00
xzuyn
add0a88111
Update kcpp_adapters (#985)
* Update kcpp_adapters

* Update kcpp_adapters
2024-07-13 19:06:15 +08:00
Nexesenex
c9c098dab2
Add mistral chat adapter (#983) 2024-07-09 19:38:35 +08:00
Concedo
5e458f42da update gemma format 2024-07-06 18:54:20 +08:00
Concedo
82202aebda updated lite, add gemma 2 template 2024-07-02 21:02:52 +08:00
henk717
8421243c6d
Chat Adapters (#956)
* Give the CI builds a recognizable AVX1 name

* Chat Adapters
2024-06-30 10:28:43 +08:00
Concedo
73b99a7266 add premade chat completions adapter 2024-06-27 00:13:06 +08:00