Commit graph

27 commits

Author SHA1 Message Date
Concedo
fda682fa12 updated lite 2025-04-30 19:49:54 +08:00
Concedo
e561eeb5e5 qwen3 no think template fix 2025-04-29 21:06:27 +08:00
Concedo
c2802af9e8 fix qwen3, fixed sd, fixed glm4 2025-04-29 20:50:46 +08:00
Concedo
4d8a7a6594 fix occasional clip segfault, fix glm4 (+1 squashed commits)
Squashed commits:

[bd71cd688] GLM4 fix wip
2025-04-29 01:42:50 +08:00
kallewoof
7cb815b727
AutoGuess: GLM-4 (#1502)
* AutoGuess: GLM-4

* add 'chat_start' field to adapters

* GLM-4 fix
2025-04-26 08:47:42 +08:00
Concedo
d32d0b382a glm4 template 2025-04-25 00:41:15 +08:00
Concedo
a6149ad0fc fixed g3 adapter back 2025-04-12 23:17:54 +08:00
Concedo
c94aec1930 update workflows, update gemma default adapter sysprompt 2025-04-12 18:38:23 +08:00
Concedo
efef14bb82 added llama4 tags 2025-04-12 08:58:04 +08:00
Concedo
93a226d9e4 added prefix for llava, reverted system role in template as it degreaded gemma3. truncated debug logs 2025-04-05 18:06:41 +08:00
Concedo
669311365c fixed gemma system prompt 2025-04-02 13:58:51 +08:00
Concedo
0ed95fcccc fixed l3 template, add index 2025-03-31 23:59:06 +08:00
Concedo
6a1dd57435 gemma3 template, updated lite, fixed tool calling, reenable ctx shift for gemma3 2025-03-14 17:47:01 +08:00
Concedo
6b7d2349a7 Rewrite history to fix bad vulkan shader commits without increasing repo size
added dpe colab (+8 squashed commit)

Squashed commit:

[b8362da4] updated lite

[ed6c037d] move nsigma into the regular sampler stack

[ac5f61c6] relative filepath fixed

[05fe96ab] export template

[ed0a5a3e] nix_example.md: refactor (#1401)

* nix_example.md: add override example

* nix_example.md: drop graphics example, already basic nixos knowledge

* nix_example.md: format

* nix_example.md: Vulkan is disabled on macOS

Disabled in: 1ccd253acc

* nix_examples.md: nixpkgs.config.cuda{Arches -> Capabilities}

Fixes: https://github.com/LostRuins/koboldcpp/issues/1367

[675c62f7] AutoGuess: Phi 4 (mini) (#1402)

[4bf56982] phrasing

[b8c0df04] Add Rep Pen to Top N Sigma sampler chain (#1397)

- place after nsigma and before xtc (+3 squashed commit)

Squashed commit:

[87c52b97] disable VMM from HIP

[ee8906f3] edit description

[e85c0e69] Remove Unnecessary Rep Counting (#1394)

* stop counting reps

* fix range-based initializer

* strike that - reverse it
2025-03-05 00:02:20 +08:00
Concedo
cca4a934dd fix for chat templates and drafting 2025-01-23 11:49:40 +08:00
kallewoof
1cb9805024
add autoguess adapter for DeepSeek V2.5/R1 (#1329) 2025-01-22 20:39:04 +08:00
kallewoof
23ec550835
PoC: add chat template heuristics (#1283)
* PoC: add chat template heuristics

The fallback chat template adapter of Vicuna is not ideal in some cases (e.g. a test against a sub-portion of the BBC news classification task on Kaggle gave an 82% accuracy with Vicuna and 88% with the official ChatML format for a q4_k_m Qwen 2.5 3B-Instruct gguf).

This PR adds a proof of concept simple heuristic which looks at the chat template and upgrades the adapter when it is able to.

* gemma 2 heuristic

* Phi 4, Llama 3.x heuristics

* better qwen vs generic heuristic

* cleanup

* mistral (generic) heuristic

* fix sys msg for mistral

* phi 3.5

* mistral v3

* cohere (aya expanse 32b based)

* only derive from chat template if AutoGuess

* add notes about alpaca fallbacks

* added AutoGuess.json dummy

* add mistral v7

* switch to using a json list with search strings
2024-12-28 12:15:23 +08:00
Concedo
a577015425 consolidate mistral templates (too many) 2024-12-15 00:03:34 +08:00
pandora
a548108dd2
Create Mistral-V7.json (#1224) 2024-11-19 10:45:50 +08:00
Concedo
c38d1ecc8d update templates, fix rwkv 2024-09-22 01:32:12 +08:00
pandora
d4d19f6dbe
Fixing Mistral Templates (#1131)
* Update and rename Mistral.json to Mistral V1.json

* Create Mistral V2 & V3

* Rename Mistral V2 & V3 to Mistral V2 & V3.json

* Create Mistral V3-Tekken.json
2024-09-22 01:30:17 +08:00
xzuyn
add0a88111
Update kcpp_adapters (#985)
* Update kcpp_adapters

* Update kcpp_adapters
2024-07-13 19:06:15 +08:00
Nexesenex
c9c098dab2
Add mistral chat adapter (#983) 2024-07-09 19:38:35 +08:00
Concedo
5e458f42da update gemma format 2024-07-06 18:54:20 +08:00
Concedo
82202aebda updated lite, add gemma 2 template 2024-07-02 21:02:52 +08:00
henk717
8421243c6d
Chat Adapters (#956)
* Give the CI builds a recognizable AVX1 name

* Chat Adapters
2024-06-30 10:28:43 +08:00
Concedo
73b99a7266 add premade chat completions adapter 2024-06-27 00:13:06 +08:00