From 95feeab52e41ceaf71e87b2dd01895f6d8815b60 Mon Sep 17 00:00:00 2001
From: Jesus Talavera <145992175+jesus-talavera-ibm@users.noreply.github.com>
Date: Fri, 22 May 2026 14:35:46 +0200
Subject: [PATCH] docs: Update documentation with Granite 4.0/4.1 (#23404)

---
 docs/autoparser.md       | 1 +
 docs/function-calling.md | 1 +
 2 files changed, 2 insertions(+)

diff --git a/docs/autoparser.md b/docs/autoparser.md
index adc4d43ed..da2f1a3a0 100644
--- a/docs/autoparser.md
+++ b/docs/autoparser.md
@@ -489,6 +489,7 @@ The following templates have active tests in `tests/test-chat.cpp`:
 | Qwen-QwQ-32B | Reasoning | Forced-open thinking |
 | NousResearch Hermes 2 Pro | JSON_NATIVE | `` wrapper |
 | IBM Granite 3.3 | JSON_NATIVE | `` + `` |
+| IBM Granite 4.0 | JSON_NATIVE | `` wrapper (same template used by 4.1) |
 | ByteDance Seed-OSS | TAG_WITH_TAGGED | Custom `` and `` tags |
 | Qwen3-Coder | TAG_WITH_TAGGED | XML-style tool format |
 | DeepSeek V3.1 | JSON_NATIVE | Forced thinking mode |
diff --git a/docs/function-calling.md b/docs/function-calling.md
index 9ede914c0..850b59ce7 100644
--- a/docs/function-calling.md
+++ b/docs/function-calling.md
@@ -291,6 +291,7 @@ Here are some models known to work (w/ chat template override when needed):
 llama-server --jinja -fa -hf bartowski/Qwen2.5-7B-Instruct-GGUF:Q4_K_M
 llama-server --jinja -fa -hf bartowski/Mistral-Nemo-Instruct-2407-GGUF:Q6_K_L
 llama-server --jinja -fa -hf bartowski/Llama-3.3-70B-Instruct-GGUF:Q4_K_M
+llama-server --jinja -fa -hf ibm-granite/granite-4.1-3b-GGUF:Q4_K_M
 
 # Native support for DeepSeek R1 works best w/ our template override (official template is buggy, although we do work around it)