koboldcpp/docs
Ravi Panchumarthy 7e16646015
docs : Update OPENVINO.md (#22959)
Updated OPENVINO.md with Validated models and quantizations

Co-authored-by: Haarika Madaka <haarika.madaka@intel.com>
2026-05-13 17:12:15 +03:00
..
android android: fix missing screenshots for Android.md (#18156) 2025-12-19 09:32:04 +02:00
backend docs : Update OPENVINO.md (#22959) 2026-05-13 17:12:15 +03:00
development docs: more extensive RoPE documentation [no ci] (#21953) 2026-04-15 14:45:16 +02:00
multimodal mtmd : support MiniCPM-V 4.6 (#22529) 2026-05-06 21:54:09 +02:00
ops ggml-webgpu: Enables running gpt-oss-20b (#22906) 2026-05-12 07:27:40 -07:00
android.md android: fix missing screenshots for Android.md (#18156) 2025-12-19 09:32:04 +02:00
autoparser.md common/parser: add proper reasoning tag prefill reading (#20424) 2026-03-19 16:58:21 +01:00
build-riscv64-spacemit.md refactor : remove libcurl, use OpenSSL when available (#18828) 2026-01-14 18:02:47 +01:00
build-s390x.md docs: update s390x build docs (#19643) 2026-02-16 00:33:34 +08:00
build.md CUDA: require explicit opt-in for P2P access (#21910) 2026-04-15 16:01:46 +02:00
docker.md CI : Enable CUDA and Vulkan ARM64 runners and fix CI/CD (#21122) 2026-03-30 20:24:37 +02:00
function-calling.md common : implement new jinja template engine (#18462) 2026-01-16 11:22:06 +01:00
install.md docs : add "Quick start" section for new users (#13862) 2025-06-03 13:09:36 +02:00
llguidance.md llguidance build fixes for Windows (#11664) 2025-02-14 12:46:08 -08:00
multi-gpu.md Write a readme on Multi-GPU usage in llama.cpp (#22729) 2026-05-07 17:48:40 +02:00
multimodal.md docs: listing qwen3-asr and qwen3-omni as supported (#21857) 2026-04-13 22:28:17 +02:00
ops.md ggml-webgpu: Enables running gpt-oss-20b (#22906) 2026-05-12 07:27:40 -07:00
preset.md preset: allow named remote preset (#18728) 2026-01-10 15:12:29 +01:00
speculative.md docs : update speculative decoding parameters after refactor (#22397) (#22539) 2026-05-04 08:52:07 +03:00