koboldcpp/docs
Alexey Kopytko 581d020b12
SYCL: implement ggml_sycl_pool_vmm (#22862)
* SYCL: implement ggml_sycl_pool_vmm

* Add an option to bypass VMM with GGML_SYCL_DISABLE_VMM

* Clean up debugging logging

* document GGML_SYCL_DISABLE_VMM

* Multi-stream MoE optimization

* Revert "Multi-stream MoE optimization"

This reverts commit 938929c3f13a562ec67c59e87cc5d38595444cce.

* Update common.hpp

Co-authored-by: Neo Zhang <zhang.jianyu@outlook.com>

* Flip GGML_SYCL_DISABLE_VMM to GGML_SYCL_ENABLE_VMM

* add logging for GGML_SYCL_ENABLE_VMM when extension is not available (SYCL_EXT_ONEAPI_VIRTUAL_MEM macro)

* Apply suggestions from code review

Co-authored-by: Alexey Kopytko <alexey@kopytko.com>

* Apply suggestion from @sanmai

* Apply suggestion from @sanmai

---------

Co-authored-by: Neo Zhang <zhang.jianyu@outlook.com>
2026-05-26 07:59:00 +03:00
android | android: fix missing screenshots for Android.md (#18156) | 2025-12-19 09:32:04 +02:00
backend | SYCL: implement ggml_sycl_pool_vmm (#22862) | 2026-05-26 07:59:00 +03:00
development | docs: more extensive RoPE documentation [no ci] (#21953) | 2026-04-15 14:45:16 +02:00
multimodal | mtmd : support MiniCPM-V 4.6 (#22529) | 2026-05-06 21:54:09 +02:00
ops | ggml-webgpu: Enables running gpt-oss-20b (#22906) | 2026-05-12 07:27:40 -07:00
android.md | android: fix missing screenshots for Android.md (#18156) | 2025-12-19 09:32:04 +02:00
autoparser.md | docs: Update documentation with Granite 4.0/4.1 (#23404) | 2026-05-22 20:35:46 +08:00
build-riscv64-spacemit.md | ci : update spacemit toolchain url and enhance curl command (#23642) | 2026-05-25 10:43:24 +02:00
build-s390x.md | docs: update s390x build docs (#19643) | 2026-02-16 00:33:34 +08:00
build.md | Update WebGPU support and add link to blog/demo (#23483) | 2026-05-21 11:00:27 -07:00
docker.md | CI : Enable CUDA and Vulkan ARM64 runners and fix CI/CD (#21122) | 2026-03-30 20:24:37 +02:00
function-calling.md | docs: Update documentation with Granite 4.0/4.1 (#23404) | 2026-05-22 20:35:46 +08:00
install.md | docs : add "Quick start" section for new users (#13862) | 2025-06-03 13:09:36 +02:00
llguidance.md | llguidance build fixes for Windows (#11664) | 2025-02-14 12:46:08 -08:00
multi-gpu.md | Write a readme on Multi-GPU usage in llama.cpp (#22729) | 2026-05-07 17:48:40 +02:00
multimodal.md | docs: listing qwen3-asr and qwen3-omni as supported (#21857) | 2026-04-13 22:28:17 +02:00
ops.md | ggml-webgpu: Enables running gpt-oss-20b (#22906) | 2026-05-12 07:27:40 -07:00
preset.md | preset: allow named remote preset (#18728) | 2026-01-10 15:12:29 +01:00
speculative.md | doc: fix spec mtp typo (#23435) | 2026-05-21 09:30:55 +03:00