koboldcpp

mirror of https://github.com/LostRuins/koboldcpp.git synced 2026-04-28 11:40:43 +00:00

History

Zijun Yu 52f1096f21 openvino: driver setup, CI split, thread safety, and NPU optimizations (#21944 ) * Thread safety per request only * Fix ROPE yarn case * Fix sticky stateful config * Use i4/i8 directly for symmetric quant * Use weightless caching * Add WeightlessCacheAttribute to reduce NPU memory usage * Gelu tanh support (#125) * Imrope support (#126) * fix(openvino): explicit ov::Tensor frees in ggml_backend_openvino_free * add GPU,NPU support in OV Dockerfile * add build-openvino.yml ci * Fix sticky stateful config * add concurrency to ov-gpu ci runs. Move OV CI to build-openvino.yml * fix thread-safety of shared runtime context * rope type abstraction for frontend translations * fix editorconfig --------- Co-authored-by: Mustafa Cavus <mustafa.cavus@intel.com> Co-authored-by: Dan Hoffman <dhoff749@gmail.com> Co-authored-by: Ravi Panchumarthy <ravi.panchumarthy@intel.com>		2026-04-21 18:58:34 +03:00
..
nix	devops : added spirv-headers to nix (#21965 )	2026-04-16 11:12:52 +03:00
cann.Dockerfile	CANN: update docker images to 8.5.0 and improve CANN.md (#20801 )	2026-03-27 08:53:00 +08:00
cpu.Dockerfile	CI : Enable CUDA and Vulkan ARM64 runners and fix CI/CD (#21122 )	2026-03-30 20:24:37 +02:00
cuda.Dockerfile	CI : Enable CUDA and Vulkan ARM64 runners and fix CI/CD (#21122 )	2026-03-30 20:24:37 +02:00
intel.Dockerfile	CI : Enable CUDA and Vulkan ARM64 runners and fix CI/CD (#21122 )	2026-03-30 20:24:37 +02:00
llama-cli-cann.Dockerfile	CANN: update docker images to 8.5.0 and improve CANN.md (#20801 )	2026-03-27 08:53:00 +08:00
llama-cpp-cuda.srpm.spec	CLI: fixed adding cli and completion into docker containers, improved docs (#18003 )	2025-12-16 11:52:23 +01:00
llama-cpp.srpm.spec	CLI: fixed adding cli and completion into docker containers, improved docs (#18003 )	2025-12-16 11:52:23 +01:00
musa.Dockerfile	CI : Enable CUDA and Vulkan ARM64 runners and fix CI/CD (#21122 )	2026-03-30 20:24:37 +02:00
openvino.Dockerfile	openvino: driver setup, CI split, thread safety, and NPU optimizations (#21944 )	2026-04-21 18:58:34 +03:00
rocm.Dockerfile	[HIP] Bump ROCm version to 7.2.1 (#21066 )	2026-04-03 00:59:20 +02:00
s390x.Dockerfile	refactor : remove libcurl, use OpenSSL when available (#18828 )	2026-01-14 18:02:47 +01:00
tools.sh	docker : include legacy llama-completion binary (#17964 )	2025-12-12 19:39:23 +01:00
vulkan.Dockerfile	vulkan: Programmatically add RoundingModeRTE to all shaders when the device supports it (#21572 )	2026-04-14 15:17:45 +02:00