koboldcpp/tools/server
Concedo 746664fde6 Merge commit '2cd20b72ed' into concedo_experimental
# Conflicts:
#	CONTRIBUTING.md
#	docs/backend/CANN.md
#	docs/backend/SYCL.md
#	docs/backend/snapdragon/README.md
#	docs/backend/snapdragon/windows.md
#	docs/build.md
#	docs/multimodal/MobileVLM.md
#	docs/ops.md
#	docs/ops/WebGPU.csv
#	examples/debug/README.md
#	examples/llama.vim
#	examples/model-conversion/README.md
#	examples/sycl/README.md
#	ggml/src/ggml-cpu/amx/mmq.cpp
#	ggml/src/ggml-cpu/arch/x86/repack.cpp
#	ggml/src/ggml-hexagon/ggml-hexagon.cpp
#	ggml/src/ggml-hexagon/htp-drv.cpp
#	ggml/src/ggml-hexagon/htp/flash-attn-ops.c
#	ggml/src/ggml-hexagon/htp/hvx-base.h
#	ggml/src/ggml-hexagon/htp/hvx-copy.h
#	ggml/src/ggml-hexagon/htp/hvx-inverse.h
#	ggml/src/ggml-hexagon/htp/hvx-reduce.h
#	ggml/src/ggml-hexagon/htp/matmul-ops.c
#	ggml/src/ggml-hexagon/htp/rope-ops.c
#	ggml/src/ggml-hexagon/htp/worker-pool.c
#	ggml/src/ggml-opencl/ggml-opencl.cpp
#	ggml/src/ggml-opencl/kernels/cpy.cl
#	ggml/src/ggml-sycl/common.hpp
#	ggml/src/ggml-sycl/quants.hpp
#	ggml/src/ggml-sycl/softmax.cpp
#	ggml/src/ggml-vulkan/CMakeLists.txt
#	ggml/src/ggml-webgpu/ggml-webgpu-shader-lib.hpp
#	ggml/src/ggml-webgpu/ggml-webgpu.cpp
#	scripts/pr2wt.sh
#	scripts/server-bench.py
#	scripts/snapdragon/windows/run-cli.ps1
#	tests/test-alloc.cpp
#	tests/test-backend-ops.cpp
#	tests/test-chat.cpp
#	tools/cli/cli.cpp
#	tools/completion/README.md
#	tools/cvector-generator/cvector-generator.cpp
#	tools/imatrix/README.md
#	tools/perplexity/README.md
#	tools/server/public_simplechat/readme.md
#	tools/server/tests/README.md
2026-03-10 22:11:08 +08:00
bench Merge branch 'upstream' into concedo_experimental 2025-08-23 11:35:28 +08:00
public webui: Improvements for Models Selector UI (#20066) 2026-03-05 08:52:22 +01:00
public_legacy chore : correct typos [no ci] (#20041) 2026-03-05 08:50:21 +01:00
public_simplechat Merge commit '2cd20b72ed' into concedo_experimental 2026-03-10 22:11:08 +08:00
tests chore : correct typos [no ci] (#20041) 2026-03-05 08:50:21 +01:00
themes Merge branch 'upstream' into concedo_experimental 2026-02-03 19:00:42 +08:00
webui Merge commit '2cd20b72ed' into concedo_experimental 2026-03-10 22:11:08 +08:00
chat-llama2.sh scripts : make the shell scripts cross-platform (#14341) 2025-06-30 10:17:18 +02:00
chat.mjs
chat.sh scripts : make the shell scripts cross-platform (#14341) 2025-06-30 10:17:18 +02:00
README-dev.md server: add auto-sleep after N seconds of idle (#18228) 2025-12-21 02:24:42 +01:00
server-common.cpp server : support multi-modal context checkpoints (#19849) 2026-02-25 15:14:27 +02:00
server-common.h server : support multi-modal context checkpoints (#19849) 2026-02-25 15:14:27 +02:00
server-context.cpp server : support multiple model aliases via comma-separated --alias (#19926) 2026-02-27 07:05:23 +01:00
server-context.h server: Add pragma once to server-context.h (#19944) 2026-02-27 18:28:36 +01:00
server-http.cpp server: fix query params lost when proxying requests in multi-model router mode (#19854) 2026-02-24 21:46:06 +01:00
server-http.h server: fix query params lost when proxying requests in multi-model router mode (#19854) 2026-02-24 21:46:06 +01:00
server-models.cpp server : support multiple model aliases via comma-separated --alias (#19926) 2026-02-27 07:05:23 +01:00
server-models.h server : support multiple model aliases via comma-separated --alias (#19926) 2026-02-27 07:05:23 +01:00
server-queue.cpp server: improve slots scheduling for n_cmpl (#18789) 2026-01-15 17:10:28 +01:00
server-queue.h server: improve slots scheduling for n_cmpl (#18789) 2026-01-15 17:10:28 +01:00
server-task.cpp server : enable multi-modal prompt caching (#19877) 2026-02-25 15:15:42 +02:00
server-task.h server : support multi-modal context checkpoints (#19849) 2026-02-25 15:14:27 +02:00
server.cpp Fix locale-dependent float printing in GGUF metadata (#17331) 2026-03-04 09:30:40 +01:00