koboldcpp/tools/server
Concedo 2e4f94822e Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.github/workflows/build-self-hosted.yml
#	.github/workflows/docker.yml
#	ci/run.sh
#	docs/build.md
#	ggml/src/ggml-webgpu/ggml-webgpu-shader-lib.hpp
#	ggml/src/ggml-webgpu/ggml-webgpu.cpp
#	src/llama-vocab.cpp
#	tests/test-chat.cpp
#	tests/test-jinja.cpp
#	tools/cli/README.md
#	tools/completion/README.md
#	tools/server/README.md
2026-04-04 14:27:23 +08:00
..
bench Merge branch 'upstream' into concedo_experimental 2026-03-22 23:39:13 +08:00
public fix: Use lower-case proxy headers naming (#21235) 2026-03-31 17:47:46 +02:00
public_legacy Autoparser - complete refactoring of parser architecture (#18675) 2026-03-06 21:01:00 +01:00
public_simplechat Merge commit '2cd20b72ed' into concedo_experimental 2026-03-10 22:11:08 +08:00
tests Merge branch 'upstream' into concedo_experimental 2026-04-04 14:27:23 +08:00
themes Merge branch 'upstream' into concedo_experimental 2026-02-03 19:00:42 +08:00
webui Merge branch 'upstream' into concedo_experimental 2026-04-01 10:54:13 +08:00
chat-llama2.sh scripts : make the shell scripts cross-platform (#14341) 2025-06-30 10:17:18 +02:00
chat.mjs llama : move end-user examples to tools directory (#13249) 2025-05-02 20:27:13 +02:00
chat.sh scripts : make the shell scripts cross-platform (#14341) 2025-06-30 10:17:18 +02:00
README-dev.md server: (webui) no more gzip compression (#21073) 2026-03-31 15:44:26 +02:00
server-common.cpp common : inhibit lazy grammar sampler while reasoning is active (#20970) 2026-03-27 18:30:40 +01:00
server-common.h common/parser: add --skip-chat-parsing to force a pure content parser. (#20289) 2026-03-17 16:16:43 +01:00
server-context.cpp server: save and clear idle slots on new task (--clear-idle) (#20993) 2026-04-03 19:02:27 +02:00
server-context.h server: allow router to report child instances sleep status (#20849) 2026-03-22 18:33:52 +01:00
server-cors-proxy.h fix: Use lower-case proxy headers naming (#21235) 2026-03-31 17:47:46 +02:00
server-http.cpp server: Bypass API Key validation for WebUI static bundle assets (#21269) 2026-04-01 21:32:15 +02:00
server-http.h server: wrap headers for mcp proxy (#21072) 2026-03-30 08:59:16 +02:00
server-models.cpp Ignore Transfer-Encoding header. (#20269) 2026-04-02 10:41:19 +02:00
server-models.h server: allow router to report child instances sleep status (#20849) 2026-03-22 18:33:52 +01:00
server-queue.cpp server: improve slots scheduling for n_cmpl (#18789) 2026-01-15 17:10:28 +01:00
server-queue.h server: allow router to report child instances sleep status (#20849) 2026-03-22 18:33:52 +01:00
server-task.cpp server: save and clear idle slots on new task (--clear-idle) (#20993) 2026-04-03 19:02:27 +02:00
server-task.h server: Add cached_tokens info to oaicompat responses (#19361) 2026-03-19 19:09:33 +01:00
server-tools.cpp cli : add /glob command (#21084) 2026-03-28 02:33:04 +01:00
server-tools.h server: add built-in tools backend support (#20898) 2026-03-27 10:07:11 +01:00
server.cpp common : move up common_init() and fix Windows UTF-8 logs (#21176) 2026-03-31 12:53:41 +02:00