koboldcpp/tools/server
Concedo e8e7c357c9 Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.github/workflows/build-cache.yml
#	.github/workflows/build-cmake-pkg.yml
#	.github/workflows/build-linux-cross.yml
#	.github/workflows/build.yml
#	.github/workflows/check-vendor.yml
#	.github/workflows/close-issue.yml
#	.github/workflows/copilot-setup-steps.yml
#	.github/workflows/docker.yml
#	.github/workflows/editorconfig.yml
#	.github/workflows/gguf-publish.yml
#	.github/workflows/labeler.yml
#	.github/workflows/pre-tokenizer-hashes.yml
#	.github/workflows/python-check-requirements.yml
#	.github/workflows/python-lint.yml
#	.github/workflows/python-type-check.yml
#	.github/workflows/release.yml
#	.github/workflows/server-webui.yml
#	.github/workflows/server.yml
#	.github/workflows/update-ops-docs.yml
#	.github/workflows/winget.yml
#	ggml/src/ggml-opencl/CMakeLists.txt
#	ggml/src/ggml-opencl/ggml-opencl.cpp
#	ggml/src/ggml-zdnn/ggml-zdnn.cpp
#	requirements/requirements-tool_bench.txt
#	src/CMakeLists.txt
#	src/llama-quant.cpp
#	tests/test-backend-ops.cpp
#	tests/test-chat.cpp
#	tools/cli/cli.cpp
#	tools/server/README.md
2026-01-23 14:27:04 +08:00
..
bench Merge branch 'upstream' into concedo_experimental 2025-08-23 11:35:28 +08:00
public fix: Use tabular-nums for chat message statistics (#18915) 2026-01-21 18:46:01 +01:00
public_legacy common : fix json schema with '\' in literals (#17307) 2025-11-29 17:06:32 +01:00
public_simplechat Merge branch 'upstream' into concedo_experimental 2025-05-03 12:15:36 +08:00
tests Merge branch 'upstream' into concedo_experimental 2026-01-23 14:27:04 +08:00
themes Merge branch 'upstream' into concedo_experimental 2025-05-03 12:15:36 +08:00
webui Merge branch 'upstream' into concedo_experimental 2026-01-23 14:27:04 +08:00
chat-llama2.sh scripts : make the shell scripts cross-platform (#14341) 2025-06-30 10:17:18 +02:00
chat.mjs llama : move end-user examples to tools directory (#13249) 2025-05-02 20:27:13 +02:00
chat.sh scripts : make the shell scripts cross-platform (#14341) 2025-06-30 10:17:18 +02:00
README-dev.md server: add auto-sleep after N seconds of idle (#18228) 2025-12-21 02:24:42 +01:00
server-common.cpp server: /v1/responses (partial) (#18486) 2026-01-21 17:47:23 +01:00
server-common.h server: /v1/responses (partial) (#18486) 2026-01-21 17:47:23 +01:00
server-context.cpp server : support preserving reasoning_content in assistant message (#18994) 2026-01-22 21:30:06 +01:00
server-context.h server : support preserving reasoning_content in assistant message (#18994) 2026-01-22 21:30:06 +01:00
server-http.cpp server: do not log certain endpoints (avoid log spam) (#19028) 2026-01-22 19:24:37 +01:00
server-http.h server: split HTTP into its own interface (#17216) 2025-11-17 22:05:44 +01:00
server-models.cpp server : fix router child env in containerized environments (#18562) 2026-01-05 14:12:05 +01:00
server-models.h server : fix router child env in containerized environments (#18562) 2026-01-05 14:12:05 +01:00
server-queue.cpp server: improve slots scheduling for n_cmpl (#18789) 2026-01-15 17:10:28 +01:00
server-queue.h server: improve slots scheduling for n_cmpl (#18789) 2026-01-15 17:10:28 +01:00
server-task.cpp server : support preserving reasoning_content in assistant message (#18994) 2026-01-22 21:30:06 +01:00
server-task.h server: /v1/responses (partial) (#18486) 2026-01-21 17:47:23 +01:00
server.cpp server: /v1/responses (partial) (#18486) 2026-01-21 17:47:23 +01:00