koboldcpp/tools/server
Concedo 983baac46b Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.devops/vulkan.Dockerfile
#	.github/workflows/build.yml
#	ci/run.sh
#	examples/model-conversion/Makefile
#	examples/model-conversion/README.md
#	examples/model-conversion/scripts/causal/compare-logits.py
#	examples/model-conversion/scripts/embedding/run-converted-model.sh
#	examples/model-conversion/scripts/utils/common.py
#	examples/model-conversion/scripts/utils/semantic_check.py
#	ggml/src/ggml-cann/ggml-cann.cpp
#	ggml/src/ggml-cuda/CMakeLists.txt
#	ggml/src/ggml-opencl/CMakeLists.txt
#	ggml/src/ggml-opencl/ggml-opencl.cpp
#	ggml/src/ggml-sycl/ggml-sycl.cpp
#	ggml/src/ggml-webgpu/ggml-webgpu.cpp
#	scripts/pr2wt.sh
#	scripts/sync_vendor.py
#	tests/test-arg-parser.cpp
2026-01-09 01:23:10 +08:00
bench Merge branch 'upstream' into concedo_experimental 2025-08-23 11:35:28 +08:00
public sampling : add support for backend sampling (#17004) 2026-01-04 22:22:16 +02:00
public_legacy common : fix json schema with '\' in literals (#17307) 2025-11-29 17:06:32 +01:00
public_simplechat Merge branch 'upstream' into concedo_experimental 2025-05-03 12:15:36 +08:00
tests Merge commit '56d2fed2b3' into concedo_experimental 2026-01-09 00:30:53 +08:00
themes Merge branch 'upstream' into concedo_experimental 2025-05-03 12:15:36 +08:00
webui Merge commit '67e3f6f601' into concedo_experimental 2026-01-05 20:52:20 +08:00
chat-llama2.sh scripts : make the shell scripts cross-platform (#14341) 2025-06-30 10:17:18 +02:00
chat.mjs llama : move end-user examples to tools directory (#13249) 2025-05-02 20:27:13 +02:00
chat.sh scripts : make the shell scripts cross-platform (#14341) 2025-06-30 10:17:18 +02:00
README-dev.md server: add auto-sleep after N seconds of idle (#18228) 2025-12-21 02:24:42 +01:00
server-common.cpp vendor : update cpp-httplib to 0.30.0 (#18660) 2026-01-08 13:53:54 +01:00
server-common.h server: prevent data race from HTTP threads (#18263) 2025-12-22 14:23:34 +01:00
server-context.cpp model : add LFM2-ColBert-350M (#18607) 2026-01-05 19:52:56 +01:00
server-context.h server: prevent data race from HTTP threads (#18263) 2025-12-22 14:23:34 +01:00
server-http.cpp server: prevent data race from HTTP threads (#18263) 2025-12-22 14:23:34 +01:00
server-http.h server: split HTTP into its own interface (#17216) 2025-11-17 22:05:44 +01:00
server-models.cpp server : fix router child env in containerized environments (#18562) 2026-01-05 14:12:05 +01:00
server-models.h server : fix router child env in containerized environments (#18562) 2026-01-05 14:12:05 +01:00
server-queue.cpp server: prevent data race from HTTP threads (#18263) 2025-12-22 14:23:34 +01:00
server-queue.h server: prevent data race from HTTP threads (#18263) 2025-12-22 14:23:34 +01:00
server-task.cpp server : add thinking content blocks to Anthropic Messages API (#18551) 2026-01-06 16:17:13 +01:00
server-task.h server : add thinking content blocks to Anthropic Messages API (#18551) 2026-01-06 16:17:13 +01:00
server.cpp server : fix router child env in containerized environments (#18562) 2026-01-05 14:12:05 +01:00