koboldcpp/tools/server
Concedo fecf2dc3fa Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.github/workflows/server-self-hosted.yml
#	CMakeLists.txt
#	CODEOWNERS
#	ci/run.sh
#	cmake/llama-config.cmake.in
#	common/chat.cpp
#	examples/sycl/start-svr.sh
#	examples/sycl/test.sh
#	examples/sycl/win-start-svr.bat
#	examples/sycl/win-test.bat
#	ggml/src/ggml-sycl/ggml-sycl.cpp
#	ggml/src/ggml-sycl/vecdotq.hpp
#	ggml/src/ggml-vulkan/CMakeLists.txt
#	scripts/wc2wt.sh
#	tests/test-backend-ops.cpp
#	tests/test-chat.cpp
2026-05-18 21:27:23 +08:00
..
bench Merge branch 'upstream' into concedo_experimental 2026-03-22 23:39:13 +08:00
tests Merge branch 'upstream' into concedo_experimental 2026-05-18 21:27:23 +08:00
chat-llama2.sh scripts : make the shell scripts cross-platform (#14341) 2025-06-30 10:17:18 +02:00
chat.mjs
chat.sh scripts : make the shell scripts cross-platform (#14341) 2025-06-30 10:17:18 +02:00
README-dev.md ui: Restructure repo to use tools/ui folder and ui / UI / llama-ui / LLAMA_UI naming (#23064) 2026-05-16 02:02:40 +02:00
server-chat.cpp Support for Codex CLI by skipping unsupported Responses tools (#23041) 2026-05-15 09:03:24 +02:00
server-chat.h server: (router) Forward form-data to model server (Fixes #22044) (#22118) 2026-04-27 23:55:00 +02:00
server-common.cpp common : delegate assistant continuation to underlying template handlers (#23089) 2026-05-17 13:36:05 +02:00
server-common.h logs : reduce (#23021) 2026-05-14 13:05:52 +03:00
server-context.cpp llama: avoid copying logits during prompt decode in MTP (#23198) 2026-05-17 23:30:25 +08:00
server-context.h ui: Restructure repo to use tools/ui folder and ui / UI / llama-ui / LLAMA_UI naming (#23064) 2026-05-16 02:02:40 +02:00
server-cors-proxy.h server: (router) Forward form-data to model server (Fixes #22044) (#22118) 2026-04-27 23:55:00 +02:00
server-http.cpp cmake : fix LLAMA_BUILD_UI logic (#23190) 2026-05-17 14:42:26 -04:00
server-http.h server: support Vertex AI compatible API (#22545) 2026-05-08 15:23:04 +02:00
server-models.cpp server: (router) alloc tmp buffer on heap (#23159) 2026-05-16 23:42:16 +02:00
server-models.h ui: Restructure repo to use tools/ui folder and ui / UI / llama-ui / LLAMA_UI naming (#23064) 2026-05-16 02:02:40 +02:00
server-queue.cpp server : print warning when HTTP timeout exceeded (#22907) 2026-05-10 22:00:18 +03:00
server-queue.h server: allow router to report child instances sleep status (#20849) 2026-03-22 18:33:52 +01:00
server-task.cpp common : delegate assistant continuation to underlying template handlers (#23089) 2026-05-17 13:36:05 +02:00
server-task.h common : delegate assistant continuation to underlying template handlers (#23089) 2026-05-17 13:36:05 +02:00
server-tools.cpp server : validate --tools CLI argument against known tool names (#22538) 2026-05-05 06:35:27 +03:00
server-tools.h server: add built-in tools backend support (#20898) 2026-03-27 10:07:11 +01:00
server.cpp server: skip device enumeration in router mode to avoid creating CUDA primary context (#23137) 2026-05-16 21:21:06 +02:00