koboldcpp/tools/server
Concedo 9b0b36b5ef Merge commit '66001722aa' into concedo_experimental
# Conflicts:
#	README.md
#	docs/ops.md
#	docs/ops/SYCL.csv
#	examples/sycl/start-svr.sh
#	ggml/src/ggml-hexagon/ggml-hexagon.cpp
#	ggml/src/ggml-hexagon/htp/CMakeLists.txt
#	ggml/src/ggml-hexagon/htp/htp-ctx.h
#	ggml/src/ggml-hexagon/htp/htp-ops.h
#	ggml/src/ggml-hexagon/htp/main.c
#	ggml/src/ggml-hexagon/htp/unary-ops.c
#	ggml/src/ggml-opencl/CMakeLists.txt
#	ggml/src/ggml-opencl/ggml-opencl.cpp
#	ggml/src/ggml-opencl/kernels/cvt.cl
#	ggml/src/ggml-sycl/gated_delta_net.hpp
#	ggml/src/ggml-sycl/ggml-sycl.cpp
#	ggml/src/ggml-sycl/pad.cpp
#	ggml/src/ggml-sycl/ssm_conv.cpp
#	tests/test-backend-ops.cpp
#	tests/test-reasoning-budget.cpp
#	tools/server/README.md
#	tools/server/webui/src/lib/constants/settings-config.ts
2026-05-11 15:40:10 +08:00
..
bench Merge branch 'upstream' into concedo_experimental 2026-03-22 23:39:13 +08:00
public webui: fix LLM title generation for agentic conversations (#22840) 2026-05-08 16:36:04 +02:00
tests Merge commit '66001722aa' into concedo_experimental 2026-05-11 15:40:10 +08:00
webui Merge commit '66001722aa' into concedo_experimental 2026-05-11 15:40:10 +08:00
chat-llama2.sh scripts : make the shell scripts cross-platform (#14341) 2025-06-30 10:17:18 +02:00
chat.mjs llama : move end-user examples to tools directory (#13249) 2025-05-02 20:27:13 +02:00
chat.sh scripts : make the shell scripts cross-platform (#14341) 2025-06-30 10:17:18 +02:00
README-dev.md server: (webui) no more gzip compression (#21073) 2026-03-31 15:44:26 +02:00
server-chat.cpp server: (router) Forward form-data to model server (Fixes #22044) (#22118) 2026-04-27 23:55:00 +02:00
server-chat.h server: (router) Forward form-data to model server (Fixes #22044) (#22118) 2026-04-27 23:55:00 +02:00
server-common.cpp parser: fix structured output bug (#22302) 2026-04-24 23:19:55 +02:00
server-common.h common/chat, server: refactor, move all conversion functions to common, add tests (#20690) 2026-04-22 10:28:45 +02:00
server-context.cpp server: (router) expose child model info from router's /v1/models (#22683) 2026-05-08 14:42:15 +02:00
server-context.h server: (router) expose child model info from router's /v1/models (#22683) 2026-05-08 14:42:15 +02:00
server-cors-proxy.h server: (router) Forward form-data to model server (Fixes #22044) (#22118) 2026-04-27 23:55:00 +02:00
server-http.cpp server: support Vertex AI compatible API (#22545) 2026-05-08 15:23:04 +02:00
server-http.h server: support Vertex AI compatible API (#22545) 2026-05-08 15:23:04 +02:00
server-models.cpp server: (router) expose child model info from router's /v1/models (#22683) 2026-05-08 14:42:15 +02:00
server-models.h server: (router) expose child model info from router's /v1/models (#22683) 2026-05-08 14:42:15 +02:00
server-queue.cpp server: improve slots scheduling for n_cmpl (#18789) 2026-01-15 17:10:28 +01:00
server-queue.h server: allow router to report child instances sleep status (#20849) 2026-03-22 18:33:52 +01:00
server-task.cpp spec : refactor params (#22397) 2026-04-28 09:07:33 +03:00
server-task.h server : speculative checkpointing (#19493) 2026-04-19 10:24:06 +03:00
server-tools.cpp server : validate --tools CLI argument against known tool names (#22538) 2026-05-05 06:35:27 +03:00
server-tools.h server: add built-in tools backend support (#20898) 2026-03-27 10:07:11 +01:00
server.cpp server: support Vertex AI compatible API (#22545) 2026-05-08 15:23:04 +02:00