koboldcpp/tools/server
2026-02-26 16:45:04 +08:00
bench Merge branch 'upstream' into concedo_experimental 2025-08-23 11:35:28 +08:00
public feat: Add code blocks full height setting to parameter sync service (#19835) 2026-02-23 22:30:13 +01:00
public_legacy docs : Minor cleanups (#19252) 2026-02-02 08:38:55 +02:00
public_simplechat Merge branch 'upstream' into concedo_experimental 2025-05-03 12:15:36 +08:00
tests Merge commit '88d23ad515' into concedo_experimental 2026-01-29 22:25:56 +08:00
themes Merge branch 'upstream' into concedo_experimental 2026-02-03 19:00:42 +08:00
webui Merge commit '8c2c0108dd' into concedo_experimental 2026-02-26 16:30:37 +08:00
chat-llama2.sh scripts : make the shell scripts cross-platform (#14341) 2025-06-30 10:17:18 +02:00
chat.mjs llama : move end-user examples to tools directory (#13249) 2025-05-02 20:27:13 +02:00
chat.sh scripts : make the shell scripts cross-platform (#14341) 2025-06-30 10:17:18 +02:00
README-dev.md server: add auto-sleep after N seconds of idle (#18228) 2025-12-21 02:24:42 +01:00
server-common.cpp server : support multi-modal context checkpoints (#19849) 2026-02-25 15:14:27 +02:00
server-common.h server : support multi-modal context checkpoints (#19849) 2026-02-25 15:14:27 +02:00
server-context.cpp server : enable multi-modal prompt caching (#19877) 2026-02-25 15:15:42 +02:00
server-context.h cli : provide model with text filename (#19783) 2026-02-22 22:33:49 +01:00
server-http.cpp server: fix query params lost when proxying requests in multi-model router mode (#19854) 2026-02-24 21:46:06 +01:00
server-http.h server: fix query params lost when proxying requests in multi-model router mode (#19854) 2026-02-24 21:46:06 +01:00
server-models.cpp server: fix query params lost when proxying requests in multi-model router mode (#19854) 2026-02-24 21:46:06 +01:00
server-models.h server : fix router child env in containerized environments (#18562) 2026-01-05 14:12:05 +01:00
server-queue.cpp server: improve slots scheduling for n_cmpl (#18789) 2026-01-15 17:10:28 +01:00
server-queue.h server: improve slots scheduling for n_cmpl (#18789) 2026-01-15 17:10:28 +01:00
server-task.cpp server : enable multi-modal prompt caching (#19877) 2026-02-25 15:15:42 +02:00
server-task.h server : support multi-modal context checkpoints (#19849) 2026-02-25 15:14:27 +02:00
server.cpp server: /v1/responses (partial) (#18486) 2026-01-21 17:47:23 +01:00