koboldcpp/tools/server
Concedo 7c70187e26 Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.github/ISSUE_TEMPLATE/010-bug-compilation.yml
#	.github/ISSUE_TEMPLATE/011-bug-results.yml
#	.github/ISSUE_TEMPLATE/019-bug-misc.yml
#	.github/ISSUE_TEMPLATE/020-enhancement.yml
#	.github/ISSUE_TEMPLATE/030-research.yml
#	.github/ISSUE_TEMPLATE/040-refactor.yml
#	ggml/CMakeLists.txt
#	ggml/src/ggml-cann/ggml-cann.cpp
#	ggml/src/ggml-hexagon/CMakeLists.txt
#	ggml/src/ggml-hexagon/ggml-hexagon.cpp
#	ggml/src/ggml-hexagon/htp/CMakeLists.txt
#	ggml/src/ggml-hexagon/htp/cmake-toolchain.cmake
#	ggml/src/ggml-hexagon/htp/flash-attn-ops.c
#	ggml/src/ggml-hexagon/htp/hex-utils.h
#	ggml/src/ggml-hexagon/htp/hmx-matmul-ops.c
#	ggml/src/ggml-hexagon/htp/hmx-ops.h
#	ggml/src/ggml-hexagon/htp/hmx-utils.h
#	ggml/src/ggml-hexagon/htp/hvx-base.h
#	ggml/src/ggml-hexagon/htp/hvx-copy.h
#	ggml/src/ggml-hexagon/htp/hvx-exp.h
#	ggml/src/ggml-hexagon/htp/unary-ops.c
#	ggml/src/ggml-opencl/CMakeLists.txt
#	ggml/src/ggml-opencl/ggml-opencl.cpp
#	ggml/src/ggml-opencl/kernels/cvt.cl
#	ggml/src/ggml-rpc/ggml-rpc.cpp
#	ggml/src/ggml-sycl/ggml-sycl.cpp
#	ggml/src/ggml-virtgpu/ggml-backend.cpp
#	ggml/src/ggml-webgpu/ggml-webgpu-shader-lib.hpp
#	ggml/src/ggml-webgpu/ggml-webgpu.cpp
#	ggml/src/ggml-webgpu/wgsl-shaders/mul_mat_vec.wgsl
#	ggml/src/ggml-zdnn/ggml-zdnn.cpp
#	ggml/src/ggml-zendnn/ggml-zendnn.cpp
#	scripts/sync-ggml.last
#	tests/test-backend-ops.cpp
2026-05-02 18:07:50 +08:00
..
bench Merge branch 'upstream' into concedo_experimental 2026-03-22 23:39:13 +08:00
public webui: Spring Cleaning Refactor v1 (#22505) 2026-05-01 18:36:29 +02:00
tests Merge commit '52e5f0a5c1' into concedo_experimental 2026-04-29 22:27:36 +08:00
webui Merge branch 'upstream' into concedo_experimental 2026-05-02 18:07:50 +08:00
chat-llama2.sh scripts : make the shell scripts cross-platform (#14341) 2025-06-30 10:17:18 +02:00
chat.mjs llama : move end-user examples to tools directory (#13249) 2025-05-02 20:27:13 +02:00
chat.sh scripts : make the shell scripts cross-platform (#14341) 2025-06-30 10:17:18 +02:00
README-dev.md server: (webui) no more gzip compression (#21073) 2026-03-31 15:44:26 +02:00
server-chat.cpp server: (router) Forward form-data to model server (Fixes #22044) (#22118) 2026-04-27 23:55:00 +02:00
server-chat.h server: (router) Forward form-data to model server (Fixes #22044) (#22118) 2026-04-27 23:55:00 +02:00
server-common.cpp parser: fix structured output bug (#22302) 2026-04-24 23:19:55 +02:00
server-common.h common/chat, server: refactor, move all conversion functions to common, add tests (#20690) 2026-04-22 10:28:45 +02:00
server-context.cpp spec : fix draft model checkpoints (#22521) 2026-04-30 08:32:18 +03:00
server-context.h server : remove /api endpoints (#22165) 2026-04-20 20:41:19 +03:00
server-cors-proxy.h server: (router) Forward form-data to model server (Fixes #22044) (#22118) 2026-04-27 23:55:00 +02:00
server-http.cpp server: (router) Forward form-data to model server (Fixes #22044) (#22118) 2026-04-27 23:55:00 +02:00
server-http.h server: (router) Forward form-data to model server (Fixes #22044) (#22118) 2026-04-27 23:55:00 +02:00
server-models.cpp server: (router) Forward form-data to model server (Fixes #22044) (#22118) 2026-04-27 23:55:00 +02:00
server-models.h server: (router) Forward form-data to model server (Fixes #22044) (#22118) 2026-04-27 23:55:00 +02:00
server-queue.cpp server: improve slots scheduling for n_cmpl (#18789) 2026-01-15 17:10:28 +01:00
server-queue.h server: allow router to report child instances sleep status (#20849) 2026-03-22 18:33:52 +01:00
server-task.cpp spec : refactor params (#22397) 2026-04-28 09:07:33 +03:00
server-task.h server : speculative checkpointing (#19493) 2026-04-19 10:24:06 +03:00
server-tools.cpp cli : add /glob command (#21084) 2026-03-28 02:33:04 +01:00
server-tools.h server: add built-in tools backend support (#20898) 2026-03-27 10:07:11 +01:00
server.cpp fit-params : refactor + add option to output estimated memory per device (#22171) 2026-04-21 09:54:36 +03:00