koboldcpp

mirror of https://github.com/LostRuins/koboldcpp.git synced 2026-05-22 19:47:49 +00:00

History

Concedo fecf2dc3fa Merge branch 'upstream' into concedo_experimental # Conflicts: # .github/workflows/server-self-hosted.yml # CMakeLists.txt # CODEOWNERS # ci/run.sh # cmake/llama-config.cmake.in # common/chat.cpp # examples/sycl/start-svr.sh # examples/sycl/test.sh # examples/sycl/win-start-svr.bat # examples/sycl/win-test.bat # ggml/src/ggml-sycl/ggml-sycl.cpp # ggml/src/ggml-sycl/vecdotq.hpp # ggml/src/ggml-vulkan/CMakeLists.txt # scripts/wc2wt.sh # tests/test-backend-ops.cpp # tests/test-chat.cpp		2026-05-18 21:27:23 +08:00
..
bench	Merge branch 'upstream' into concedo_experimental	2026-03-22 23:39:13 +08:00
tests	Merge branch 'upstream' into concedo_experimental	2026-05-18 21:27:23 +08:00
chat-llama2.sh	scripts : make the shell scripts cross-platform (#14341 )	2025-06-30 10:17:18 +02:00
chat.mjs
chat.sh	scripts : make the shell scripts cross-platform (#14341 )	2025-06-30 10:17:18 +02:00
README-dev.md	ui: Restructure repo to use `tools/ui` folder and `ui` / `UI` / `llama-ui` / `LLAMA_UI` naming (#23064 )	2026-05-16 02:02:40 +02:00
server-chat.cpp	Support for Codex CLI by skipping unsupported Responses tools (#23041 )	2026-05-15 09:03:24 +02:00
server-chat.h	server: (router) Forward form-data to model server (Fixes #22044 ) (#22118 )	2026-04-27 23:55:00 +02:00
server-common.cpp	common : delegate assistant continuation to underlying template handlers (#23089 )	2026-05-17 13:36:05 +02:00
server-common.h	logs : reduce (#23021 )	2026-05-14 13:05:52 +03:00
server-context.cpp	llama: avoid copying logits during prompt decode in MTP (#23198 )	2026-05-17 23:30:25 +08:00
server-context.h	ui: Restructure repo to use `tools/ui` folder and `ui` / `UI` / `llama-ui` / `LLAMA_UI` naming (#23064 )	2026-05-16 02:02:40 +02:00
server-cors-proxy.h	server: (router) Forward form-data to model server (Fixes #22044 ) (#22118 )	2026-04-27 23:55:00 +02:00
server-http.cpp	cmake : fix LLAMA_BUILD_UI logic (#23190 )	2026-05-17 14:42:26 -04:00
server-http.h	server: support Vertex AI compatible API (#22545 )	2026-05-08 15:23:04 +02:00
server-models.cpp	server: (router) alloc tmp buffer on heap (#23159 )	2026-05-16 23:42:16 +02:00
server-models.h	ui: Restructure repo to use `tools/ui` folder and `ui` / `UI` / `llama-ui` / `LLAMA_UI` naming (#23064 )	2026-05-16 02:02:40 +02:00
server-queue.cpp	server : print warning when HTTP timeout exceeded (#22907 )	2026-05-10 22:00:18 +03:00
server-queue.h	server: allow router to report child instances sleep status (#20849 )	2026-03-22 18:33:52 +01:00
server-task.cpp	common : delegate assistant continuation to underlying template handlers (#23089 )	2026-05-17 13:36:05 +02:00
server-task.h	common : delegate assistant continuation to underlying template handlers (#23089 )	2026-05-17 13:36:05 +02:00
server-tools.cpp	server : validate --tools CLI argument against known tool names (#22538 )	2026-05-05 06:35:27 +03:00
server-tools.h	server: add built-in tools backend support (#20898 )	2026-03-27 10:07:11 +01:00
server.cpp	server: skip device enumeration in router mode to avoid creating CUDA primary context (#23137 )	2026-05-16 21:21:06 +02:00