koboldcpp/tools
Concedo da2cc90723 Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.github/labeler.yml
#	.github/workflows/build-and-test-snapdragon.yml
#	.github/workflows/build-self-hosted.yml
#	.github/workflows/release.yml
#	.github/workflows/server-self-hosted.yml
#	.github/workflows/server-webui.yml
#	.github/workflows/server.yml
#	.gitignore
#	CMakeLists.txt
#	CONTRIBUTING.md
#	README.md
#	ggml/src/ggml-cuda/fattn.cu
#	ggml/src/ggml-hexagon/htp/cpy-ops.c
#	ggml/src/ggml-webgpu/ggml-webgpu-shader-lib.hpp
#	ggml/src/ggml-webgpu/ggml-webgpu.cpp
#	grammars/README.md
#	scripts/snapdragon/qdc/run_qdc_jobs.py
#	scripts/snapdragon/qdc/tests/run_backend_ops_posix.py
#	scripts/snapdragon/qdc/tests/run_bench_tests_posix.py
#	scripts/snapdragon/qdc/tests/utils.py
#	tests/test-backend-ops.cpp
#	tests/test-chat.cpp
#	tools/server/CMakeLists.txt
#	tools/server/README.md
#	tools/server/webui/src/lib/components/app/server/ServerLoadingSplash.svelte
#	tools/server/webui/src/routes/(chat)/chat/[id]/+page.svelte
#	ty.toml
2026-05-15 17:09:48 +08:00
Name | Last commit | Date
completion | spec : update CLI arguments for better consistency (#22964) | 2026-05-13 09:15:39 +03:00
fit-params | fit-params : refactor + add option to output estimated memory per device (#22171) | 2026-04-21 09:54:36 +03:00
gguf-split | Merge branch 'upstream' into concedo_experimental | 2026-04-17 22:37:37 +08:00
mtmd | Merge branch 'upstream' into concedo_experimental | 2026-05-14 19:04:04 +08:00
parser | libs : rename libcommon -> libllama-common (#21936) | 2026-04-17 11:11:46 +03:00
quantize | Merge branch 'upstream' into concedo_experimental | 2026-04-17 22:37:37 +08:00
rpc | reinstate rpc files | 2026-05-12 21:41:10 +08:00
server | Merge branch 'upstream' into concedo_experimental | 2026-05-15 17:09:48 +08:00
tts | Merge branch 'upstream' into concedo_experimental | 2026-05-14 19:04:04 +08:00
kcpplauncherhook.py | Switched VS2019 for revert cu12.1 build, hopefully solves dll issues | 2025-06-10 23:08:02 +08:00
quantclip.cpp | fixed autoguess adapters, fixed tool builds | 2025-05-13 19:38:56 +08:00