koboldcpp

mirror of https://github.com/LostRuins/koboldcpp.git synced 2026-05-11 04:51:25 +00:00

History

Concedo 754fef5204 Merge branch 'upstream' into concedo_experimental # Conflicts: # .devops/cuda.Dockerfile # .devops/musa.Dockerfile # .github/workflows/build.yml # README.md # docs/docker.md # examples/imatrix/imatrix.cpp # examples/llama-bench/llama-bench.cpp # examples/main/README.md # examples/perplexity/perplexity.cpp # examples/server/README.md # ggml/src/ggml-cpu/ggml-cpu.c # ggml/src/ggml-cuda/CMakeLists.txt # models/templates/deepseek-ai-DeepSeek-R1-Distill-Llama-8B.jinja # models/templates/deepseek-ai-DeepSeek-R1-Distill-Qwen-32B.jinja # scripts/get_chat_template.py # scripts/sync-ggml.last # tests/test-chat.cpp # tests/test-gguf.cpp # tests/test-sampling.cpp		2025-02-15 00:49:46 +08:00
..
unit	`server`: fix tool-call of DeepSeek R1 Qwen, return reasoning_content (Command 7RB & DeepSeek R1) unless `--reasoning-format none` (#11607 )	2025-02-13 10:05:16 +00:00
.gitignore	server : replace behave with pytest (#10416 )	2024-11-26 16:20:18 +01:00
conftest.py	server : replace behave with pytest (#10416 )	2024-11-26 16:20:18 +01:00
pytest.ini	Tool call support (generic + native for Llama, Functionary, Hermes, Mistral, Firefunction, DeepSeek) w/ lazy grammars (#9639 )	2025-01-30 19:13:58 +00:00
requirements.txt	server : allow using LoRA adapters per-request (#10994 )	2025-01-02 15:05:18 +01:00
tests.sh	Tool call support (generic + native for Llama, Functionary, Hermes, Mistral, Firefunction, DeepSeek) w/ lazy grammars (#9639 )	2025-01-30 19:13:58 +00:00
utils.py	`server`: fix tool-call of DeepSeek R1 Qwen, return reasoning_content (Command 7RB & DeepSeek R1) unless `--reasoning-format none` (#11607 )	2025-02-13 10:05:16 +00:00