koboldcpp

mirror of https://github.com/LostRuins/koboldcpp.git synced 2025-09-11 17:44:38 +00:00

History

Concedo 6d7ef10671 Merge branch 'upstream' into concedo_experimental Renable qwen2vl GPU for vulkan https://github.com/ggml-org/llama.cpp/pull/11902 # Conflicts: # .github/workflows/build.yml # .github/workflows/docker.yml # .gitignore # CONTRIBUTING.md # Makefile # common/CMakeLists.txt # common/arg.cpp # common/common.cpp # examples/main/main.cpp # examples/run/run.cpp # examples/server/tests/README.md # ggml/src/ggml-cuda/mma.cuh # scripts/get_chat_template.py # tests/test-backend-ops.cpp # tests/test-chat-template.cpp # tests/test-chat.cpp		2025-02-20 23:17:20 +08:00
..
unit	tool-call: refactor common chat / tool-call api (+ tests / fixes) (#11900 )	2025-02-18 18:03:23 +00:00
.gitignore	server : replace behave with pytest (#10416 )	2024-11-26 16:20:18 +01:00
conftest.py	server : replace behave with pytest (#10416 )	2024-11-26 16:20:18 +01:00
pytest.ini	Tool call support (generic + native for Llama, Functionary, Hermes, Mistral, Firefunction, DeepSeek) w/ lazy grammars (#9639 )	2025-01-30 19:13:58 +00:00
requirements.txt	server : allow using LoRA adapters per-request (#10994 )	2025-01-02 15:05:18 +01:00
tests.sh	Tool call support (generic + native for Llama, Functionary, Hermes, Mistral, Firefunction, DeepSeek) w/ lazy grammars (#9639 )	2025-01-30 19:13:58 +00:00
utils.py	`server`: fix tool-call of DeepSeek R1 Qwen, return reasoning_content (Command 7RB & DeepSeek R1) unless `--reasoning-format none` (#11607 )	2025-02-13 10:05:16 +00:00