Commit graph

1239 commits

Author SHA1 Message Date
Concedo
62bea5ef4f allow overriding the devices directly 2026-01-17 19:08:06 +08:00
Concedo
8855a7f52b Merge commit 'c945aaaef2' into concedo_experimental
# Conflicts:
#	.devops/cann.Dockerfile
#	.github/workflows/build.yml
#	.github/workflows/release.yml
#	README.md
#	common/CMakeLists.txt
#	common/chat.cpp
#	docs/function-calling.md
#	ggml/src/ggml-cann/aclnn_ops.cpp
#	ggml/src/ggml-cann/aclnn_ops.h
#	ggml/src/ggml-cann/common.h
#	ggml/src/ggml-cann/ggml-cann.cpp
#	models/templates/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16.jinja
#	scripts/sync_vendor.py
#	tests/CMakeLists.txt
#	tests/peg-parser/tests.h
#	tests/test-chat-peg-parser.cpp
#	tests/test-chat-template.cpp
#	tests/test-chat.cpp
#	tests/testing.h
#	tools/llama-bench/llama-bench.cpp
2026-01-17 10:24:03 +08:00
Concedo
a5204d2363 fixed mcp command location 2026-01-17 00:09:46 +08:00
Concedo
c332bb614c better mcp error messages 2026-01-16 17:55:34 +08:00
Concedo
612c19afe7 interrogate max length increased 2026-01-13 11:06:05 +08:00
Concedo
3752040165 default to continue assistant turns 2026-01-12 23:12:27 +08:00
Concedo
fc51d8b216 fix prop type for tools 2026-01-12 18:28:00 +08:00
Concedo
4bf6d9eb9a trying with fa on by default 2026-01-07 11:38:45 +08:00
Concedo
3108fe740c Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	docs/ops.md
#	docs/ops/WebGPU.csv
#	examples/model-conversion/logits.cpp
#	examples/retrieval/retrieval.cpp
#	ggml/src/ggml-cann/ggml-cann.cpp
#	ggml/src/ggml-webgpu/ggml-webgpu.cpp
#	ggml/src/ggml-webgpu/wgsl-shaders/unary_op.wgsl
2026-01-06 20:49:01 +08:00
Concedo
bd51d775be Merge branch 'concedo' into concedo_experimental 2026-01-05 21:04:42 +08:00
Concedo
1fc405b8b6 1.105.4 2026-01-05 21:01:42 +08:00
Concedo
c9308570b2 added mcp to list of capabilities, allow it to run standalone 2026-01-05 20:32:25 +08:00
Concedo
301a04adfc Merge branch 'concedo' into concedo_experimental 2026-01-05 15:24:43 +08:00
Concedo
9a4eeafbfc hotfix 1.105.3 2026-01-05 15:24:21 +08:00
Concedo
4d3866a016 mcp proxy is done 2026-01-05 12:24:43 +08:00
Concedo
91089ad1bd wip on mcp 2026-01-04 22:52:47 +08:00
Concedo
01c70a7d3d allow transcribe to be used with the LLM instead if no whisper model exists 2026-01-04 11:06:05 +08:00
Concedo
e4abf643fa Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	ggml/src/ggml-hexagon/htp/act-ops.c
#	ggml/src/ggml-rpc/ggml-rpc.cpp
#	src/CMakeLists.txt
#	src/llama-vocab.cpp
2026-01-03 15:37:30 +08:00
Concedo
77082dddfb mcp image handling 2026-01-03 00:03:05 +08:00
Concedo
d8942cde14 smartcache allow custom number of slots 2026-01-02 17:19:40 +08:00
Concedo
0a23388e7d added images in tool call queries 2026-01-02 10:48:34 +08:00
Concedo
442fa7cd7c support for circular textures in sdcpp 2026-01-01 16:34:09 +08:00
Concedo
03df0c40f3 if gendefaults is set, horde has debug flag 2026-01-01 00:54:57 +08:00
Concedo
329c0e7e32 mini qol to prevent fake tool calls 2025-12-29 17:54:27 +08:00
Concedo
58d8635827 fixed autofit 2025-12-28 23:15:06 +08:00
Concedo
07fb18a04b handle case differences 2025-12-28 21:41:56 +08:00
Concedo
21d801f6d5 init total weight for adaptive p 2025-12-28 15:33:06 +08:00
Concedo
ec95655f3c fixed default handling for special keys 2025-12-28 13:56:05 +08:00
Concedo
27261bfc26 adaptive decay as an overridable param (+1 squashed commits)
Squashed commits:

[d94df7843] adaptive decay as an overridable param
2025-12-28 13:34:20 +08:00
Concedo
1051313cb2 added deprecated item sdgendefaults (+1 squashed commits)
Squashed commits:

[efc14a5d9] fixed sd error
2025-12-27 22:47:43 +08:00
Concedo
f5282e114d allow ANY api field to have specified defaults, and to be overwritten by value specified at load time 2025-12-27 18:57:04 +08:00
Concedo
6548645aaa rename power law sampler to adaptive p 2025-12-27 17:50:58 +08:00
Concedo
91d8863f18 power law sampler added 2025-12-27 09:46:06 +08:00
Concedo
399fc9c57e rename tokens tab to context, move fa to hardware 2025-12-26 00:06:07 +08:00
Concedo
cf4201e213 wip power law sampling 2025-12-25 22:01:16 +08:00
Concedo
afe41b6eea Merge branch 'concedo_experimental' of https://github.com/LostRuins/koboldcpp into concedo_experimental 2025-12-24 23:42:52 +08:00
Concedo
d1983959d2 Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.github/workflows/release.yml
#	AGENTS.md
#	common/CMakeLists.txt
#	docs/development/parsing.md
#	ggml/src/ggml-rpc/ggml-rpc.cpp
#	ggml/src/ggml-vulkan/ggml-vulkan.cpp
#	tests/test-arg-parser.cpp
#	tests/test-backend-ops.cpp
#	tests/test-grammar-llguidance.cpp
#	tests/test-tokenizer-0.cpp
#	tests/test-tokenizer-1-bpe.cpp
#	tests/test-tokenizer-1-spm.cpp
#	tools/batched-bench/batched-bench.cpp
#	tools/cli/cli.cpp
#	tools/llama-bench/llama-bench.cpp
#	tools/server/README.md
2025-12-24 23:42:28 +08:00
Wagner Bruna
f30da43b7f sd: get the available schedulers directly from sd.cpp (#1900)
Avoids a hardcoded list on the Python side.
2025-12-24 21:55:24 +08:00
Concedo
26d89bf589 support for downloading AVI from sdui 2025-12-24 18:40:10 +08:00
Concedo
a5f8410001 redact admin password for templates 2025-12-23 13:41:49 +08:00
Wagner Bruna
86a094c559 fix autofit_tax_mb type error (#1897) 2025-12-23 11:31:09 +08:00
Concedo
62e6956def wider launch button 2025-12-22 22:34:54 +08:00
Concedo
8b184dd638 corrupt scaler fix test 2025-12-22 22:24:10 +08:00
Concedo
a14fb971b9 template saving fix 2025-12-22 22:13:58 +08:00
Concedo
7fad4dc0ad fixed ordering of gpu overhead detection 2025-12-22 17:39:05 +08:00
Wagner Bruna
44ce1a80b3 sd: sync to master-431-23fce0b (#1893)
* sd: sync to master-427-78e15bd

* add kl_optimal to the available schedulers list

* more robust workaround to avoid stb linkage issues

* sd: sync to master-431-23fce0b

* add TAEHV support and disable TAE if the model isn't found
2025-12-22 15:07:09 +08:00
Concedo
27c53099f4 adjust scaler checks 2025-12-22 11:50:15 +08:00
Concedo
a0e4b8c18a text for maingpu 2025-12-22 11:07:18 +08:00
Concedo
db4634b9a4 testing new workaround for corrupt scaling 2025-12-21 22:54:40 +08:00
Concedo
fedd529fdc autofit counts overheads 2025-12-21 14:31:08 +08:00