Commit graph

1228 commits

Author SHA1 Message Date
Concedo
c9308570b2 added mcp to list of capabilities, allow it to run standalone 2026-01-05 20:32:25 +08:00
Concedo
301a04adfc Merge branch 'concedo' into concedo_experimental 2026-01-05 15:24:43 +08:00
Concedo
9a4eeafbfc hotfix 1.105.3 2026-01-05 15:24:21 +08:00
Concedo
4d3866a016 mcp proxy is done 2026-01-05 12:24:43 +08:00
Concedo
91089ad1bd wip on mcp 2026-01-04 22:52:47 +08:00
Concedo
01c70a7d3d allow transcribe to be used with the LLM instead if no whisper model exists 2026-01-04 11:06:05 +08:00
Concedo
e4abf643fa Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	ggml/src/ggml-hexagon/htp/act-ops.c
#	ggml/src/ggml-rpc/ggml-rpc.cpp
#	src/CMakeLists.txt
#	src/llama-vocab.cpp
2026-01-03 15:37:30 +08:00
Concedo
77082dddfb mcp image handling 2026-01-03 00:03:05 +08:00
Concedo
d8942cde14 smartcache allow custom number of slots 2026-01-02 17:19:40 +08:00
Concedo
0a23388e7d added images in tool call queries 2026-01-02 10:48:34 +08:00
Concedo
442fa7cd7c support for circular textures in sdcpp 2026-01-01 16:34:09 +08:00
Concedo
03df0c40f3 if gendefaults is set, horde has debug flag 2026-01-01 00:54:57 +08:00
Concedo
329c0e7e32 mini qol to prevent fake tool calls 2025-12-29 17:54:27 +08:00
Concedo
58d8635827 fixed autofit 2025-12-28 23:15:06 +08:00
Concedo
07fb18a04b handle case differences 2025-12-28 21:41:56 +08:00
Concedo
21d801f6d5 init total weight for adaptive p 2025-12-28 15:33:06 +08:00
Concedo
ec95655f3c fixed default handling for special keys 2025-12-28 13:56:05 +08:00
Concedo
27261bfc26 adaptive decay as an overridable param (+1 squashed commits)
Squashed commits:

[d94df7843] adaptive decay as an overridable param
2025-12-28 13:34:20 +08:00
Concedo
1051313cb2 added deprecated item sdgendefaults (+1 squashed commits)
Squashed commits:

[efc14a5d9] fixed sd error
2025-12-27 22:47:43 +08:00
Concedo
f5282e114d allow ANY api field to have specified defaults, and to be overwritten by value specified at load time 2025-12-27 18:57:04 +08:00
Concedo
6548645aaa rename power law sampler to adaptive p 2025-12-27 17:50:58 +08:00
Concedo
91d8863f18 power law sampler added 2025-12-27 09:46:06 +08:00
Concedo
399fc9c57e rename tokens tab to context, move fa to hardware 2025-12-26 00:06:07 +08:00
Concedo
cf4201e213 wip power law sampling 2025-12-25 22:01:16 +08:00
Concedo
afe41b6eea Merge branch 'concedo_experimental' of https://github.com/LostRuins/koboldcpp into concedo_experimental 2025-12-24 23:42:52 +08:00
Concedo
d1983959d2 Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.github/workflows/release.yml
#	AGENTS.md
#	common/CMakeLists.txt
#	docs/development/parsing.md
#	ggml/src/ggml-rpc/ggml-rpc.cpp
#	ggml/src/ggml-vulkan/ggml-vulkan.cpp
#	tests/test-arg-parser.cpp
#	tests/test-backend-ops.cpp
#	tests/test-grammar-llguidance.cpp
#	tests/test-tokenizer-0.cpp
#	tests/test-tokenizer-1-bpe.cpp
#	tests/test-tokenizer-1-spm.cpp
#	tools/batched-bench/batched-bench.cpp
#	tools/cli/cli.cpp
#	tools/llama-bench/llama-bench.cpp
#	tools/server/README.md
2025-12-24 23:42:28 +08:00
Wagner Bruna
f30da43b7f sd: get the available schedulers directly from sd.cpp (#1900)
Avoids a hardcoded list on the Python side.
2025-12-24 21:55:24 +08:00
Concedo
26d89bf589 support for downloading AVI from sdui 2025-12-24 18:40:10 +08:00
Concedo
a5f8410001 redact admin password for templates 2025-12-23 13:41:49 +08:00
Wagner Bruna
86a094c559 fix autofit_tax_mb type error (#1897) 2025-12-23 11:31:09 +08:00
Concedo
62e6956def wider launch button 2025-12-22 22:34:54 +08:00
Concedo
8b184dd638 corrupt scaler fix test 2025-12-22 22:24:10 +08:00
Concedo
a14fb971b9 template saving fix 2025-12-22 22:13:58 +08:00
Concedo
7fad4dc0ad fixed ordering of gpu overhead detection 2025-12-22 17:39:05 +08:00
Wagner Bruna
44ce1a80b3 sd: sync to master-431-23fce0b (#1893)
* sd: sync to master-427-78e15bd

* add kl_optimal to the available schedulers list

* more robust workaround to avoid stb linkage issues

* sd: sync to master-431-23fce0b

* add TAEHV support and disable TAE if the model isn't found
2025-12-22 15:07:09 +08:00
Concedo
27c53099f4 adjust scaler checks 2025-12-22 11:50:15 +08:00
Concedo
a0e4b8c18a text for maingpu 2025-12-22 11:07:18 +08:00
Concedo
db4634b9a4 testing new workaround for corrupt scaling 2025-12-21 22:54:40 +08:00
Concedo
fedd529fdc autofit counts overheads 2025-12-21 14:31:08 +08:00
Concedo
9458e08346 fixed https://github.com/LostRuins/koboldcpp/issues/1892 2025-12-19 22:52:39 +08:00
Concedo
30fecac3a3 small tweak 2025-12-18 15:41:22 +08:00
Concedo
1e083d9c8b integrate autofit for upstream, removed forceversion 2025-12-17 18:42:47 +08:00
Concedo
9bc724f86c rearrange some elements in launcher 2025-12-17 17:00:26 +08:00
Concedo
cacfa37611 wip 2025-12-17 16:04:45 +08:00
Concedo
bca0258c2a bump default gen amount by 128 to 896 2025-12-14 22:17:31 +08:00
Concedo
e46a6a2796 better int parser 2025-12-13 09:28:10 +08:00
Concedo
ab9bc6f2ae zimage cfg clamp is opt out with remove_limits 2025-12-13 09:20:00 +08:00
Concedo
b714fe19e2 allow easy clamping of max cfg and steps 2025-12-12 15:22:37 +08:00
Concedo
d07d2c1b39 stub loras endpoint for comfy 2025-12-11 22:48:38 +08:00
Concedo
fd0d0cab03 move pipeline parallelism to a --pipelineparallel launch flag 2025-12-11 21:03:41 +08:00