Commit graph

1215 commits

Author SHA1 Message Date
Concedo
58d8635827 fixed autofit 2025-12-28 23:15:06 +08:00
Concedo
07fb18a04b handle case differences 2025-12-28 21:41:56 +08:00
Concedo
21d801f6d5 init total weight for adaptive p 2025-12-28 15:33:06 +08:00
Concedo
ec95655f3c fixed default handling for special keys 2025-12-28 13:56:05 +08:00
Concedo
27261bfc26 adaptive decay as an overridable param (+1 squashed commits)
Squashed commits:

[d94df7843] adaptive decay as an overridable param
2025-12-28 13:34:20 +08:00
Concedo
1051313cb2 added deprecated item sdgendefaults (+1 squashed commits)
Squashed commits:

[efc14a5d9] fixed sd error
2025-12-27 22:47:43 +08:00
Concedo
f5282e114d allow ANY api field to have specified defaults, and to be overwritten by value specified at load time 2025-12-27 18:57:04 +08:00
Concedo
6548645aaa rename power law sampler to adaptive p 2025-12-27 17:50:58 +08:00
Concedo
91d8863f18 power law sampler added 2025-12-27 09:46:06 +08:00
Concedo
399fc9c57e rename tokens tab to context, move fa to hardware 2025-12-26 00:06:07 +08:00
Concedo
cf4201e213 wip power law sampling 2025-12-25 22:01:16 +08:00
Concedo
afe41b6eea Merge branch 'concedo_experimental' of https://github.com/LostRuins/koboldcpp into concedo_experimental 2025-12-24 23:42:52 +08:00
Concedo
d1983959d2 Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.github/workflows/release.yml
#	AGENTS.md
#	common/CMakeLists.txt
#	docs/development/parsing.md
#	ggml/src/ggml-rpc/ggml-rpc.cpp
#	ggml/src/ggml-vulkan/ggml-vulkan.cpp
#	tests/test-arg-parser.cpp
#	tests/test-backend-ops.cpp
#	tests/test-grammar-llguidance.cpp
#	tests/test-tokenizer-0.cpp
#	tests/test-tokenizer-1-bpe.cpp
#	tests/test-tokenizer-1-spm.cpp
#	tools/batched-bench/batched-bench.cpp
#	tools/cli/cli.cpp
#	tools/llama-bench/llama-bench.cpp
#	tools/server/README.md
2025-12-24 23:42:28 +08:00
Wagner Bruna
f30da43b7f
sd: get the available schedulers directly from sd.cpp (#1900)
Avoids a hardcoded list on the Python side.
2025-12-24 21:55:24 +08:00
Concedo
26d89bf589 support for downloading AVI from sdui 2025-12-24 18:40:10 +08:00
Concedo
a5f8410001 redact admin password for templates 2025-12-23 13:41:49 +08:00
Wagner Bruna
86a094c559
fix autofit_tax_mb type error (#1897) 2025-12-23 11:31:09 +08:00
Concedo
62e6956def wider launch button 2025-12-22 22:34:54 +08:00
Concedo
8b184dd638 corrupt scaler fix test 2025-12-22 22:24:10 +08:00
Concedo
a14fb971b9 template saving fix 2025-12-22 22:13:58 +08:00
Concedo
7fad4dc0ad fixed ordering of gpu overhead detection 2025-12-22 17:39:05 +08:00
Wagner Bruna
44ce1a80b3
sd: sync to master-431-23fce0b (#1893)
* sd: sync to master-427-78e15bd

* add kl_optimal to the available schedulers list

* more robust workaround to avoid stb linkage issues

* sd: sync to master-431-23fce0b

* add TAEHV support and disable TAE if the model isn't found
2025-12-22 15:07:09 +08:00
Concedo
27c53099f4 adjust scaler checks 2025-12-22 11:50:15 +08:00
Concedo
a0e4b8c18a text for maingpu 2025-12-22 11:07:18 +08:00
Concedo
db4634b9a4 testing new workaround for corrupt scaling 2025-12-21 22:54:40 +08:00
Concedo
fedd529fdc autofit counts overheads 2025-12-21 14:31:08 +08:00
Concedo
9458e08346 fixed https://github.com/LostRuins/koboldcpp/issues/1892 2025-12-19 22:52:39 +08:00
Concedo
30fecac3a3 small tweak 2025-12-18 15:41:22 +08:00
Concedo
1e083d9c8b integrate autofit for upstream, removed forceversion 2025-12-17 18:42:47 +08:00
Concedo
9bc724f86c rearrage some elements in launcher 2025-12-17 17:00:26 +08:00
Concedo
cacfa37611 wip 2025-12-17 16:04:45 +08:00
Concedo
bca0258c2a bump default gen amount by 128 to 896 2025-12-14 22:17:31 +08:00
Concedo
e46a6a2796 better int parser 2025-12-13 09:28:10 +08:00
Concedo
ab9bc6f2ae zimage cfg clamp is opt out with remove_limits 2025-12-13 09:20:00 +08:00
Concedo
b714fe19e2 allow easy clamping of max cfg and steps 2025-12-12 15:22:37 +08:00
Concedo
d07d2c1b39 stub loras endpoint for comfy 2025-12-11 22:48:38 +08:00
Concedo
fd0d0cab03 move pipeline parallelism to a --pipelineparallel launch flag 2025-12-11 21:03:41 +08:00
Concedo
b7428048fc try reduce pipeline parallelism in order to reduce compute buffer sizes 2025-12-11 14:30:38 +08:00
Concedo
8a18e094f5 added smartcaching implementation inspired from Pento95 (+2 squashed commit)
Squashed commit:

[fcc498688] wip basic smart caching test

[b6e8b2577] wip basic smart caching test
2025-12-10 18:00:03 +08:00
Concedo
242ae8b8f3 http get cleanup 2025-12-08 19:51:55 +08:00
Concedo
8c17541cc0 modify llama.cpp branding on lcpp ui (+1 squashed commits)
Squashed commits:

[067343edf] modify llama.cpp branding on lcpp ui
2025-12-07 12:53:33 +08:00
Concedo
12d11cee5c added url to docs 2025-12-06 09:21:56 +08:00
Concedo
3550265249 add indent to kcpps files 2025-12-04 21:26:44 +08:00
Concedo
177e0d7515 strip_oaicontent_of_media placeholder (+2 squashed commit)
Squashed commit:

[7ccd52ef4] placeholder

[71fd2d7bb] strip_oaicontent_of_media
2025-12-01 01:29:57 +08:00
Concedo
bf5efcf86d Merge commit 'd82b7a7c1d' into concedo_experimental
# Conflicts:
#	ci/run.sh
#	ggml/CMakeLists.txt
#	ggml/src/CMakeLists.txt
#	ggml/src/ggml-cuda/common.cuh
#	tests/CMakeLists.txt
2025-11-30 15:43:11 +08:00
Concedo
2985575be4 allow assistant prefills, fixed showgui issue 2025-11-30 12:52:28 +08:00
Concedo
925e7f8f6d added a secondary terminal mirror for linux 2025-11-29 21:53:51 +08:00
Concedo
9999b8950d cleaner resizing 2025-11-29 18:01:49 +08:00
Concedo
eda4a312cb Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.devops/vulkan.Dockerfile
#	ggml/src/ggml-cpu/CMakeLists.txt
#	ggml/src/ggml-opencl/CMakeLists.txt
#	ggml/src/ggml-opencl/ggml-opencl.cpp
#	ggml/src/ggml-sycl/common.hpp
#	tests/test-backend-ops.cpp
#	tools/server/README.md
2025-11-28 13:22:02 +08:00
Concedo
e570478275 limit cuda arches + scale tweaks 2025-11-28 13:05:11 +08:00