Concedo
58d8635827
fixed autofit
2025-12-28 23:15:06 +08:00
Concedo
07fb18a04b
handle case differences
2025-12-28 21:41:56 +08:00
Concedo
21d801f6d5
init total weight for adaptive p
2025-12-28 15:33:06 +08:00
Concedo
ec95655f3c
fixed default handling for special keys
2025-12-28 13:56:05 +08:00
Concedo
27261bfc26
adaptive decay as an overridable param (+1 squashed commits)
...
Squashed commits:
[d94df7843] adaptive decay as an overridable param
2025-12-28 13:34:20 +08:00
Concedo
1051313cb2
added deprecated item sdgendefaults (+1 squashed commits)
...
Squashed commits:
[efc14a5d9] fixed sd error
2025-12-27 22:47:43 +08:00
Concedo
f5282e114d
allow ANY api field to have specified defaults, and to be overwritten by value specified at load time
2025-12-27 18:57:04 +08:00
Concedo
6548645aaa
rename power law sampler to adaptive p
2025-12-27 17:50:58 +08:00
Concedo
91d8863f18
power law sampler added
2025-12-27 09:46:06 +08:00
Concedo
399fc9c57e
rename tokens tab to context, move fa to hardware
2025-12-26 00:06:07 +08:00
Concedo
cf4201e213
wip power law sampling
2025-12-25 22:01:16 +08:00
Concedo
afe41b6eea
Merge branch 'concedo_experimental' of https://github.com/LostRuins/koboldcpp into concedo_experimental
2025-12-24 23:42:52 +08:00
Concedo
d1983959d2
Merge branch 'upstream' into concedo_experimental
...
# Conflicts:
# .github/workflows/release.yml
# AGENTS.md
# common/CMakeLists.txt
# docs/development/parsing.md
# ggml/src/ggml-rpc/ggml-rpc.cpp
# ggml/src/ggml-vulkan/ggml-vulkan.cpp
# tests/test-arg-parser.cpp
# tests/test-backend-ops.cpp
# tests/test-grammar-llguidance.cpp
# tests/test-tokenizer-0.cpp
# tests/test-tokenizer-1-bpe.cpp
# tests/test-tokenizer-1-spm.cpp
# tools/batched-bench/batched-bench.cpp
# tools/cli/cli.cpp
# tools/llama-bench/llama-bench.cpp
# tools/server/README.md
2025-12-24 23:42:28 +08:00
Wagner Bruna
f30da43b7f
sd: get the available schedulers directly from sd.cpp ( #1900 )
...
Avoids a hardcoded list on the Python side.
2025-12-24 21:55:24 +08:00
Concedo
26d89bf589
support for downloading AVI from sdui
2025-12-24 18:40:10 +08:00
Concedo
a5f8410001
redact admin password for templates
2025-12-23 13:41:49 +08:00
Wagner Bruna
86a094c559
fix autofit_tax_mb type error ( #1897 )
2025-12-23 11:31:09 +08:00
Concedo
62e6956def
wider launch button
2025-12-22 22:34:54 +08:00
Concedo
8b184dd638
corrupt scaler fix test
2025-12-22 22:24:10 +08:00
Concedo
a14fb971b9
template saving fix
2025-12-22 22:13:58 +08:00
Concedo
7fad4dc0ad
fixed ordering of gpu overhead detection
2025-12-22 17:39:05 +08:00
Wagner Bruna
44ce1a80b3
sd: sync to master-431-23fce0b ( #1893 )
...
* sd: sync to master-427-78e15bd
* add kl_optimal to the available schedulers list
* more robust workaround to avoid stb linkage issues
* sd: sync to master-431-23fce0b
* add TAEHV support and disable TAE if the model isn't found
2025-12-22 15:07:09 +08:00
Concedo
27c53099f4
adjust scaler checks
2025-12-22 11:50:15 +08:00
Concedo
a0e4b8c18a
text for maingpu
2025-12-22 11:07:18 +08:00
Concedo
db4634b9a4
testing new workaround for corrupt scaling
2025-12-21 22:54:40 +08:00
Concedo
fedd529fdc
autofit counts overheads
2025-12-21 14:31:08 +08:00
Concedo
9458e08346
fixed https://github.com/LostRuins/koboldcpp/issues/1892
2025-12-19 22:52:39 +08:00
Concedo
30fecac3a3
small tweak
2025-12-18 15:41:22 +08:00
Concedo
1e083d9c8b
integrate autofit for upstream, removed forceversion
2025-12-17 18:42:47 +08:00
Concedo
9bc724f86c
rearrage some elements in launcher
2025-12-17 17:00:26 +08:00
Concedo
cacfa37611
wip
2025-12-17 16:04:45 +08:00
Concedo
bca0258c2a
bump default gen amount by 128 to 896
2025-12-14 22:17:31 +08:00
Concedo
e46a6a2796
better int parser
2025-12-13 09:28:10 +08:00
Concedo
ab9bc6f2ae
zimage cfg clamp is opt out with remove_limits
2025-12-13 09:20:00 +08:00
Concedo
b714fe19e2
allow easy clamping of max cfg and steps
2025-12-12 15:22:37 +08:00
Concedo
d07d2c1b39
stub loras endpoint for comfy
2025-12-11 22:48:38 +08:00
Concedo
fd0d0cab03
move pipeline parallelism to a --pipelineparallel launch flag
2025-12-11 21:03:41 +08:00
Concedo
b7428048fc
try reduce pipeline parallelism in order to reduce compute buffer sizes
2025-12-11 14:30:38 +08:00
Concedo
8a18e094f5
added smartcaching implementation inspired from Pento95 (+2 squashed commit)
...
Squashed commit:
[fcc498688] wip basic smart caching test
[b6e8b2577] wip basic smart caching test
2025-12-10 18:00:03 +08:00
Concedo
242ae8b8f3
http get cleanup
2025-12-08 19:51:55 +08:00
Concedo
8c17541cc0
modify llama.cpp branding on lcpp ui (+1 squashed commits)
...
Squashed commits:
[067343edf] modify llama.cpp branding on lcpp ui
2025-12-07 12:53:33 +08:00
Concedo
12d11cee5c
added url to docs
2025-12-06 09:21:56 +08:00
Concedo
3550265249
add indent to kcpps files
2025-12-04 21:26:44 +08:00
Concedo
177e0d7515
strip_oaicontent_of_media placeholder (+2 squashed commit)
...
Squashed commit:
[7ccd52ef4] placeholder
[71fd2d7bb] strip_oaicontent_of_media
2025-12-01 01:29:57 +08:00
Concedo
bf5efcf86d
Merge commit ' d82b7a7c1d' into concedo_experimental
...
# Conflicts:
# ci/run.sh
# ggml/CMakeLists.txt
# ggml/src/CMakeLists.txt
# ggml/src/ggml-cuda/common.cuh
# tests/CMakeLists.txt
2025-11-30 15:43:11 +08:00
Concedo
2985575be4
allow assistant prefills, fixed showgui issue
2025-11-30 12:52:28 +08:00
Concedo
925e7f8f6d
added a secondary terminal mirror for linux
2025-11-29 21:53:51 +08:00
Concedo
9999b8950d
cleaner resizing
2025-11-29 18:01:49 +08:00
Concedo
eda4a312cb
Merge branch 'upstream' into concedo_experimental
...
# Conflicts:
# .devops/vulkan.Dockerfile
# ggml/src/ggml-cpu/CMakeLists.txt
# ggml/src/ggml-opencl/CMakeLists.txt
# ggml/src/ggml-opencl/ggml-opencl.cpp
# ggml/src/ggml-sycl/common.hpp
# tests/test-backend-ops.cpp
# tools/server/README.md
2025-11-28 13:22:02 +08:00
Concedo
e570478275
limit cuda arches + scale tweaks
2025-11-28 13:05:11 +08:00