Commit graph

534 commits

Author SHA1 Message Date
Wagner Bruna
c2d96328fe
sd: sync to master-493-65891d7 (#1960) 2026-02-05 22:11:47 +08:00
Wagner Bruna
d9ac52a01a
sd: sync to master-492-f957fa3 (#1957)
* sd: sync to master-492-f957fa3

* add Res Multistep and Res 2s samplers

* make sdflashattention control flash_attn too
2026-02-04 16:12:39 +08:00
Concedo
7b393fa487 Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.github/workflows/build.yml
#	AUTHORS
#	ci/run.sh
#	docs/backend/SYCL.md
#	docs/build.md
#	docs/multimodal/minicpmo2.6.md
#	docs/multimodal/minicpmo4.0.md
#	docs/multimodal/minicpmv2.5.md
#	docs/multimodal/minicpmv2.6.md
#	docs/multimodal/minicpmv4.0.md
#	docs/multimodal/minicpmv4.5.md
#	docs/ops.md
#	docs/ops/SYCL.csv
#	docs/speculative.md
#	examples/deprecation-warning/README.md
#	examples/deprecation-warning/deprecation-warning.cpp
#	examples/model-conversion/Makefile
#	examples/model-conversion/scripts/causal/convert-model.sh
#	ggml/include/ggml-cann.h
#	ggml/src/ggml-cann/acl_tensor.cpp
#	ggml/src/ggml-cann/acl_tensor.h
#	ggml/src/ggml-cann/aclnn_ops.cpp
#	ggml/src/ggml-cann/aclnn_ops.h
#	ggml/src/ggml-cann/common.h
#	ggml/src/ggml-cann/ggml-cann.cpp
#	ggml/src/ggml-metal/CMakeLists.txt
#	ggml/src/ggml-opencl/ggml-opencl.cpp
#	ggml/src/ggml-opencl/kernels/concat.cl
#	ggml/src/ggml-opencl/kernels/repeat.cl
#	ggml/src/ggml-opencl/kernels/scale.cl
#	ggml/src/ggml-opencl/kernels/tanh.cl
#	ggml/src/ggml-sycl/CMakeLists.txt
#	ggml/src/ggml-sycl/dpct/helper.hpp
#	ggml/src/ggml-sycl/ggml-sycl.cpp
#	ggml/src/ggml-sycl/outprod.cpp
#	ggml/src/ggml-sycl/rope.cpp
#	ggml/src/ggml-sycl/wkv.cpp
#	src/llama-vocab.cpp
#	tests/test-autorelease.cpp
#	tests/test-backend-ops.cpp
#	tools/cvector-generator/pca.hpp
#	tools/export-lora/export-lora.cpp
#	tools/perplexity/README.md
2026-02-03 19:00:42 +08:00
Concedo
a5ae116033 increase z-image default clamp to 4.0, to tolerate z-image base requirement for higher cfg 2026-02-01 22:02:20 +08:00
Concedo
ffdc1b0f9f flux2 image editing 2026-01-31 16:36:45 +08:00
Wagner Bruna
1f01d54848
sd: sync to master-487-43e829f (#1947) 2026-01-29 21:37:30 +08:00
Concedo
7e98269cbc increase sd hard limit to 1.6 mp 2026-01-24 10:51:08 +08:00
Concedo
5c6cc02985 remove clblast, part 2 2026-01-23 14:09:46 +08:00
Concedo
7f485e5287 remove CLBlast, part 1 2026-01-23 13:50:12 +08:00
Wagner Bruna
715a76ce5f
sd: sync to master-480-b87fe13 (#1932) 2026-01-21 20:49:35 +08:00
Concedo
cdd6578a9a esrgan added 2026-01-20 22:10:37 +08:00
Concedo
c9c15749e0 wip on adding esrgan upscaling 2026-01-20 00:35:35 +08:00
Concedo
393791496d flux 2 taesd (+1 squashed commits)
Squashed commits:

[adfc3f3a2] flux 2 taesd
2026-01-19 23:47:16 +08:00
Concedo
70f92b12f8 sdxs clamp steps and cfg 2026-01-19 01:07:27 +08:00
Wagner Bruna
10851f223d
sd: sync to master-473-9565c7f (#1927)
* sd: sync to master-473-9565c7f

* sd: add support for flux2 klein
2026-01-19 01:04:34 +08:00
Concedo
62bea5ef4f allow overriding the devices directly 2026-01-17 19:08:06 +08:00
Concedo
d2b2224b0d vulkan env var always take priority 2026-01-17 10:34:45 +08:00
Wagner Bruna
c91fc850c1
sd: sync to master-467-0e52afc (#1916) 2026-01-15 23:06:51 +08:00
Wagner Bruna
0ef55844d3
sd: sync to master-453-4ff2c8c (#1907) 2026-01-03 15:28:27 +08:00
Concedo
bfa2ae7744 fixed smartcache bug when used with images 2026-01-02 00:35:05 +08:00
Concedo
442fa7cd7c support for circular textures in sdcpp 2026-01-01 16:34:09 +08:00
Concedo
27261bfc26 adaptive decay as an overridable param (+1 squashed commits)
Squashed commits:

[d94df7843] adaptive decay as an overridable param
2025-12-28 13:34:20 +08:00
Concedo
6548645aaa rename power law sampler to adaptive p 2025-12-27 17:50:58 +08:00
Concedo
445aad5e00 remove sdcpp qwen image lora hack 2025-12-27 16:31:29 +08:00
Wagner Bruna
84765f5967
sd: sync to master-447-ccb6b0a (#1898)
* sd: sync to master-438-298b110

* sd: sync to master-440-3e81246

* sd: sync to master-444-a0adcfb

* sd: sync to master-447-ccb6b0a
2025-12-27 16:30:52 +08:00
Concedo
91d8863f18 power law sampler added 2025-12-27 09:46:06 +08:00
Concedo
cf4201e213 wip power law sampling 2025-12-25 22:01:16 +08:00
Concedo
afe41b6eea Merge branch 'concedo_experimental' of https://github.com/LostRuins/koboldcpp into concedo_experimental 2025-12-24 23:42:52 +08:00
Concedo
d1983959d2 Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.github/workflows/release.yml
#	AGENTS.md
#	common/CMakeLists.txt
#	docs/development/parsing.md
#	ggml/src/ggml-rpc/ggml-rpc.cpp
#	ggml/src/ggml-vulkan/ggml-vulkan.cpp
#	tests/test-arg-parser.cpp
#	tests/test-backend-ops.cpp
#	tests/test-grammar-llguidance.cpp
#	tests/test-tokenizer-0.cpp
#	tests/test-tokenizer-1-bpe.cpp
#	tests/test-tokenizer-1-spm.cpp
#	tools/batched-bench/batched-bench.cpp
#	tools/cli/cli.cpp
#	tools/llama-bench/llama-bench.cpp
#	tools/server/README.md
2025-12-24 23:42:28 +08:00
Wagner Bruna
f30da43b7f
sd: get the available schedulers directly from sd.cpp (#1900)
Avoids a hardcoded list on the Python side.
2025-12-24 21:55:24 +08:00
Concedo
26d89bf589 support for downloading AVI from sdui 2025-12-24 18:40:10 +08:00
Concedo
1f6b9338d6 hack to fix lora loading for qwen image 2025-12-23 17:19:16 +08:00
Concedo
7c82cad72c support ovis, added taehv wan embed, fixed compile error (+1 squashed commits)
Squashed commits:

[ab71f6d33] support ovis, added taehv wan embed
2025-12-22 17:08:09 +08:00
Wagner Bruna
44ce1a80b3
sd: sync to master-431-23fce0b (#1893)
* sd: sync to master-427-78e15bd

* add kl_optimal to the available schedulers list

* more robust workaround to avoid stb linkage issues

* sd: sync to master-431-23fce0b

* add TAEHV support and disable TAE if the model isn't found
2025-12-22 15:07:09 +08:00
Concedo
d69db26b44 fix stb multiple impl 2025-12-20 12:05:50 +08:00
Concedo
fb31059f9c fixed a bug in vision with mrope, mrope is refactored to match upstream, should be more accurate now 2025-12-19 01:23:52 +08:00
Concedo
cefb32df19 track clip img patch nx and ny 2025-12-18 22:58:10 +08:00
Concedo
e88bf41fdc Merge commit '12280ae905' into concedo_experimental
# Conflicts:
#	.github/workflows/build.yml
#	common/CMakeLists.txt
#	docs/docker.md
#	examples/model-conversion/scripts/causal/compare-logits.py
#	ggml/src/ggml-hexagon/htp/rope-ops.c
#	tests/test-backend-ops.cpp
#	tests/test-barrier.cpp
#	tools/server/CMakeLists.txt
#	tools/server/README.md
2025-12-16 16:29:01 +08:00
Wagner Bruna
78bbe89956
sd: sync to master-417-43a70e8 (#1889)
* sd: sync to master-417-43a70e8

* fix sdmain build

* switch to upstream apply_loras()

* refactor u8 path conversions and add it to the gguf reader
2025-12-16 16:16:48 +08:00
Concedo
ab9bc6f2ae zimage cfg clamp is opt out with remove_limits 2025-12-13 09:20:00 +08:00
Concedo
3fb0f337fe remove z-image clamping for now 2025-12-11 23:05:00 +08:00
Concedo
8a18e094f5 added smartcaching implementation inspired from Pento95 (+2 squashed commit)
Squashed commit:

[fcc498688] wip basic smart caching test

[b6e8b2577] wip basic smart caching test
2025-12-10 18:00:03 +08:00
Concedo
1aab32fe03 fixed safetensors loading for zimage 2025-12-09 18:09:47 +08:00
Wagner Bruna
801840d3bd
sd: sync to master-391-5865b5e (#1878) 2025-12-08 19:53:52 +08:00
Concedo
20363dc6e7 z image limit cfg scale to 3.0 max 2025-12-07 16:24:26 +08:00
Wagner Bruna
510508e7da
sd: sync to master-391-5865b5e (#1872) 2025-12-04 16:29:38 +08:00
Concedo
7316c24bf6 fixes to load flux2 2025-12-03 18:38:54 +08:00
Wagner Bruna
fbdcc74ae6 adapt kcpp changes for z-image 2025-12-01 19:15:38 -03:00
Wagner Bruna
fcb86f22df sd: sync to master-387-e4c50f1 2025-12-01 19:15:38 -03:00
Wagner Bruna
e971eaefe3 fix: qwenvl.hpp was renamed to llm.hpp 2025-12-01 19:15:38 -03:00