Commit graph

207 commits

Author SHA1 Message Date
Wagner Bruna
b437d18319
add support for cache modes to accelerate image generation (#2021)
* sd: sync to master-525-d6dd6d7

* sd: add support for cache modes for inference acceleration

* keep gendefaults as a JSON object inside the config file

* covered more invalid cases on gendefaults parsing
2026-03-15 15:27:14 +08:00
Concedo
8f23b8d81e wip on ref audio, but it compiles 2026-03-12 23:46:10 +08:00
Concedo
3fd9648726 added mp3 support 2026-03-12 21:00:50 +08:00
Concedo
211d4fe632 lots of tweaks for ace step 2026-03-11 23:57:52 +08:00
Wagner Bruna
3f42ed1af7
support for customizing LoRA multipliers through the sdapi (#1982)
* fix corner case in sd_oai_transform_params

Also fix typo in the function name.

* support for customizing loaded LoRA multipliers

The `sdloramult` flag now accepts a list of multipliers, one for each
LoRA. If all multipliers are non-zero, LoRAs load as before, with no extra
VRAM usage or performance impact.

If any LoRA has a multiplier of 0, we switch to `at_runtime` mode, and these
LoRAs will be available to multiplier changes via the `lora` sdapi field and
show up in the `sdapi/v1/loras` endpoint. All LoRAs are still preloaded on
startup, and cached to avoid file reloads.

If the list of multipliers is shorter than the list of LoRAs, the multiplier
list is extended with the first multiplier (1.0 by default), to keep it
compatible with the previous behavior.

* support for `<lora:name:multiplier>` prompt syntax and metadata

* add a few tests for sanitize_lora_multipliers
2026-03-10 21:29:39 +08:00
Wagner Bruna
5c40f07d4a
sd: sync to 0752cc9 (master-507-b314d80 +1) (#1999)
* sd: sync to 0752cc9 (master-507-b314d80 +1)

* sd: add flow-shift support to gendefaults
2026-02-28 12:22:32 +08:00
Concedo
ba42f22fc8 stereo is working 2026-02-27 20:36:44 +08:00
Concedo
5c5fe55f7d bump kv overrides max (+1 squashed commits)
Squashed commits:

[9bc8212a0] bump kv overrides max
2026-02-26 00:24:53 +08:00
Concedo
11a85d62fc lowvram for music lm 2026-02-24 22:21:17 +08:00
Concedo
488c431331 not yet working 2026-02-24 17:47:50 +08:00
Concedo
c2b0cb26a8 ace step codes api 2026-02-23 14:04:45 +08:00
Concedo
4be93db21c ace step codes generation now working 2026-02-23 00:27:26 +08:00
Concedo
13db5aee9e stub files for loading ace step 2026-02-22 23:15:08 +08:00
Concedo
2db018a1d7 qwen3tts support reference audio 2026-02-21 17:30:21 +08:00
Concedo
bf3f2e1ba8 support loading multiple sd loras (up to 4 at once) 2026-02-19 13:57:58 +08:00
Concedo
349c461453 add stop reason for error 2026-02-04 20:23:18 +08:00
Concedo
7f485e5287 remove CLBlast, part 1 2026-01-23 13:50:12 +08:00
Concedo
cdd6578a9a esrgan added 2026-01-20 22:10:37 +08:00
Concedo
c9c15749e0 wip on adding esrgan upscaling 2026-01-20 00:35:35 +08:00
Llama
95ebfdcde8
Add token ids to logprob data returned by the API (#1928)
Previously, logprobs only contained the token string
and byte data, as well as the log probability itself.
For workflows that require the token id, translating
from the token bytes to the token id is potentially
costly and unreliable. It is simple and inexpensive
to expose the numeric token ids directly instead.
2026-01-18 16:30:46 +08:00
Concedo
3816391a74 increase logprobs returned to 10 2026-01-18 11:13:42 +08:00
Concedo
62bea5ef4f allow overriding the devices directly 2026-01-17 19:08:06 +08:00
Concedo
d8942cde14 smartcache allow custom number of slots 2026-01-02 17:19:40 +08:00
Concedo
442fa7cd7c support for circular textures in sdcpp 2026-01-01 16:34:09 +08:00
Concedo
27261bfc26 adaptive decay as an overridable param (+1 squashed commits)
Squashed commits:

[d94df7843] adaptive decay as an overridable param
2025-12-28 13:34:20 +08:00
Concedo
6548645aaa rename power law sampler to adaptive p 2025-12-27 17:50:58 +08:00
Concedo
91d8863f18 power law sampler added 2025-12-27 09:46:06 +08:00
Wagner Bruna
f30da43b7f
sd: get the available schedulers directly from sd.cpp (#1900)
Avoids a hardcoded list on the Python side.
2025-12-24 21:55:24 +08:00
Concedo
26d89bf589 support for downloading AVI from sdui 2025-12-24 18:40:10 +08:00
Concedo
fedd529fdc autofit counts overheads 2025-12-21 14:31:08 +08:00
Concedo
1e083d9c8b integrate autofit for upstream, removed forceversion 2025-12-17 18:42:47 +08:00
Concedo
ab9bc6f2ae zimage cfg clamp is opt out with remove_limits 2025-12-13 09:20:00 +08:00
Concedo
fd0d0cab03 move pipeline parallelism to a --pipelineparallel launch flag 2025-12-11 21:03:41 +08:00
Concedo
8a18e094f5 added smartcaching implementation inspired from Pento95 (+2 squashed commit)
Squashed commit:

[fcc498688] wip basic smart caching test

[b6e8b2577] wip basic smart caching test
2025-12-10 18:00:03 +08:00
Wagner Bruna
3a7dd1a97f sd: sync to master-358-347710f
Also adapt Koboldcpp LoRA loading function, and add
backend support for lora_apply_mode.
2025-11-23 19:28:54 -03:00
LostRuins Concedo
281542aa0d add smoothing curve, not tested 2025-11-17 23:07:35 +08:00
LostRuins Concedo
a67f097de3 fixed incorrect parameter order for mirostat 2025-11-16 17:38:25 +08:00
Concedo
57e1d9c822 rename blasbatchsize to batchsize 2025-10-24 18:16:54 +08:00
Concedo
68c9d955d2 support multiple override kv 2025-10-24 17:28:54 +08:00
Wagner Bruna
c48999f7c0
additional options for image generation (#1765)
* sd: add backend support for choosing the default sampler

* use the default sampler on the API

* sd: add backend support for the scheduler

* sd: add backend support for distilled guidance

* sd: add backend support for timestep-shift

* sd: add a config field to set default image gen options
2025-10-05 23:36:20 +08:00
Concedo
f282362414 added qwen image support (+1 squashed commits)
Squashed commits:

[92df28061] added qwen image support (+1 squashed commits)

Squashed commits:

[1485c71ed] wip adding qwen image
2025-10-03 18:58:48 +08:00
Concedo
2fc31d36c0 gif mime type for animated images 2025-10-01 17:18:00 +08:00
Concedo
4b1c89ca5c can save animated gifs 2025-09-29 22:52:42 +08:00
Concedo
7b71742223 added additional toggles for SD, sdoffloadcpu , sdvaecpu and sdclipcpu 2025-09-28 22:08:29 +08:00
Concedo
553890f975 compact UI launcher, WIP handling video outputs 2025-09-28 17:24:04 +08:00
Concedo
3210b378e8 better tool calls 2025-08-20 22:11:31 +08:00
Wagner Bruna
6003e90e50
Add flash attention and conv2d direct controls for image generation (#1678)
* Add separate flash attention config for image generation

* Add config option for Conv2D Direct
2025-08-20 12:17:57 +08:00
Concedo
2abe11071b custom voice handling 2025-08-18 16:57:34 +08:00
Concedo
5a921a40f9 add overridenativecontext flag, stop nagging me 2025-08-14 22:54:45 +08:00
Concedo
e40d26b9e7 allow offloading moe to cpu with --moecpu 2025-08-05 23:42:42 +08:00