Commit graph

493 commits

Author SHA1 Message Date
Concedo
67ef5e6c02 phonemizer fixes, now kokoro works very well 2025-08-18 16:13:16 +08:00
Concedo
3f621be7dd wip, adding IPA for kokoro 2025-08-18 00:51:12 +08:00
Concedo
3138a151c2 improve hparams for tts 2025-08-17 22:07:13 +08:00
Concedo
685129fb5a add missing title, set max tts length to 1024, updated lite (+2 squashed commit)
Squashed commit:

[0737a028] add missing title

[a42328b0] add max tts length 1024
2025-08-17 21:42:56 +08:00
Concedo
bcaf379509 tts.cpp merged and working in kcpp! 2025-08-17 18:09:28 +08:00
Concedo
52606e9b1d tts cpp model is now loadable in kcpp 2025-08-17 15:47:22 +08:00
Concedo
9935ac093f standardize tts linting and formatting 2025-08-17 14:11:30 +08:00
Concedo
cfc1a0d4ef tts cpp cli builds and runs fine. 2025-08-17 13:53:27 +08:00
Concedo
bc04366a65 builds but crashes 2025-08-17 00:09:03 +08:00
Concedo
302bb8cd39 fixed sdmain compile issues 2025-08-13 22:32:02 +08:00
Concedo
4b2ca1169c more consistency fixes 2025-08-13 19:28:53 +08:00
Wagner Bruna
5de7ed3d56
WIP: update stable-diffusion.cpp to 5900ef6605c6 (new API) (#1669)
* Update stable-diffusion.cpp to 5900ef6605c6 (new API)

* Clean up pending LoRA code and simplify LoRA changes to upstream

* Move VAE tiling disabling for TAESD to sdtype_adapter.cpp

* Move auxiliary ctx functions to sdtype_adapter.cpp

* Use ref_images parameter for Kontext images

* Drop clip skip workaround (fixed upstream)

* Workaround for flash attention with img2img

leejet/stable-diffusion.cpp#756

* Workaround for Chroma with flash attention, debug prints

* Disable forcing CLIP weights to F32 for reduced memory usage
2025-08-12 23:25:02 +08:00
Concedo
7b5cf7143f handle gguf already containing renamed diffusion tensors prefix 2025-08-12 22:42:29 +08:00
Wagner Bruna
eed5577aaa
fix unintended sd model quantization (#1672)
The recent ggml update added another quant type, GGML_TYPE_MXFP4,
which got the same value as SD_TYPE_COUNT. That made the embedded
sd.cpp quantize to GGML_TYPE_MXFP4 by default.

Photomaker in particular ends up crashing due to
"Missing CPY op for types: f32 mxfp4".
2025-08-08 10:19:58 +08:00
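
Note on eed5577aaa: the collision described above can be reproduced with a toy model. This is only a sketch: the enum members, their numeric values, and the helper resolve_weight_type are hypothetical and do not mirror the real ggml or sd.cpp definitions. It assumes the wrapper uses a frozen sentinel constant to mean "no quantization override", which stops working once the library enum grows a new member with the same value.

```python
from enum import IntEnum
from typing import Optional


class GgmlTypesOld(IntEnum):
    """Toy stand-in for the type enum before the update (values are made up)."""
    F32 = 0
    F16 = 1
    Q4_0 = 2
    COUNT = 3


class GgmlTypesNew(IntEnum):
    """Same toy enum after a new quant type is appended before COUNT."""
    F32 = 0
    F16 = 1
    Q4_0 = 2
    MXFP4 = 3
    COUNT = 4


# Hypothetical sentinel baked into the wrapper, meaning "do not requantize".
# It was chosen to equal the old COUNT and never updated.
SD_TYPE_COUNT = 3


def resolve_weight_type(requested: int, ggml_types) -> Optional[IntEnum]:
    """Return None for 'keep original types', else the type to quantize to."""
    if requested == ggml_types.COUNT:
        return None
    return ggml_types(requested)


before = resolve_weight_type(SD_TYPE_COUNT, GgmlTypesOld)  # sentinel recognised
after = resolve_weight_type(SD_TYPE_COUNT, GgmlTypesNew)   # sentinel read as a real type
```

In this toy model the same default argument means "leave weights alone" against the old enum and "quantize to MXFP4" against the new one, which matches the symptom the commit message reports.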
Concedo
9f4d0f6ccf fixed swa pp bug by retrying smaller batches 2025-07-21 23:34:22 +08:00
Concedo
6d50def409 default kv_unified to true, handle LLAMA_SET_ROWS. 2025-07-21 16:13:20 +08:00
Concedo
15b1034dd6 allow larger img2img 2025-07-20 11:10:03 +08:00
Concedo
f0564f9caf updated lite, added better separators for multimodal chunks (universal) 2025-07-17 00:11:08 +08:00
Concedo
66755c8fe9 switch to miniaudio, support mp3 for whisper 2025-07-13 23:24:07 +08:00
Concedo
dca49de059 fixed qwen2 audio issues, works fine now (+3 squashed commit)
Squashed commit:

[b3053a1ba] updated lite

[5071630d6] fixed mtmd issues, audio works

[06efa5af4] fix mtmd compile
2025-07-12 18:54:41 +08:00
Concedo
e9473305d0 wip2 (+1 squashed commits)
Squashed commits:

[4628777b6] wip
2025-07-12 18:54:40 +08:00
Wagner Bruna
d74c16e6e0
enable flash attention for image generation (#1633) 2025-07-05 11:20:51 +08:00
Concedo
186227fc26 sync with sd.cpp 2025-06-30 00:10:51 +08:00
Concedo
2594be7d4e fixed scaling behavior again 2025-06-29 11:36:38 +08:00
Concedo
2635e4b932 try fix segfault in sdcpp 2025-06-29 02:33:03 +08:00
Concedo
485148b293 fixed sdmain compiling 2025-06-28 23:19:13 +08:00
Concedo
6c92a9f0e1 fixed resizing 2025-06-28 23:10:04 +08:00
Concedo
a1175cf34f merged leejet changes 2025-06-28 22:57:07 +08:00
Concedo
4ec0e0fd21 now accept multiple images for reference images 2025-06-28 17:30:28 +08:00
Concedo
ff2cabc28f fixed kontext and photomaker (+1 squashed commits)
Squashed commits:

[de0ac91dd] photomaker use 1 channel
2025-06-28 12:14:05 +08:00
Concedo
5a6cc38f35 fixed a typo 2025-06-28 11:47:07 +08:00
Concedo
ed289227e5 added support for flux kontext 2025-06-28 11:37:19 +08:00
Concedo
0bd648ffa4 photomaker renamed to extra image to handle future extension 2025-06-28 10:26:06 +08:00
Concedo
39b0699c71 fixed savestates with drafting 2025-06-27 20:35:38 +08:00
Concedo
8ce56bd547 tts more silence at the end 2025-06-23 17:15:35 +08:00
Concedo
6039791adf minor bugfixes 2025-06-21 18:41:28 +08:00
Wagner Bruna
08adfb53c9
Configurable VAE threshold limit (#1601)
* add backend support for changing the VAE tiling threshold

* trigger VAE tiling by image area instead of dimensions

I've tested with GGML_VULKAN_MEMORY_DEBUG all resolutions with
the same 768x768 area (even extremes like 64x9216), and many
below that: all consistently allocate 6656 bytes per image pixel.
As tiling is primarily useful to avoid excessive memory usage, it
seems reasonable to enable VAE tiling based on area rather than
maximum image side.

However, as there is currently no user interface option to change
it back to a lower value, it's best to maintain the default
behavior for now.

* replace the notile option with a configurable threshold

This allows selecting a lower threshold value, reducing the
peak memory usage.

The legacy sdnotile parameter gets automatically converted to
the new parameter, if it's the only one supplied.

* simplify tiling checks, 768 default visible in launcher

---------

Co-authored-by: Concedo <39025047+LostRuins@users.noreply.github.com>
2025-06-21 10:14:57 +08:00
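
Note on 08adfb53c9: a minimal sketch of the difference between triggering VAE tiling by the longest side and by image area. The function names are hypothetical, the strict ">" comparison is an assumption, and the 6656 bytes per pixel figure is simply the measurement quoted in the commit message (taken under Vulkan), not a general constant.

```python
VAE_BYTES_PER_PIXEL = 6656  # measurement quoted in the commit message


def estimate_vae_bytes(width: int, height: int) -> int:
    """Rough memory estimate, assuming cost scales linearly with pixel count."""
    return width * height * VAE_BYTES_PER_PIXEL


def should_tile_by_side(width: int, height: int, threshold: int = 768) -> bool:
    """Side-based rule: tile when the longest side exceeds the threshold."""
    return max(width, height) > threshold


def should_tile_by_area(width: int, height: int, threshold: int = 768) -> bool:
    """Area-based rule: tile when the pixel count exceeds threshold squared."""
    return width * height > threshold * threshold


# 64x9216 has exactly the same area as 768x768, so the estimated memory
# is identical, yet only the side-based rule would force tiling for it.
square = (768, 768)
strip = (64, 9216)
same_cost = estimate_vae_bytes(*square) == estimate_vae_bytes(*strip)
```

Under these assumptions the area-based rule treats both shapes alike, which is the argument the commit makes for tying the threshold to memory use rather than to the maximum image side.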
Concedo
caea52407a fix photomaker crash 2025-06-21 10:11:39 +08:00
Concedo
684d71e058 add old convert tool 2025-06-21 08:40:04 +08:00
Concedo
ce58d1253f fixed build and workflow 2025-06-21 00:56:27 +08:00
Concedo
2ba7803b95 replace_instruct_placeholders is now default 2025-06-20 22:11:58 +08:00
Concedo
4e40f2aaf4 added photomaker face cloning 2025-06-20 21:33:36 +08:00
Concedo
21881a861d rename restrict square to sdclampedsoft 2025-06-20 15:39:55 +08:00
Concedo
175c99081e merged https://github.com/leejet/stable-diffusion.cpp/issues/588 to fix vae tiling, ref https://github.com/LostRuins/koboldcpp/issues/1603 2025-06-20 11:13:04 +08:00
Concedo
e35c6b8f9b remove t5 masking sdcpp 2025-06-18 21:05:03 +08:00
Concedo
40443a98f5 show available RAM, fixed SD vae tiling noise 2025-06-18 18:44:50 +08:00
Concedo
bfb47cbcd8 Revert "revert padding change for sd chroma"
This reverts commit 7de88802f9.
2025-06-14 10:10:34 +08:00
Concedo
c494525b33 update deprecated apis 2025-06-13 22:21:15 +08:00
Wagner Bruna
f6d2d1ce5c
configurable resolution limit (#1586)
* refactor image gen configuration screen

* make image size limit configurable

* fix resolution limits and keep dimensions closer to the original ratio

* use 0.0 for the configured default image size limit

This prevents the current default value from being saved into the
config files, in case we later decide to adopt a different value.

* export image model version when loading

* restore model-specific default image size limit

* change the image area restriction to be specified by a square side

* move image resolution limits down to the C++ level

* Revert "export image model version when loading"

This reverts commit fa65b23de3.

* Linting Fixes:
PY:
- Inconsistent var name sd_restrict_square -> sd_restrict_square_var
- GUI swap back to using absolute row numbers for now.
- fstring fix
- size_limit -> side_limit inconsistency
C++:
- roundup_64 standalone function
- refactor sd_fix_resolution variable names for clarity
- move "anti crashing" hard total megapixel limit always to be applied after soft total megapixel limit instead of conditionally only when sd_restrict_square is unset

* allow unsafe resolutions if debugmode is on

---------

Co-authored-by: Concedo <39025047+LostRuins@users.noreply.github.com>
2025-06-13 20:05:20 +08:00
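
Note on f6d2d1ce5c: a rough sketch of limiting image area by a square side while keeping the aspect ratio, then snapping to multiples of 64. Only the name roundup_64 comes from the commit message; fit_resolution, its parameters, and the exact order of scaling and rounding are assumptions for illustration and are not taken from the actual C++ code.

```python
import math


def roundup_64(value: int) -> int:
    """Round a non-negative integer up to the next multiple of 64."""
    return ((value + 63) // 64) * 64


def fit_resolution(width: int, height: int, side_limit: int) -> tuple[int, int]:
    """Scale (width, height) so the area fits within side_limit squared.

    The aspect ratio is preserved before rounding. Because each side is
    rounded up to a multiple of 64 afterwards, the result can slightly
    overshoot the limit; a separate hard cap would be needed to prevent that.
    """
    max_area = side_limit * side_limit
    area = width * height
    if area > max_area:
        scale = math.sqrt(max_area / area)
        width = int(width * scale)
        height = int(height * scale)
    return roundup_64(width), roundup_64(height)


wide = fit_resolution(2048, 1024, 1024)   # downscaled, roughly 2:1
small = fit_resolution(512, 512, 1024)    # already within the limit
```

Expressing the limit as a square side keeps the setting easy to read (for example 1024 for a 1024x1024 budget) while still allowing non-square images with the same total area.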
Concedo
7d8aa31f1f fixed embeddings, added new parameter to limit max embeddings context 2025-06-10 01:11:55 +08:00