Concedo
7e35954695
Merge branch 'upstream' into concedo_experimental
...
# Conflicts:
# docs/build.md
# docs/function-calling.md
# examples/eval-callback/eval-callback.cpp
# ggml/CMakeLists.txt
# ggml/src/ggml-cann/ggml-cann.cpp
# ggml/src/ggml-cpu/CMakeLists.txt
# ggml/src/ggml-cpu/kleidiai/kernels.cpp
# ggml/src/ggml-cpu/kleidiai/kernels.h
# ggml/src/ggml-cpu/kleidiai/kleidiai.cpp
# scripts/compare-llama-bench.py
# scripts/server-bench.py
# scripts/tool_bench.py
# tests/test-chat.cpp
# tools/batched-bench/batched-bench.cpp
# tools/llama-bench/llama-bench.cpp
# tools/server/README.md
2025-08-31 23:33:36 +08:00
Concedo
7b396bd917
added v1 voices endpoint, added lcpp aliases for cli, fixed dia wrong voice
2025-08-30 11:20:18 +08:00
Concedo
75c919cfd4
can't resolve the clicking
2025-08-26 17:55:03 +08:00
Concedo
ff1d179e21
kokoro fixes for every better pausing
2025-08-25 22:28:50 +08:00
Concedo
205a0b8d4c
fix kokoro replacement, add 4096 batch size option
2025-08-25 15:57:13 +08:00
Wagner Bruna
f41fc2579e
fix ttscppmain build ( #1707 )
2025-08-25 10:22:33 +08:00
Concedo
02e05f9321
fix for comma pausing in kokoro
2025-08-25 00:18:00 +08:00
Concedo
b0a8d11584
add tts max length for kokoro (+1 squashed commits)
...
Squashed commits:
[c1c6feaf] add tts max length for kokoro
2025-08-24 17:57:29 +08:00
Concedo
3867db34bc
adjust kokoro default voices
Copilot Setup Steps / copilot-setup-steps (push) Has been cancelled
2025-08-22 23:48:29 +08:00
Concedo
80dabbb689
minor adjustments for sdquant: allow backend to do the translation for the type more defensively, adjust the UI dropdown for clarity.
2025-08-22 23:23:32 +08:00
Wagner Bruna
2f8b0ec538
Support q8_0 quantization for image model loading ( #1692 )
...
* Support q8_0 quantization for image model loading
q4_0 may degrade quality significantly, especially for smaller
models like SD 1.5 and SDXL. q8_0 provides a middle-ground,
giving half the memory savings of q4_0 but loading faster and
with less quality loss.
* Accept --sdquant with no parameters
* Use numerical values for the sdquant option
2025-08-22 22:17:15 +08:00
Concedo
9dd6b4c930
improve whisper transcribe apt regex
2025-08-22 17:13:51 +08:00
Wagner Bruna
6003e90e50
Add flash attention and conv2d direct controls for image generation ( #1678 )
...
* Add separate flash attention config for image generation
* Add config option for Conv2D Direct
2025-08-20 12:17:57 +08:00
Concedo
56c0926866
fix metal build
2025-08-19 15:41:16 +08:00
Concedo
9fb0611115
handle contractions correctly, bump defaults
2025-08-18 22:33:44 +08:00
Concedo
2abe11071b
custom voice handling
2025-08-18 16:57:34 +08:00
Concedo
67ef5e6c02
phonemizer fixes, now kokoro works very well
2025-08-18 16:13:16 +08:00
Concedo
3f621be7dd
wip, adding IPA for kokoro
2025-08-18 00:51:12 +08:00
Concedo
3138a151c2
improve hparams for tts
2025-08-17 22:07:13 +08:00
Concedo
685129fb5a
add missing title, set max tts length to 1024, updated lite (+2 squashed commit)
...
Squashed commit:
[0737a028] add missing title
[a42328b0] add max tts length 1024
2025-08-17 21:42:56 +08:00
Concedo
bcaf379509
tts.cpp merged and working in kcpp!
2025-08-17 18:09:28 +08:00
Concedo
52606e9b1d
tts cpp model is now loadable in kcpp
2025-08-17 15:47:22 +08:00
Concedo
9935ac093f
standardize tts linting and formatting
2025-08-17 14:11:30 +08:00
Concedo
cfc1a0d4ef
tts cpp cli builds and runs fine.
2025-08-17 13:53:27 +08:00
Concedo
bc04366a65
builds but crashes
2025-08-17 00:09:03 +08:00
Concedo
302bb8cd39
fixed sdmain compile issues
2025-08-13 22:32:02 +08:00
Concedo
4b2ca1169c
more consistency fixes
2025-08-13 19:28:53 +08:00
Wagner Bruna
5de7ed3d56
WIP: update stable-diffusion.cpp to 5900ef6605c6 (new API) ( #1669 )
...
* Update stable-diffusion.cpp to 5900ef6605c6 (new API)
* Clean up pending LoRA code and simplify LoRA changes to upstream
* Move VAE tiling disabling for TAESD to sdtype_adapter.cpp
* Move auxiliary ctx functions to sdtype_adapter.cpp
* Use ref_images parameter for Kontext images
* Drop clip skip workaround (fixed upstream)
* Workaround for flash attention with img2img
leejet/stable-diffusion.cpp#756
* Workaround for Chroma with flash attention, debug prints
* Disable forcing CLIP weights to F32 for reduced memory usage
2025-08-12 23:25:02 +08:00
Concedo
7b5cf7143f
handle gguf already containing renamed diffusion tensors prefix
2025-08-12 22:42:29 +08:00
Wagner Bruna
eed5577aaa
fix unintended sd model quantization ( #1672 )
...
The recent ggml update added another quant type, GGML_TYPE_MXFP4,
which got the same value as SD_TYPE_COUNT. That made the embedded
sd.cpp quantize to GGML_TYPE_MXFP4 by default.
Photomaker in particular ends up crashing due to
"Missing CPY op for types: f32 mxfp4".
2025-08-08 10:19:58 +08:00
Concedo
9f4d0f6ccf
fixed swa pp bug by retrying smaller batches
2025-07-21 23:34:22 +08:00
Concedo
6d50def409
default kv_unified to true, handle LLAMA_SET_ROWS.
2025-07-21 16:13:20 +08:00
Concedo
15b1034dd6
allow larger img2img
2025-07-20 11:10:03 +08:00
Concedo
f0564f9caf
updated lite, added better separators for multimodal chunks (universal)
2025-07-17 00:11:08 +08:00
Concedo
66755c8fe9
switch to miniaudio, support mp3 for whisper
2025-07-13 23:24:07 +08:00
Concedo
dca49de059
fixed qwen2 audio issues, works fine now (+3 squashed commit)
...
Squashed commit:
[b3053a1ba] updated lite
[5071630d6] fixed mtmd issues, audio works
[06efa5af4] fix mtmd compile
2025-07-12 18:54:41 +08:00
Concedo
e9473305d0
wip2 (+1 squashed commits)
...
Squashed commits:
[4628777b6] wip
2025-07-12 18:54:40 +08:00
Wagner Bruna
d74c16e6e0
enable flash attention for image generation ( #1633 )
2025-07-05 11:20:51 +08:00
Concedo
186227fc26
sync with sd.cpp
2025-06-30 00:10:51 +08:00
Concedo
2594be7d4e
fixed scaling behavior again
2025-06-29 11:36:38 +08:00
Concedo
2635e4b932
try fix segfault in sdcpp
2025-06-29 02:33:03 +08:00
Concedo
485148b293
fixed sdmain compiling
2025-06-28 23:19:13 +08:00
Concedo
6c92a9f0e1
fixed resizing
2025-06-28 23:10:04 +08:00
Concedo
a1175cf34f
merged leejet changes
2025-06-28 22:57:07 +08:00
Concedo
4ec0e0fd21
now accept multiple images for reference images
2025-06-28 17:30:28 +08:00
Concedo
ff2cabc28f
fixed kontext and photomaker (+1 squashed commits)
...
Squashed commits:
[de0ac91dd] photomaker use 1 channel
2025-06-28 12:14:05 +08:00
Concedo
5a6cc38f35
fixed a typo
2025-06-28 11:47:07 +08:00
Concedo
ed289227e5
added support for flux kontext
2025-06-28 11:37:19 +08:00
Concedo
0bd648ffa4
photomaker renamed to extra image to handle future extension
2025-06-28 10:26:06 +08:00
Concedo
39b0699c71
fixed savestates with drafting
2025-06-27 20:35:38 +08:00