Concedo
7de88802f9
revert padding change for sd chroma
2025-06-08 23:48:46 +08:00
Concedo
2d4c1aa5a0
chroma support is now usable
2025-06-08 18:53:59 +08:00
Concedo
30cf433ab4
merge base support for chroma, however its not working correctly
2025-06-08 18:06:23 +08:00
Concedo
cfcdfd69bd
allow embeddings models to use mmap
2025-06-07 10:14:00 +08:00
Concedo
736030bb9f
save and load state upgraded to 3 available states
2025-06-04 22:09:40 +08:00
Concedo
b08dca65ed
Merge branch 'upstream' into concedo_experimental
...
# Conflicts:
# common/CMakeLists.txt
# common/arg.cpp
# common/chat.cpp
# examples/parallel/README.md
# examples/parallel/parallel.cpp
# ggml/cmake/common.cmake
# ggml/src/CMakeLists.txt
# ggml/src/ggml-cpu/CMakeLists.txt
# ggml/src/ggml-sycl/ggml-sycl.cpp
# ggml/src/ggml-sycl/rope.cpp
# models/ggml-vocab-bert-bge.gguf.inp
# models/ggml-vocab-bert-bge.gguf.out
# models/ggml-vocab-command-r.gguf.inp
# models/ggml-vocab-command-r.gguf.out
# models/ggml-vocab-deepseek-coder.gguf.inp
# models/ggml-vocab-deepseek-coder.gguf.out
# models/ggml-vocab-deepseek-llm.gguf.inp
# models/ggml-vocab-deepseek-llm.gguf.out
# models/ggml-vocab-falcon.gguf.inp
# models/ggml-vocab-falcon.gguf.out
# models/ggml-vocab-gpt-2.gguf.inp
# models/ggml-vocab-gpt-2.gguf.out
# models/ggml-vocab-llama-bpe.gguf.inp
# models/ggml-vocab-llama-bpe.gguf.out
# models/ggml-vocab-llama-spm.gguf.inp
# models/ggml-vocab-llama-spm.gguf.out
# models/ggml-vocab-mpt.gguf.inp
# models/ggml-vocab-mpt.gguf.out
# models/ggml-vocab-phi-3.gguf.inp
# models/ggml-vocab-phi-3.gguf.out
# models/ggml-vocab-qwen2.gguf.inp
# models/ggml-vocab-qwen2.gguf.out
# models/ggml-vocab-refact.gguf.inp
# models/ggml-vocab-refact.gguf.out
# models/ggml-vocab-starcoder.gguf.inp
# models/ggml-vocab-starcoder.gguf.out
# requirements/requirements-gguf_editor_gui.txt
# tests/CMakeLists.txt
# tests/test-chat.cpp
# tests/test-grammar-integration.cpp
# tests/test-json-schema-to-grammar.cpp
# tools/mtmd/CMakeLists.txt
# tools/run/run.cpp
# tools/server/CMakeLists.txt
2025-05-31 13:04:21 +08:00
Wagner Bruna
12f99ba907
fix: workaround for default clip_skip issues ( #1572 )
...
Sets the clip_skip value explicitly to 1 or 2 for all generation
requests, aligning with the tests in the Conditioner objects in
conditioner.hpp.
This should fix #1546 regardless of future changes to the default
behavior of sdcpp. This workaround can be removed once a proper
fix is implemented in sdcpp.
2025-05-31 10:36:30 +08:00
Concedo
fe401ca4c2
fixed a typo
2025-05-30 13:35:42 +08:00
Concedo
a11ab0b08e
reverse clip skip fix as it might be breaking some sdxl models
2025-05-30 10:40:03 +08:00
Concedo
e14aec58bc
embeds no offload qkv
2025-05-29 00:28:02 +08:00
Concedo
fcc1b43c06
embeddings change to encode
2025-05-28 23:24:33 +08:00
Concedo
26bf5b446d
fixed thread count <=0 , fixed clip skip <= 0
2025-05-28 00:38:15 +08:00
Concedo
55cc9acec5
Merge branch 'upstream' into concedo_experimental
...
# Conflicts:
# .github/workflows/release.yml
# README.md
# ggml/src/ggml-cann/aclnn_ops.cpp
# ggml/src/ggml-cann/ggml-cann.cpp
# tools/mtmd/CMakeLists.txt
# tools/mtmd/clip.cpp
# tools/mtmd/clip.h
2025-05-24 12:10:36 +08:00
Concedo
499283c63a
rename define to match upstream
2025-05-23 17:10:12 +08:00
Concedo
e68a5f448c
add ddim sampler
2025-05-22 21:28:01 +08:00
Wagner Bruna
7dc3e3e64b
store clip skip value on generated images ( #1551 )
2025-05-21 23:37:48 +08:00
Concedo
da7fd4aa57
Merge branch 'upstream' into concedo_experimental
...
# Conflicts:
# .devops/musa.Dockerfile
# .github/workflows/build.yml
# README.md
# ci/README.md
# docs/docker.md
# examples/lookahead/lookahead.cpp
# examples/lookup/lookup.cpp
# examples/parallel/parallel.cpp
# ggml/src/ggml-musa/CMakeLists.txt
# ggml/src/ggml-sycl/ggml-sycl.cpp
# tests/test-arg-parser.cpp
2025-05-21 23:12:22 +08:00
Concedo
9f976e9c65
swa full used unless ctx shift and fast forward disabled
2025-05-21 22:47:45 +08:00
Concedo
5b6ed445de
better warning message
2025-05-21 21:47:40 +08:00
Concedo
f841b29c41
fixed unicode paths
2025-05-11 14:05:54 +08:00
Concedo
2439014a03
Merge branch 'upstream' into concedo_experimental
...
# Conflicts:
# .github/workflows/build.yml
# examples/embedding/embedding.cpp
# tools/imatrix/imatrix.cpp
# tools/perplexity/perplexity.cpp
2025-05-08 23:41:02 +08:00
Concedo
c2802af9e8
fix qwen3, fixed sd, fixed glm4
2025-04-29 20:50:46 +08:00
Concedo
cbdee99354
revert sdcpp clip quant changes
2025-04-17 11:11:34 +08:00
Concedo
b2719268df
merge sdcpp fixes
2025-04-17 00:52:49 +08:00
Concedo
fe12b1cbd4
fixed lora, now works quanted too
2025-04-14 23:44:42 +08:00
Concedo
ad2522b319
str splitter
2025-04-14 23:05:36 +08:00
Concedo
2d0b7e37f9
fix build
2025-04-13 22:01:48 +08:00
Concedo
7e1289ade8
fixes for sdcpp
2025-04-12 10:08:23 +08:00
Concedo
8acec907bb
revert sbti image write
2025-04-10 10:43:24 +08:00
Concedo
27f575dc83
inpaining support completed, invert mask added
2025-04-09 23:50:17 +08:00
Concedo
23339ace9b
inpainting works in kcpp!
2025-04-09 23:01:05 +08:00
Concedo
fea3b2bd4a
updated sdcpp prepare for inpaint
...
fixed img2img (+1 squashed commits)
Squashed commits:
[42c48f14] try update sdcpp, feels kind of buggy
2025-04-09 20:26:10 +08:00
Concedo
e37f27632f
clear cpu flag manually for templates, added truncation for embeddings
2025-04-02 00:18:30 +08:00
Concedo
6a709be50a
replace deprecated
2025-03-27 10:27:20 +08:00
Concedo
2bdf1dacff
embeddings done
2025-03-25 22:41:46 +08:00
Concedo
3992fb79cc
wip adding embeddings support
2025-03-24 18:01:23 +08:00
Concedo
c1e58419c7
support for voice cloning is done (+2 squashed commit)
...
Squashed commit:
[e7301628] support for voice cloning is done
[1653c576] wip adding voice cloning
2025-03-21 22:28:59 +08:00
Concedo
e466ce65e2
updated sd metadata
2025-03-16 20:12:43 +08:00
Concedo
bfc30066c9
fixed a clip processing bug
2025-03-15 17:49:49 +08:00
Concedo
0db4ae6237
traded my ink for a pen
2025-03-14 11:58:15 +08:00
Concedo
ca698f0cbe
tweaked sd img metadata
2025-03-13 20:04:29 +08:00
Wagner Bruna
5413be2c1b
sd: add generation parameters to image metadata ( #1416 )
...
Straight adaptation from stable-diffusion.cpp main.cpp.
2025-03-13 19:35:06 +08:00
Concedo
eb1809c105
add more perf stats
2025-03-12 18:58:27 +08:00
Concedo
6b7d2349a7
Rewrite history to fix bad vulkan shader commits without increasing repo size
...
added dpe colab (+8 squashed commit)
Squashed commit:
[b8362da4] updated lite
[ed6c037d] move nsigma into the regular sampler stack
[ac5f61c6] relative filepath fixed
[05fe96ab] export template
[ed0a5a3e] nix_example.md: refactor (#1401 )
* nix_example.md: add override example
* nix_example.md: drop graphics example, already basic nixos knowledge
* nix_example.md: format
* nix_example.md: Vulkan is disabled on macOS
Disabled in: 1ccd253acc
* nix_examples.md: nixpkgs.config.cuda{Arches -> Capabilities}
Fixes: https://github.com/LostRuins/koboldcpp/issues/1367
[675c62f7] AutoGuess: Phi 4 (mini) (#1402 )
[4bf56982 ] phrasing
[b8c0df04 ] Add Rep Pen to Top N Sigma sampler chain (#1397 )
- place after nsigma and before xtc (+3 squashed commit)
Squashed commit:
[87c52b97 ] disable VMM from HIP
[ee8906f3 ] edit description
[e85c0e69 ] Remove Unnecessary Rep Counting (#1394 )
* stop counting reps
* fix range-based initializer
* strike that - reverse it
2025-03-05 00:02:20 +08:00
Concedo
f2ac10c014
added nsigma to lite
2025-02-21 15:11:24 +08:00
EquinoxPsychosis
2740af3660
add top n sigma sampler from llama.cpp ( #1384 )
...
* Add N Sigma Sampler
* update nsigma sampler chain
* xtc position fix
* remove stray newline
---------
Co-authored-by: CasualAutopsy <casual_autopsy@outlook.com>
2025-02-21 14:31:42 +08:00
Concedo
5f74ee3c3b
merge sd fix
2025-02-21 11:16:26 +08:00
Concedo
71016db617
remove tts audio caching
2025-02-12 11:37:43 +08:00
Concedo
bdade9be1b
ensure scale before rep pen
2025-02-09 16:59:56 +08:00
Concedo
72f0fdfe87
wip on hypervisor
2025-02-05 00:25:22 +08:00