Concedo
3667cc0113
fixed stableui btn (+4 squashed commit)
...
Squashed commit:
[1d4714f1] update default amount to gen
[6eacba33] updated lite
[033589af] added first ver sdui
[16f66d57] updated lite
2024-05-06 00:55:16 +08:00
Concedo
a681cdd9ef
Merge branch 'upstream' into concedo_experimental
...
# Conflicts:
# common/sampling.h
# llama.h
# tests/test-chat-template.cpp
2024-04-24 21:29:07 +08:00
Concedo
06e3a6f36e
test workflow (+9 squashed commit)
...
Squashed commit:
[3d1fedab] test workflow
[c26d3a50] test workflow
[70e84f54] test workflow
[3383d040] workflow test
[2262b3c6] workflow test
[cd335d5a] workflow test
[bdbbfaeb] workflow test
[8e9fed4c] testing workflow
[e5b90d66] workflow test
2024-04-11 23:20:08 +08:00
Concedo
5c323a0661
fixed img2img for different sizes
2024-04-08 23:29:46 +08:00
Concedo
1aff35524d
fixed compile issues for ci
2024-04-08 20:32:31 +08:00
Concedo
1ee5f355d4
try fix some compile issues (+1 squashed commits)
...
Squashed commits:
[e920e76b] try fix some compile issues
2024-04-08 20:01:46 +08:00
Concedo
125f84aa02
fixed compiler warnings
2024-04-08 16:40:55 +08:00
Concedo
a530afa1e4
Merge commit ' 280345968d' into concedo_experimental
...
# Conflicts:
# .devops/full-cuda.Dockerfile
# .devops/llama-cpp-cuda.srpm.spec
# .devops/main-cuda.Dockerfile
# .devops/nix/package.nix
# .devops/server-cuda.Dockerfile
# .github/workflows/build.yml
# CMakeLists.txt
# Makefile
# README.md
# ci/run.sh
# docs/token_generation_performance_tips.md
# flake.lock
# llama.cpp
# scripts/LlamaConfig.cmake.in
# scripts/compare-commits.sh
# scripts/server-llm.sh
# tests/test-quantize-fns.cpp
2024-04-07 20:27:17 +08:00
Concedo
0061299cce
fixed quant tools not compiling, updated docs
2024-04-06 23:11:05 +08:00
Concedo
79c8e87922
remove constraint for img dimension
2024-04-06 19:58:58 +08:00
Concedo
743687020d
fixed img2img
2024-04-06 17:29:44 +08:00
Concedo
942fb4b413
fixed removed ref (+1 squashed commits)
...
Squashed commits:
[93f3c270] fixed removed ref (+1 squashed commits)
Squashed commits:
[df361250] remove some files
2024-03-19 19:33:56 +08:00
Concedo
7968bdebbb
added more stats in perf
2024-03-16 16:53:48 +08:00
Concedo
88705cb89a
improve quiet mode for SD
2024-03-12 20:50:39 +08:00
Concedo
6a32c14e86
Merge branch 'master' into concedo_experimental
...
# Conflicts:
# .github/workflows/build.yml
# CMakeLists.txt
# Makefile
# README-sycl.md
# README.md
# flake.lock
# scripts/sync-ggml-am.sh
# scripts/sync-ggml.last
# scripts/sync-ggml.sh
# tests/.gitignore
# tests/test-backend-ops.cpp
2024-03-11 23:00:47 +08:00
Concedo
6990d07a26
tweak sd logging, show progress normally
2024-03-10 11:45:11 +08:00
Concedo
c08d7e5042
wip integration of llava
2024-03-10 11:18:47 +08:00
Concedo
ca19199bc8
prevent sd logging when in quiet mode (+1 squashed commits)
...
Squashed commits:
[a4a1cdd5] fixed type conversion
2024-03-09 16:37:51 +08:00
Concedo
3f475970fa
quiet mode for sd
2024-03-08 19:35:29 +08:00
Concedo
2132bf9ca0
added img sampler aliases
2024-03-08 18:53:34 +08:00
Concedo
d910f2354c
bugfixes
2024-03-05 19:16:54 +08:00
Concedo
4eb3a95cbb
reenable LCM sampler
2024-03-05 17:39:21 +08:00
Concedo
ac43e0115c
Merge branch 'master' into concedo_experimental
...
# Conflicts:
# .devops/nix/package.nix
# README.md
# ggml-metal.m
# llama.cpp
# scripts/sync-ggml.last
# tests/test-backend-ops.cpp
2024-03-05 15:54:05 +08:00
Concedo
c952b4f192
Revert "merge missing functions from sdcpp"
...
This reverts commit 19e1c518f1 .
2024-03-05 15:38:51 +08:00
Concedo
0c59c1ed90
allow specifying width and height
2024-03-03 15:44:15 +08:00
Concedo
e53d21d748
sanitize SD prompt to avoid segfault
2024-03-02 12:05:59 +08:00
Concedo
2d9a90b652
try to fix ci compile errors (+1 squashed commits)
...
Squashed commits:
[d0d49663] fixed log multiline (+1 squashed commits)
Squashed commits:
[81a8befe] try to fix linux build error (+1 squashed commits)
Squashed commits:
[22850dda] try to fix build (+1 squashed commits)
Squashed commits:
[b8294611] missing type
2024-03-01 23:38:15 +08:00
Concedo
80011ed8aa
KCPP SD: add warn and step restriction., updated lite, handle quant mode
2024-03-01 16:41:19 +08:00
Concedo
3463688a0e
image generation is fully working over api (+1 squashed commits)
...
Squashed commits:
[c98ab0b4] single image generation is working now
2024-03-01 14:43:44 +08:00
Concedo
e8f4d7b3da
added model and config endpoints for sdcpp, added more samplers. speed is still not good
2024-02-29 22:56:09 +08:00
bebopkim
257015bb94
Resolve Metal compilation errors for sdcpp ( #720 )
2024-02-29 20:15:45 +08:00
Concedo
5a44d4de2b
refactor and clean identifiers for sd, fix cmake
2024-02-29 18:28:45 +08:00
Concedo
66134bb36e
ui for loading SD models done
2024-02-29 17:08:22 +08:00
Concedo
524ba12abd
refactor - do not use a copy buffer to store generation outputs, instead return a cpp allocated ptr
2024-02-29 14:02:20 +08:00
Concedo
f75e479db0
WIP on sdcpp integration
2024-02-29 00:40:07 +08:00
Concedo
8a919daafb
add as library to makefile
2024-02-28 17:45:00 +08:00
Concedo
17355faf6e
sdcpp is working!
2024-02-28 17:00:18 +08:00
Concedo
19e1c518f1
merge missing functions from sdcpp
2024-02-28 16:38:00 +08:00
Concedo
26696970ce
initial files from sdcpp (not working)
2024-02-28 15:45:13 +08:00
Concedo
ad638285de
Merge branch 'master' into concedo_experimental
...
# Conflicts:
# Makefile
# README.md
# flake.lock
# ggml-cuda.cu
# llama.cpp
# tests/test-backend-ops.cpp
# tests/test-quantize-fns.cpp
2024-02-28 13:41:35 +08:00
Concedo
7eccc5ffa6
change listen count, fix null
2024-02-16 16:01:24 +08:00
Concedo
f9bc7245ab
b64 decoder
2024-02-11 20:35:34 +08:00
Concedo
acb792815e
try fix cuda slowdown
2024-02-05 16:34:15 +08:00
Concedo
35c32fd0f2
refactor some old code with batching
2024-02-05 15:54:45 +08:00
Concedo
5639c1a520
units (+2 squashed commit)
...
Squashed commit:
[166979d9] units coversion
[038dd5d4] get rid of all warnings (+1 squashed commits)
Squashed commits:
[6efd1e1b] get rid of all warnings
2024-01-20 23:53:21 +08:00
Concedo
71e9a64171
Merge branch 'master' into concedo_experimental
...
# Conflicts:
# .github/workflows/nix-ci.yml
# CMakeLists.txt
# Makefile
# ggml-cuda.cu
# ggml-opencl.cpp
# llama.cpp
2024-01-20 23:27:42 +08:00
Concedo
a137b6b9ff
fixed typo
2024-01-20 17:28:14 +08:00
Concedo
680a41ed71
refactor identifiers
2024-01-20 17:26:11 +08:00
Concedo
693f3f0b00
try to use allocator for cuda ggml v3
2024-01-20 12:53:31 +08:00
Concedo
97693e7e97
increase pool buffers
2024-01-20 11:52:39 +08:00