Concedo
6a32c14e86
Merge branch 'master' into concedo_experimental
...
# Conflicts:
# .github/workflows/build.yml
# CMakeLists.txt
# Makefile
# README-sycl.md
# README.md
# flake.lock
# scripts/sync-ggml-am.sh
# scripts/sync-ggml.last
# scripts/sync-ggml.sh
# tests/.gitignore
# tests/test-backend-ops.cpp
2024-03-11 23:00:47 +08:00
Concedo
6990d07a26
tweak sd logging, show progress normally
2024-03-10 11:45:11 +08:00
Concedo
c08d7e5042
wip integration of llava
2024-03-10 11:18:47 +08:00
Concedo
ca19199bc8
prevent sd logging when in quiet mode (+1 squashed commits)
...
Squashed commits:
[a4a1cdd5] fixed type conversion
2024-03-09 16:37:51 +08:00
Concedo
3f475970fa
quiet mode for sd
2024-03-08 19:35:29 +08:00
Concedo
2132bf9ca0
added img sampler aliases
2024-03-08 18:53:34 +08:00
Concedo
d910f2354c
bugfixes
2024-03-05 19:16:54 +08:00
Concedo
4eb3a95cbb
reenable LCM sampler
2024-03-05 17:39:21 +08:00
Concedo
ac43e0115c
Merge branch 'master' into concedo_experimental
...
# Conflicts:
# .devops/nix/package.nix
# README.md
# ggml-metal.m
# llama.cpp
# scripts/sync-ggml.last
# tests/test-backend-ops.cpp
2024-03-05 15:54:05 +08:00
Concedo
c952b4f192
Revert "merge missing functions from sdcpp"
...
This reverts commit 19e1c518f1 .
2024-03-05 15:38:51 +08:00
Concedo
0c59c1ed90
allow specifying width and height
2024-03-03 15:44:15 +08:00
Concedo
e53d21d748
sanitize SD prompt to avoid segfault
2024-03-02 12:05:59 +08:00
Concedo
2d9a90b652
try to fix ci compile errors (+1 squashed commits)
...
Squashed commits:
[d0d49663] fixed log multiline (+1 squashed commits)
Squashed commits:
[81a8befe] try to fix linux build error (+1 squashed commits)
Squashed commits:
[22850dda] try to fix build (+1 squashed commits)
Squashed commits:
[b8294611] missing type
2024-03-01 23:38:15 +08:00
Concedo
80011ed8aa
KCPP SD: add warn and step restriction., updated lite, handle quant mode
2024-03-01 16:41:19 +08:00
Concedo
3463688a0e
image generation is fully working over api (+1 squashed commits)
...
Squashed commits:
[c98ab0b4] single image generation is working now
2024-03-01 14:43:44 +08:00
Concedo
e8f4d7b3da
added model and config endpoints for sdcpp, added more samplers. speed is still not good
2024-02-29 22:56:09 +08:00
bebopkim
257015bb94
Resolve Metal compilation errors for sdcpp ( #720 )
2024-02-29 20:15:45 +08:00
Concedo
5a44d4de2b
refactor and clean identifiers for sd, fix cmake
2024-02-29 18:28:45 +08:00
Concedo
66134bb36e
ui for loading SD models done
2024-02-29 17:08:22 +08:00
Concedo
524ba12abd
refactor - do not use a copy buffer to store generation outputs, instead return a cpp allocated ptr
2024-02-29 14:02:20 +08:00
Concedo
f75e479db0
WIP on sdcpp integration
2024-02-29 00:40:07 +08:00
Concedo
8a919daafb
add as library to makefile
2024-02-28 17:45:00 +08:00
Concedo
17355faf6e
sdcpp is working!
2024-02-28 17:00:18 +08:00
Concedo
19e1c518f1
merge missing functions from sdcpp
2024-02-28 16:38:00 +08:00
Concedo
26696970ce
initial files from sdcpp (not working)
2024-02-28 15:45:13 +08:00
Concedo
ad638285de
Merge branch 'master' into concedo_experimental
...
# Conflicts:
# Makefile
# README.md
# flake.lock
# ggml-cuda.cu
# llama.cpp
# tests/test-backend-ops.cpp
# tests/test-quantize-fns.cpp
2024-02-28 13:41:35 +08:00
Concedo
7eccc5ffa6
change listen count, fix null
2024-02-16 16:01:24 +08:00
Concedo
f9bc7245ab
b64 decoder
2024-02-11 20:35:34 +08:00
Concedo
acb792815e
try fix cuda slowdown
2024-02-05 16:34:15 +08:00
Concedo
35c32fd0f2
refactor some old code with batching
2024-02-05 15:54:45 +08:00
Concedo
5639c1a520
units (+2 squashed commit)
...
Squashed commit:
[166979d9] units coversion
[038dd5d4] get rid of all warnings (+1 squashed commits)
Squashed commits:
[6efd1e1b] get rid of all warnings
2024-01-20 23:53:21 +08:00
Concedo
71e9a64171
Merge branch 'master' into concedo_experimental
...
# Conflicts:
# .github/workflows/nix-ci.yml
# CMakeLists.txt
# Makefile
# ggml-cuda.cu
# ggml-opencl.cpp
# llama.cpp
2024-01-20 23:27:42 +08:00
Concedo
a137b6b9ff
fixed typo
2024-01-20 17:28:14 +08:00
Concedo
680a41ed71
refactor identifiers
2024-01-20 17:26:11 +08:00
Concedo
693f3f0b00
try to use allocator for cuda ggml v3
2024-01-20 12:53:31 +08:00
Concedo
97693e7e97
increase pool buffers
2024-01-20 11:52:39 +08:00
Concedo
21f0ce2502
do not use vmm for ggml v3
2024-01-20 11:30:22 +08:00
Concedo
db14de5c32
fossilize ggml library ver 3, to support ggjtv3
2024-01-20 10:49:25 +08:00
Concedo
230a638512
Merge branch 'master' into concedo_experimental
...
# Conflicts:
# .github/workflows/docker.yml
# CMakeLists.txt
# Makefile
# README.md
# llama.cpp
# tests/test-grad0.cpp
2023-12-22 14:40:13 +08:00
Concedo
76a3ba42eb
Merge branch 'master' into concedo_experimental
...
# Conflicts:
# ggml.c
# ggml.h
# requirements.txt
# tests/test-quantize-perf.cpp
2023-12-16 22:58:53 +08:00
Concedo
a6eb9b8010
Fix GPT2 not loading due to graph too small
2023-11-26 23:06:42 +08:00
Concedo
dc4078c039
fixed segfault with all non-gguf models
2023-11-20 22:31:56 +08:00
Concedo
45ad1b97f8
max nodes 8192
2023-11-18 11:02:35 +08:00
Concedo
35a97e14b2
Merge branch 'master' into concedo_experimental
...
# Conflicts:
# Makefile
# README.md
# docs/token_generation_performance_tips.md
# grammars/README.md
# scripts/sync-ggml.sh
# tests/CMakeLists.txt
# tests/test-grad0.cpp
# tests/test-opt.cpp
2023-11-15 16:59:53 +08:00
Concedo
8c14c81b33
hopefully this fixes the dotnet nonsense
2023-11-03 11:23:56 +08:00
Concedo
42eabf2f2f
rope fixes
2023-11-02 20:41:16 +08:00
Concedo
bc4ff72317
not working merge
2023-11-02 17:52:40 +08:00
Concedo
cc5b282350
Merge branch 'master' into concedo_experimental
...
# Conflicts:
# CMakeLists.txt
# Makefile
# build.zig
# flake.lock
# flake.nix
# ggml.c
2023-10-31 20:44:04 +08:00
Concedo
5db89b90b7
Merge branch 'master' into concedo_experimental
...
# Conflicts:
# .gitignore
# CMakeLists.txt
# Makefile
# README.md
# build.zig
# ggml-opencl.cpp
# tests/CMakeLists.txt
# tests/test-double-float.cpp
# tests/test-sampling.cpp
2023-10-25 23:58:15 +08:00
Concedo
c9983a72d6
prevent lora with clblast
2023-10-25 15:18:03 +08:00