Concedo
4eb3a95cbb
reenable LCM sampler
2024-03-05 17:39:21 +08:00
Concedo
ac43e0115c
Merge branch 'master' into concedo_experimental
...
# Conflicts:
# .devops/nix/package.nix
# README.md
# ggml-metal.m
# llama.cpp
# scripts/sync-ggml.last
# tests/test-backend-ops.cpp
2024-03-05 15:54:05 +08:00
Concedo
b67a906244
add stub for useless endpoints
2024-03-04 19:29:08 +08:00
Concedo
d4a12133e7
added SD samplers endpoint
2024-03-04 14:26:49 +08:00
Concedo
b8bc500a4b
updated readme and lite
2024-03-03 23:31:52 +08:00
Concedo
0c59c1ed90
allow specifying width and height
2024-03-03 15:44:15 +08:00
Concedo
fa1d8b8d95
updated lite
2024-03-02 22:33:02 +08:00
Concedo
fda905a36a
fixed unable to load config
2024-03-02 18:08:45 +08:00
Concedo
e1b213ae96
increase steps limit
2024-03-02 12:08:19 +08:00
Concedo
59c5448ac8
fixed colab (+1 squashed commits)
...
Squashed commits:
[1d1c686f] updated colab and docs
2024-03-02 10:09:07 +08:00
Concedo
0978134f65
fix macos tunnel
2024-03-02 02:03:13 +08:00
Concedo
040de7d899
try add tunnels for macos
2024-03-01 17:52:09 +08:00
Concedo
55af5446ad
Merge branch 'master' into concedo_experimental
...
# Conflicts:
# README.md
# ci/run.sh
# llama.cpp
# scripts/sync-ggml.last
2024-03-01 17:41:37 +08:00
Concedo
e5861e993d
fix benchmark
2024-03-01 16:54:25 +08:00
Concedo
80011ed8aa
KCPP SD: add warn and step restriction, updated lite, handle quant mode
2024-03-01 16:41:19 +08:00
Concedo
3463688a0e
image generation is fully working over api (+1 squashed commits)
...
Squashed commits:
[c98ab0b4] single image generation is working now
2024-03-01 14:43:44 +08:00
Concedo
e8f4d7b3da
added model and config endpoints for sdcpp, added more samplers. speed is still not good
2024-02-29 22:56:09 +08:00
Concedo
5a44d4de2b
refactor and clean identifiers for sd, fix cmake
2024-02-29 18:28:45 +08:00
Concedo
66134bb36e
ui for loading SD models done
2024-02-29 17:08:22 +08:00
Concedo
524ba12abd
refactor - do not use a copy buffer to store generation outputs, instead return a cpp allocated ptr
2024-02-29 14:02:20 +08:00
Concedo
f75e479db0
WIP on sdcpp integration
2024-02-29 00:40:07 +08:00
Concedo
ad638285de
Merge branch 'master' into concedo_experimental
...
# Conflicts:
# Makefile
# README.md
# flake.lock
# ggml-cuda.cu
# llama.cpp
# tests/test-backend-ops.cpp
# tests/test-quantize-fns.cpp
2024-02-28 13:41:35 +08:00
Concedo
71898cf728
unlock custom contextsize
2024-02-27 18:10:43 +08:00
Concedo
39ae58ef0d
fix tooltip glitch
2024-02-26 11:35:58 +08:00
YellowRoseCx
7b85917827
add additional tooltips (#710)
2024-02-26 11:15:57 +08:00
Concedo
a6ba735b07
up version for 1.59.1 makefile changes
2024-02-26 10:40:12 +08:00
Concedo
a6aff3fba0
fix typo
2024-02-25 19:40:40 +08:00
Concedo
1bcbd2e21b
updated lite
2024-02-24 17:59:44 +08:00
Concedo
f3a0e05d91
added noavx2 vulkan
2024-02-22 16:56:25 +08:00
Concedo
2d71256d21
try to make prints flush
2024-02-21 17:16:49 +08:00
Concedo
6181b46eef
added nocertify mode
2024-02-19 16:05:17 +08:00
Concedo
db0834593b
hide smartcontext toggle when contextshift toggle is on
2024-02-18 14:09:07 +08:00
Concedo
e8e86ecf9f
fixed SSL not working with streaming
2024-02-16 17:04:07 +08:00
Concedo
7eccc5ffa6
change listen count, fix null
2024-02-16 16:01:24 +08:00
Concedo
39f8cbd1f3
send done for textcompletions too
2024-02-13 23:39:00 +08:00
Concedo
99b7cf7d1c
stream switch to LF newline
2024-02-13 23:12:03 +08:00
Concedo
3cec37c2e0
Merge branch 'master' into concedo_experimental
...
# Conflicts:
# .flake8
# .github/workflows/python-lint.yml
# flake.lock
# ggml-cuda.cu
# ggml-quants.c
# llama.cpp
# pocs/vdot/q8dot.cpp
# pocs/vdot/vdot.cpp
# tests/test-quantize-fns.cpp
# tests/test-quantize-perf.cpp
2024-02-13 00:14:22 +08:00
Concedo
603fe941c1
increase cloudflared check size
2024-02-12 17:19:58 +08:00
Concedo
6f3196ad8e
fix benchmark line
2024-02-10 21:49:14 +08:00
Concedo
c3d1a7d123
benchmark coherence fix
2024-02-09 19:03:48 +08:00
Concedo
35111ce01a
row split mode is now a toggle
2024-02-09 18:35:58 +08:00
Concedo
d1aff0e964
benchmark only save under 1mb
2024-02-09 15:40:29 +08:00
Concedo
992eea71d7
fixes for vulkan multigpu
2024-02-09 14:42:27 +08:00
Concedo
fe424a5466
tensor split active text
2024-02-09 12:02:23 +08:00
Concedo
4cd571db89
vulkan multigpu, show uptime
2024-02-08 16:54:38 +08:00
Concedo
de7be2f4e0
benchmarker done
2024-02-07 22:04:53 +08:00
Concedo
5cd9b1d23a
placeholder for benchmark
2024-02-06 21:48:07 +08:00
Concedo
f43667f499
runmode untouched fix
2024-02-05 21:52:33 +08:00
Concedo
330921db15
runmode untouched fix
2024-02-05 20:26:08 +08:00
Alexander Abushady
4cb956c7db
Quadratic Sampling UI (#652)
...
* Quadratic Sampling UI
Kalomaze's Quadratic Sampling now has a UI within KCPP.
* remove debug prints
* cleanup, add smooth sampler to dynatemp
---------
Co-authored-by: Concedo <39025047+LostRuins@users.noreply.github.com>
2024-02-04 16:26:27 +08:00