Commit graph

539 commits

Author SHA1 Message Date
Concedo
fac6373b13 fix tools 2024-05-24 23:50:08 +08:00
Concedo
09adfa70ad limit default threads to max 8 to deal with ecores 2024-05-22 14:47:57 +08:00
Concedo
618e60c279 model download if its a url 2024-05-21 18:56:11 +08:00
Concedo
2cbf39cba2 disable ui resize on macos 2024-05-17 15:56:10 +08:00
Concedo
1db3421c52 multiple minor fixes 2024-05-17 15:47:53 +08:00
Concedo
6d9d846bdd prevent mixing lora and quant 2024-05-16 00:29:03 +08:00
Concedo
08993696c3 try apply lora on load 2024-05-15 22:53:23 +08:00
Concedo
44443edfda rep pen slope works (+1 squashed commits)
Squashed commits:

[535ad566] experiment with rep pen range
2024-05-15 17:20:57 +08:00
Concedo
5ce2fdad24 taesd for sdxl, add lora loading done 2024-05-14 23:02:56 +08:00
Concedo
2ee808a747 Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.github/workflows/build.yml
#	CMakeLists.txt
#	README.md
#	ci/run.sh
#	llama.cpp
#	models/ggml-vocab-llama-bpe.gguf.inp
#	models/ggml-vocab-llama-bpe.gguf.out
#	requirements.txt
#	scripts/compare-llama-bench.py
#	scripts/sync-ggml.last
#	tests/CMakeLists.txt
#	tests/test-backend-ops.cpp
#	tests/test-grammar-integration.cpp
#	tests/test-tokenizer-1-bpe.cpp
2024-05-14 19:28:47 +08:00
Concedo
5d15f8f76a vae test 2024-05-14 19:17:01 +08:00
Concedo
4807b66907 wip sd 2024-05-13 23:23:16 +08:00
Concedo
bd95ee7d9a temporary version for archiving 2024-05-13 21:53:58 +08:00
Concedo
d8a52321da ditched the coherent flag 2024-05-13 20:38:51 +08:00
Concedo
f4746572d9 wildcare sdui url 2024-05-12 11:09:59 +08:00
Concedo
eff01660e4 re-added smart context due to people complaining 2024-05-11 17:25:03 +08:00
Concedo
702be65ed1 don't show embedded sdui if no model 2024-05-11 08:56:56 +08:00
Concedo
1effe16861 fixed horde worker flag 2024-05-11 01:17:04 +08:00
Concedo
7967377ebc fix for sdui showing when sdmodel not loaded, and not showing when remote tunnel is used. 2024-05-10 23:40:20 +08:00
Concedo
69570daf31 tidy argparse 2024-05-10 17:28:08 +08:00
Concedo
dbe72b959e tidy up and refactor code to support old flags 2024-05-10 16:50:53 +08:00
Concedo
eccc2ddca2 better warnings 2024-05-10 11:27:40 +08:00
Concedo
6f23ca24fb deprecated some old flags 2024-05-10 10:57:52 +08:00
Concedo
868446bd1a replace sdconfig and hordeconfig 2024-05-09 22:43:50 +08:00
Concedo
8730881530 try to fix resize window issues 2024-05-09 14:45:54 +08:00
Concedo
62ea3eee4a announce sdui url 2024-05-06 18:15:34 +08:00
Concedo
6c000cbe7a Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.flake8
#	.github/workflows/bench.yml
#	.github/workflows/python-lint.yml
#	.pre-commit-config.yaml
#	Makefile
#	README.md
#	models/ggml-vocab-bert-bge.gguf.inp
#	models/ggml-vocab-bert-bge.gguf.out
#	models/ggml-vocab-deepseek-coder.gguf.inp
#	models/ggml-vocab-deepseek-coder.gguf.out
#	models/ggml-vocab-deepseek-llm.gguf.inp
#	models/ggml-vocab-deepseek-llm.gguf.out
#	models/ggml-vocab-falcon.gguf.inp
#	models/ggml-vocab-falcon.gguf.out
#	models/ggml-vocab-gpt-2.gguf.inp
#	models/ggml-vocab-gpt-2.gguf.out
#	models/ggml-vocab-llama-bpe.gguf.inp
#	models/ggml-vocab-llama-bpe.gguf.out
#	models/ggml-vocab-llama-spm.gguf.inp
#	models/ggml-vocab-llama-spm.gguf.out
#	models/ggml-vocab-mpt.gguf.inp
#	models/ggml-vocab-mpt.gguf.out
#	models/ggml-vocab-phi-3.gguf
#	models/ggml-vocab-phi-3.gguf.inp
#	models/ggml-vocab-phi-3.gguf.out
#	models/ggml-vocab-refact.gguf
#	models/ggml-vocab-starcoder.gguf.inp
#	models/ggml-vocab-starcoder.gguf.out
#	requirements/requirements-convert.txt
#	scripts/compare-llama-bench.py
#	scripts/run-with-preset.py
#	scripts/verify-checksum-models.py
#	tests/CMakeLists.txt
#	tests/test-tokenizer-0.cpp
2024-05-06 18:09:45 +08:00
Concedo
173c7272d5 EOS bypass mode added 2024-05-06 18:01:49 +08:00
Concedo
3667cc0113 fixed stableui btn (+4 squashed commit)
Squashed commit:

[1d4714f1] update default amount to gen

[6eacba33] updated lite

[033589af] added first ver sdui

[16f66d57] updated lite
2024-05-06 00:55:16 +08:00
Concedo
0c381f9ded increase interrogate length 2024-05-05 00:40:49 +08:00
Concedo
5ca267dc9c remove unnecessary prints 2024-05-04 23:28:21 +08:00
Concedo
a3718c6354 1.64.1 to fix llava issues 2024-05-04 10:38:20 +08:00
Concedo
a34a09d196 replace destroy with quit for tk 2024-05-03 15:57:13 +08:00
Concedo
4c5d307f59 fixed benchmark interrupt (+2 squashed commit)
Squashed commit:

[6e334c8b] require enter key to be pressed

[d50d49b6] fixed bench script
2024-05-02 23:22:47 +08:00
Concedo
0d8c4a9b73 remove quick lowvram option 2024-05-02 14:21:44 +08:00
Concedo
fb7e72352e benchmark includes ver 2024-05-02 14:17:48 +08:00
Concedo
e9978bfac0 resize window dimensions 2024-05-01 17:38:49 +08:00
Concedo
c65448d17a add flash attention toggle 2024-04-30 21:29:11 +08:00
Concedo
f6ab0f873f gui benchmark 2024-04-28 11:42:26 +08:00
Concedo
c230b78906 refactored a lot of code, remove bantokens, move it to api 2024-04-27 17:57:13 +08:00
Concedo
4ec8a9c57b expose stop reason in generation 2024-04-27 01:12:12 +08:00
Concedo
327682fb97 undeprecate openblas, since it seems like there are still some valid cases for it 2024-04-26 20:21:29 +08:00
Concedo
544c36f751 merge, deprecate openblas 2024-04-26 19:24:59 +08:00
Concedo
0871c7cbd1 Add additional debug info and increased ctx sizes, fixed a bug loading vulkan config 2024-04-25 23:07:37 +08:00
Concedo
15ed96c25a added support for resizing the window 2024-04-24 19:35:35 +08:00
Concedo
65bf69d104 Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	.github/workflows/bench.yml
#	.github/workflows/server.yml
#	CMakeLists.txt
#	Makefile
#	README-sycl.md
#	flake.lock
2024-04-23 22:37:42 +08:00
Concedo
f449da3ba9 add additional warning for skiplauncher 2024-04-23 18:26:16 +08:00
Concedo
b4d2031215 merged, added ability to render special tokens 2024-04-22 18:19:58 +08:00
Concedo
a7b79ed2d7 smart buffered stop sequence workaround for SSE streaming mode. 2024-04-22 15:22:44 +08:00
Concedo
22c49e6b1e Merge branch 'upstream' into concedo_experimental
# Conflicts:
#	CMakeLists.txt
#	Makefile
#	README-sycl.md
#	README.md
#	scripts/compare-commits.sh
#	tests/test-backend-ops.cpp
2024-04-19 10:35:50 +08:00