Concedo
|
6c000cbe7a
|
Merge branch 'upstream' into concedo_experimental
# Conflicts:
# .flake8
# .github/workflows/bench.yml
# .github/workflows/python-lint.yml
# .pre-commit-config.yaml
# Makefile
# README.md
# models/ggml-vocab-bert-bge.gguf.inp
# models/ggml-vocab-bert-bge.gguf.out
# models/ggml-vocab-deepseek-coder.gguf.inp
# models/ggml-vocab-deepseek-coder.gguf.out
# models/ggml-vocab-deepseek-llm.gguf.inp
# models/ggml-vocab-deepseek-llm.gguf.out
# models/ggml-vocab-falcon.gguf.inp
# models/ggml-vocab-falcon.gguf.out
# models/ggml-vocab-gpt-2.gguf.inp
# models/ggml-vocab-gpt-2.gguf.out
# models/ggml-vocab-llama-bpe.gguf.inp
# models/ggml-vocab-llama-bpe.gguf.out
# models/ggml-vocab-llama-spm.gguf.inp
# models/ggml-vocab-llama-spm.gguf.out
# models/ggml-vocab-mpt.gguf.inp
# models/ggml-vocab-mpt.gguf.out
# models/ggml-vocab-phi-3.gguf
# models/ggml-vocab-phi-3.gguf.inp
# models/ggml-vocab-phi-3.gguf.out
# models/ggml-vocab-refact.gguf
# models/ggml-vocab-starcoder.gguf.inp
# models/ggml-vocab-starcoder.gguf.out
# requirements/requirements-convert.txt
# scripts/compare-llama-bench.py
# scripts/run-with-preset.py
# scripts/verify-checksum-models.py
# tests/CMakeLists.txt
# tests/test-tokenizer-0.cpp
|
2024-05-06 18:09:45 +08:00 |
|
Concedo
|
173c7272d5
|
EOS bypass mode added
|
2024-05-06 18:01:49 +08:00 |
|
Concedo
|
3667cc0113
|
fixed stableui btn (+4 squashed commit)
Squashed commit:
[1d4714f1] update default amount to gen
[6eacba33] updated lite
[033589af] added first ver sdui
[16f66d57] updated lite
|
2024-05-06 00:55:16 +08:00 |
|
Concedo
|
0c381f9ded
|
increase interrogate length
|
2024-05-05 00:40:49 +08:00 |
|
Concedo
|
5ca267dc9c
|
remove unnecessary prints
|
2024-05-04 23:28:21 +08:00 |
|
Concedo
|
a3718c6354
|
1.64.1 to fix llava issues
|
2024-05-04 10:38:20 +08:00 |
|
Concedo
|
a34a09d196
|
replace destroy with quit for tk
|
2024-05-03 15:57:13 +08:00 |
|
Concedo
|
4c5d307f59
|
fixed benchmark interrupt (+2 squashed commit)
Squashed commit:
[6e334c8b] require enter key to be pressed
[d50d49b6] fixed bench script
|
2024-05-02 23:22:47 +08:00 |
|
Concedo
|
0d8c4a9b73
|
remove quick lowvram option
|
2024-05-02 14:21:44 +08:00 |
|
Concedo
|
fb7e72352e
|
benchmark includes ver
|
2024-05-02 14:17:48 +08:00 |
|
Concedo
|
e9978bfac0
|
resize window dimensions
|
2024-05-01 17:38:49 +08:00 |
|
Concedo
|
c65448d17a
|
add flash attention toggle
|
2024-04-30 21:29:11 +08:00 |
|
Concedo
|
f6ab0f873f
|
gui benchmark
|
2024-04-28 11:42:26 +08:00 |
|
Concedo
|
c230b78906
|
refactored a lot of code, remove bantokens, move it to api
|
2024-04-27 17:57:13 +08:00 |
|
Concedo
|
4ec8a9c57b
|
expose stop reason in generation
|
2024-04-27 01:12:12 +08:00 |
|
Concedo
|
327682fb97
|
undeprecate openblas, since it seems like there are still some valid cases for it
|
2024-04-26 20:21:29 +08:00 |
|
Concedo
|
544c36f751
|
merge, deprecate openblas
|
2024-04-26 19:24:59 +08:00 |
|
Concedo
|
0871c7cbd1
|
Add additional debug info and increased ctx sizes, fixed a bug loading vulkan config
|
2024-04-25 23:07:37 +08:00 |
|
Concedo
|
15ed96c25a
|
added support for resizing the window
|
2024-04-24 19:35:35 +08:00 |
|
Concedo
|
65bf69d104
|
Merge branch 'upstream' into concedo_experimental
# Conflicts:
# .github/workflows/bench.yml
# .github/workflows/server.yml
# CMakeLists.txt
# Makefile
# README-sycl.md
# flake.lock
|
2024-04-23 22:37:42 +08:00 |
|
Concedo
|
f449da3ba9
|
add additional warning for skiplauncher
|
2024-04-23 18:26:16 +08:00 |
|
Concedo
|
b4d2031215
|
merged, added ability to render special tokens
|
2024-04-22 18:19:58 +08:00 |
|
Concedo
|
a7b79ed2d7
|
smart buffered stop sequence workaround for SSE streaming mode.
|
2024-04-22 15:22:44 +08:00 |
|
Concedo
|
22c49e6b1e
|
Merge branch 'upstream' into concedo_experimental
# Conflicts:
# CMakeLists.txt
# Makefile
# README-sycl.md
# README.md
# scripts/compare-commits.sh
# tests/test-backend-ops.cpp
|
2024-04-19 10:35:50 +08:00 |
|
Concedo
|
b0d796fb49
|
use different cublas binaries
|
2024-04-17 17:14:22 +08:00 |
|
Concedo
|
eb380b500f
|
detect if other programs already using port
|
2024-04-15 15:54:03 +08:00 |
|
Concedo
|
010b2a5a71
|
fixed bad handling for malformed requests
|
2024-04-14 20:37:08 +08:00 |
|
Concedo
|
cfbe8cffbd
|
updated lite, added extra check in pyinstaller cleanup
|
2024-04-14 00:22:21 +08:00 |
|
Concedo
|
ab68447ad3
|
upver
|
2024-04-10 21:36:51 +08:00 |
|
Concedo
|
d8fb97211f
|
chat completions forced append stop sequences if already existing
|
2024-04-10 19:25:30 +08:00 |
|
Concedo
|
d1bb126605
|
Merge branch 'upstream' into concedo
# Conflicts:
# README.md
# llama.cpp
# otherarch/sdcpp/SDCPP_LICENSE
# scripts/sync-ggml-am.sh
# scripts/sync-ggml.sh
|
2024-04-09 17:18:35 +08:00 |
|
Concedo
|
d54af7fa31
|
updated swagger json link fix
|
2024-04-09 14:55:27 +08:00 |
|
Concedo
|
2bc11e00df
|
fixed stop sequence overwriting
|
2024-04-08 21:37:47 +08:00 |
|
Concedo
|
1ee5f355d4
|
try fix some compile issues (+1 squashed commits)
Squashed commits:
[e920e76b] try fix some compile issues
|
2024-04-08 20:01:46 +08:00 |
|
Concedo
|
fc881b4deb
|
fixed ssl cert path set if empty, added some basic cleanup of old temp dirs
|
2024-04-08 18:06:53 +08:00 |
|
Concedo
|
aa5124439d
|
horde workers pause themselves if recent local usage is detected (+1 squashed commits)
Squashed commits:
[7ebb80bc] horde workers pause themselves if recent local usage is detected
|
2024-04-07 23:22:50 +08:00 |
|
Concedo
|
6166fdfde4
|
added support for OAI chat completions adapter file, added default stop sequences to prevent chat compl leakage
|
2024-04-07 10:35:20 +08:00 |
|
Concedo
|
743687020d
|
fixed img2img
|
2024-04-06 17:29:44 +08:00 |
|
Concedo
|
073a279e70
|
change reference from kobold horde to ai horde
|
2024-03-18 22:35:49 +08:00 |
|
Concedo
|
5410e4644a
|
symlink docs
|
2024-03-17 22:27:26 +08:00 |
|
Concedo
|
9342071f9c
|
don't print url for localhost if remote tunnel
|
2024-03-16 22:19:04 +08:00 |
|
Concedo
|
7968bdebbb
|
added more stats in perf
|
2024-03-16 16:53:48 +08:00 |
|
Concedo
|
93d3871056
|
Merge branch 'upstream' into concedo_experimental
# Conflicts:
# .github/workflows/build.yml
# CMakeLists.txt
# Makefile
# ggml-metal.m
|
2024-03-15 10:37:48 +08:00 |
|
Concedo
|
f20fb7d778
|
mmq defaults to disabled only if full offload is possible
|
2024-03-14 23:34:45 +08:00 |
|
Concedo
|
f3b7651102
|
added ignoremissing param
|
2024-03-14 13:46:42 +08:00 |
|
Concedo
|
ec5dea14d7
|
merged, try to fix metal build
|
2024-03-14 11:15:50 +08:00 |
|
Concedo
|
ba950716a9
|
Merge branch 'master' into concedo_experimental
# Conflicts:
# CMakeLists.txt
# Makefile
# Package.swift
# README.md
# build.zig
# llama.cpp
# tests/test-tokenizer-1-bpe.cpp
# tests/test-tokenizer-1-llama.cpp
|
2024-03-13 11:21:58 +08:00 |
|
Concedo
|
edb05e761f
|
Update some prints
|
2024-03-12 21:40:36 +08:00 |
|
Concedo
|
6c6ad93f01
|
added basic support for password protection (+2 squashed commit)
Squashed commit:
[ff91ca72] added basic support for password protection
[91b0b208] updated docs
|
2024-03-12 19:47:12 +08:00 |
|
Concedo
|
a69bc44e7a
|
edit colab (+1 squashed commits)
Squashed commits:
[c7ccb99d] update colab with llava
|
2024-03-12 15:24:53 +08:00 |
|