Commit graph

451 commits

Author SHA1 Message Date
Concedo
793cfd136c fixed 70B detection again, try fix horde issues, fixed lite unicode issue, fixed cmake for cuda 2023-08-09 01:05:00 +08:00
Concedo
cae6a847ad cuda free only for non mmq (+2 squashed commit)
Squashed commit:

[3aca763a] only cuda free for non mmq

[e69a8c9f] revert to pool alloc to try again
2023-08-07 17:12:05 +08:00
Concedo
e99416cdfe blasbatchsize 2023-08-06 17:47:59 +08:00
Concedo
bcfdd0e662 fixed bbs -1 and allow bbs = 2048 2023-08-06 17:47:05 +08:00
Concedo
18bb0ab127 up ver, support 16k ctx 2023-08-04 21:47:17 +08:00
Concedo
f0764c6cfb fix indentation, increase server thread count 2023-08-04 10:29:56 +08:00
Concedo
d09e54aad1 Merge remote-tracking branch 'duncan/api-stream-double-write-fix' into concedo_experimental 2023-08-04 10:22:53 +08:00
duncannah
63ec711a70
fix: still send full result after streaming 2023-08-03 14:35:43 +02:00
Concedo
4709545c06 Merge remote-tracking branch 'duncan/api-stream-double-write-fix' into concedo_experimental 2023-08-03 12:52:43 +08:00
Concedo
ba2040d1df compile fix for ARM NEON 2023-08-03 12:52:06 +08:00
duncannah
9281c2801f
fix: don't send headers twice when streaming 2023-08-02 23:42:43 +02:00
Concedo
46682e5cb3 added mmq launch flag 2023-08-01 17:57:13 +08:00
Concedo
84ce184c4f layout 2023-07-31 17:33:31 +08:00
YellowRoseCx
f27972777f
correct semantic error in import_vars (#355)
* Hide unavailable backends & Add tooltip over backend count

Hides unavailable backends from the user and if the program is launched without any backends made, it shows an error message to them stating no backends were found and to make them using the 'make' command

Add tooltip when hovering over backend count label

hovering over the new label that shows the backend count will explain what the numbers are, and show the users which backends are not available or built

* add some code comments

* hide "missing" if all are built

move tooltip functions to helper functions section. hides the string "Missing: ..." from showing if all backends are available
" if len(runopts)==6 else + "

* small typo fix

* remove wrongly added leftover device choosing code

* fix labels

* move tooltip to function

* import vars logic fix

---------

Co-authored-by: Concedo <39025047+LostRuins@users.noreply.github.com>
2023-07-31 15:51:35 +08:00
Concedo
5ea5d19d6a SSE emoji fix 2023-07-30 22:31:20 +08:00
Concedo
90a37d63d5 up ver, added warning for max context 2023-07-30 18:07:14 +08:00
YellowRoseCx
c8af65760f
Hide unavailable backends & Add tooltip over backend count (#352)
* Hide unavailable backends & Add tooltip over backend count

Hides unavailable backends from the user and if the program is launched without any backends made, it shows an error message to them stating no backends were found and to make them using the 'make' command

Add tooltip when hovering over backend count label

hovering over the new label that shows the backend count will explain what the numbers are, and show the users which backends are not available or built

* add some code comments

* hide "missing" if all are built

move tooltip functions to helper functions section. hides the string "Missing: ..." from showing if all backends are available
" if len(runopts)==6 else + "

* small typo fix

* remove wrongly added leftover device choosing code

* fix labels

* move tooltip to function

---------

Co-authored-by: Concedo <39025047+LostRuins@users.noreply.github.com>
2023-07-30 17:50:55 +08:00
Concedo
45456fa6ca switch noavx2 to not use openblas, as it has incompatible instructions 2023-07-30 16:47:33 +08:00
Concedo
23825abee1 fix wrong key 2023-07-30 14:30:46 +08:00
Concedo
9589d52079 added help link 2023-07-29 17:33:15 +08:00
Concedo
e4b42e5b15 fixed gui bugs 2023-07-29 11:15:57 +08:00
Concedo
94e0a06daf updated lite, up ver (+1 squashed commits)
Squashed commits:

[7d6520f] updated lite, up ver
2023-07-26 11:03:17 +08:00
Concedo
66e4b5141e fix horde worker host and client agent 2023-07-25 18:18:41 +08:00
Concedo
d8d2449bfb better label (+1 squashed commits)
Squashed commits:

[f573b2c] cuda 3 target arch
2023-07-24 23:07:31 +08:00
Concedo
7555dae4cc ditch advanced subparsers 2023-07-24 22:40:36 +08:00
Concedo
825e34baa3 default horde name and better handling for horde (+3 squashed commit)
Squashed commit:

[fadfa60] better idle handling for horde worker

[a3971e6] updated lite

[2ca2b79] seems to not generate rubbish
2023-07-24 18:41:41 +08:00
Concedo
c7136f03d9 added support for tensor_split parameter as an advanced parameter. 2023-07-24 17:16:19 +08:00
Concedo
280abaf029 added stop reason in the perf endpoint 2023-07-24 11:55:35 +08:00
Concedo
0cca0726fe reduce number of retries, fixed maxlength > maxctx bug 2023-07-23 09:59:34 +08:00
Concedo
fa0270df7c added some checks to skip generation if busy 2023-07-22 23:10:04 +08:00
Concedo
2807d98fd4 touchup (+2 squashed commit)
Squashed commit:

[8b06458] fixed broken param order

[7eabdc0] very broken, do not use
2023-07-22 22:57:56 +08:00
Concedo
3aec3038d4 bump scratch buffers 2023-07-22 18:12:18 +08:00
Concedo
52c5856a08 auto populate horde model name 2023-07-22 16:03:12 +08:00
Concedo
dd3f8dabed updated cluster to horde.koboldai.net 2023-07-22 12:42:40 +08:00
Concedo
236d0e8955 add tip about using other workers 2023-07-22 12:29:22 +08:00
Concedo
701bf0a6cd reduce sleep time between jobs 2023-07-22 11:56:43 +08:00
Concedo
52c98228aa bugfixes for missing params 2023-07-22 11:37:44 +08:00
Concedo
d7ab6adbc1 embedded horde worker is ready 2023-07-22 11:21:32 +08:00
Concedo
75064b4ada wip on embedded horde worker 2023-07-22 01:30:25 +08:00
Concedo
e85557f798 launcher for rope 2023-07-20 17:45:50 +08:00
Concedo
39dc1a46c4 added token count, updated lite 2023-07-20 14:41:06 +08:00
Concedo
2a88d6d3ec Merge remote-tracking branch 'ycros/api-modelbusy-fix' into concedo_experimental 2023-07-19 18:32:13 +08:00
Concedo
e9467f5a44 auto rope scale adjustments, added sched yield fix for apple, adjust warning for mirostat 2023-07-19 16:44:44 +08:00
Concedo
374fffb9c6 Reworking rope WIP 2023-07-19 00:54:41 +08:00
Concedo
0a11f50da8 reenabled sched_yield, reduced sampler warning msg to once per session 2023-07-18 20:26:18 +08:00
Concedo
6d32e7fc8b Merge commit 'a6803cab94' into concedo_experimental
# Conflicts:
#	.devops/tools.sh
#	Makefile
#	build.zig
#	flake.nix
#	ggml-cuda.cu
#	ggml.h
#	tests/test-grad0.c
#	tests/test-opt.c
2023-07-18 19:12:06 +08:00
Ycros
fd90d52127 API: Replace modelbusy bool with a lock.
- Also remove duplicate Content-Type header on streams responses.
2023-07-18 20:09:50 +10:00
Concedo
1d1111e10f expose timing info in web api 2023-07-11 18:56:06 +08:00
Concedo
7222877069 Merge remote-tracking branch 'ren/concedo' into concedo_experimental 2023-07-11 18:45:36 +08:00
Concedo
5ca204d527 Merge remote-tracking branch 'yellowrose/pr/open/LostRuins/koboldcpp/multigpu-cuda-gui' into concedo_experimental
# Conflicts:
#	koboldcpp.py
2023-07-11 18:22:54 +08:00