Concedo
|
8a964e76c8
|
integrated mirostat as a launch parameter, works on all models
|
2023-05-06 00:47:17 +08:00 |
|
Concedo
|
851f55325a
|
Merge remote-tracking branch 'temp/concedo' into concedo_experimental
|
2023-05-05 23:55:53 +08:00 |
|
Concedo
|
2edbcebe27
|
added optional force versioning flag
|
2023-05-05 22:02:00 +08:00 |
|
Hendrik Langer
|
8131bc8b56
|
add new sampling algorithm mirostat
|
2023-05-05 13:23:47 +02:00 |
|
Concedo
|
4857739ab5
|
allow specifying a different thread count for GPU blas
|
2023-05-03 21:19:59 +08:00 |
|
Concedo
|
89044502fe
|
just use RT
|
2023-05-03 11:07:36 +08:00 |
|
Concedo
|
f43a63235b
|
priority adjustment for linux fixed
|
2023-05-03 10:16:43 +08:00 |
|
Concedo
|
966cd2ce91
|
Merge remote-tracking branch 'temp/concedo' into concedo_experimental
# Conflicts:
# koboldcpp.py
|
2023-05-02 22:43:34 +08:00 |
|
Concedo
|
58f25dce86
|
added flag to increase processs priority
|
2023-05-02 22:26:55 +08:00 |
|
Sergey Kucher
|
069b3d4c37
|
Adds --mlock argument
|
2023-05-02 16:19:37 +03:00 |
|
Concedo
|
5a10ea50da
|
up ver
|
2023-05-02 18:19:08 +08:00 |
|
Concedo
|
6f702f2700
|
fixed stop sequence crash
|
2023-05-02 14:56:50 +08:00 |
|
Concedo
|
433fa1e8b2
|
fix for stop sequence missing, added print for exception when loading GUI
|
2023-05-02 14:18:04 +08:00 |
|
Concedo
|
4d38795563
|
add UI for token unbanning
|
2023-05-01 12:10:21 +08:00 |
|
Concedo
|
f149114395
|
up ver
|
2023-04-29 19:42:21 +08:00 |
|
Concedo
|
7afad2b9b5
|
integrated the new samplers
|
2023-04-29 19:41:41 +08:00 |
|
Concedo
|
67ee2b93a7
|
removed bad import.
|
2023-04-29 09:59:16 +08:00 |
|
Concedo
|
f75de52b25
|
add short delay before exit gui
|
2023-04-28 15:09:17 +08:00 |
|
Concedo
|
e97c7099b0
|
created new tkinter GUI
|
2023-04-28 15:03:48 +08:00 |
|
Concedo
|
e8a389f85b
|
updated kobold lite, added debug mode, changed streaming mode to now use the same url when launching
|
2023-04-28 11:41:03 +08:00 |
|
Concedo
|
2499632cdc
|
up version
|
2023-04-27 17:27:10 +08:00 |
|
Concedo
|
137efe2b8f
|
updated embedded kobold lite, force streaming mode if stream flag is used
|
2023-04-27 17:16:55 +08:00 |
|
Concedo
|
5eec5d6ed9
|
Added backwards compatibility to an earlier version of NeoX.
|
2023-04-25 20:34:18 +08:00 |
|
Concedo
|
bff998f871
|
Slight refactor of the python code: credits to @LuxF3rre
|
2023-04-25 19:20:14 +08:00 |
|
Concedo
|
59fb174678
|
fixed compile errors, made mmap automatic when lora is selected, added updated quantizers and quantization handling for gpt neox gpt 2 and gptj
|
2023-04-24 23:20:06 +08:00 |
|
Concedo
|
3962eb39c7
|
added token unbanning
|
2023-04-24 21:50:20 +08:00 |
|
Concedo
|
8e615c8245
|
Merge branch 'master' into concedo_experimental
# Conflicts:
# README.md
|
2023-04-24 12:20:08 +08:00 |
|
Concedo
|
cd6c121357
|
reinstated the reusable buffers -> approx 10% speedup for prompt processing
|
2023-04-22 22:49:27 +08:00 |
|
Concedo
|
6e908c1792
|
added lora support
|
2023-04-22 12:29:38 +08:00 |
|
Concedo
|
c454f8b848
|
Gpt NeoX / Pythia integration completed
|
2023-04-22 11:23:25 +08:00 |
|
Concedo
|
7ba36c2c6c
|
trying to put out penguin based fires. sorry for inconvenience
|
2023-04-20 23:15:07 +08:00 |
|
Concedo
|
49697d86d8
|
adjusted down the buf memory allocation now that realloc seems to work
|
2023-04-20 17:51:13 +08:00 |
|
Concedo
|
3e88616439
|
fixed WONKY CODE
|
2023-04-20 16:41:32 +08:00 |
|
Concedo
|
346cd68903
|
make linux and OSX build process equal to windows. Now it will build all applicable libraries, for a full build do make LLAMA_OPENBLAS=1 LLAMA_CLBLAST=1
|
2023-04-20 15:53:55 +08:00 |
|
Concedo
|
45ec09d31b
|
fast forwarding for rwkv for unmodified contexts
|
2023-04-19 15:09:35 +08:00 |
|
Concedo
|
c200b674f4
|
updated kobold lite, work on rwkv, added exe path to model load params, added launch parameter
|
2023-04-18 17:36:44 +08:00 |
|
Concedo
|
1f4a69c051
|
version number api
|
2023-04-17 19:31:15 +08:00 |
|
Concedo
|
763ad172c0
|
arranged files, updated kobold lite, modified makefile for extra link args on linux, started RWKV implementation
|
2023-04-17 17:31:45 +08:00 |
|
Concedo
|
c757fbee1d
|
fixes to stopper tokens, fixed BLAS mode for GPT2 and GPTJ, updated kobold lite
|
2023-04-16 21:54:18 +08:00 |
|
Concedo
|
6548d3b3fb
|
Added prints for stopping sequences, made makefile 1% friendlier to arch linux users
|
2023-04-16 20:43:17 +08:00 |
|
Concedo
|
525184930d
|
added a kobold API compatible implementation of stopping sequences
|
2023-04-16 18:37:49 +08:00 |
|
Concedo
|
ad5676810a
|
merge CLBlast improvements - GPU dequant
|
2023-04-16 01:17:40 +08:00 |
|
Concedo
|
3eb1c1850e
|
accept non positional model arg
|
2023-04-16 00:23:07 +08:00 |
|
Concedo
|
1b1c0730f5
|
Idk why people keep thinking its an error lol.
|
2023-04-14 22:58:45 +08:00 |
|
Concedo
|
adb4df78d6
|
Added SmartContext mode, a way of prompt context manipulation that avoids frequent context recalculation.
|
2023-04-14 21:24:16 +08:00 |
|
Concedo
|
ca297c190f
|
up version
|
2023-04-13 14:38:38 +08:00 |
|
Concedo
|
1bd5992da4
|
clean and refactor handling of flags
|
2023-04-12 23:25:31 +08:00 |
|
Concedo
|
4faae0afa9
|
Merged upstream, fixed OSX compile errors, integrated noavx2 build into main
|
2023-04-12 18:08:55 +08:00 |
|
Concedo
|
23c675b2e6
|
integrated optional (experimentl) CLBlast support
|
2023-04-11 23:33:44 +08:00 |
|
Concedo
|
1f6aa47b6e
|
Merge branch 'master' into concedo
# Conflicts:
# README.md
|
2023-04-11 16:53:41 +08:00 |
|