Commit graph

565 commits

Author SHA1 Message Date
Concedo
b9904c3093 up ver 2023-05-08 11:13:16 +08:00
Concedo
89d70886a4 added support for setting custom context size at load time (memory allocation) 2023-05-08 11:11:25 +08:00
Concedo
ff93b394da fixed a typo 2023-05-06 12:37:34 +08:00
Concedo
8a964e76c8 integrated mirostat as a launch parameter, works on all models 2023-05-06 00:47:17 +08:00
Concedo
851f55325a Merge remote-tracking branch 'temp/concedo' into concedo_experimental 2023-05-05 23:55:53 +08:00
Concedo
2edbcebe27 added optional force versioning flag 2023-05-05 22:02:00 +08:00
Hendrik Langer
8131bc8b56 add new sampling algorithm mirostat 2023-05-05 13:23:47 +02:00
Concedo
4857739ab5 allow specifying a different thread count for GPU blas 2023-05-03 21:19:59 +08:00
Concedo
89044502fe just use RT 2023-05-03 11:07:36 +08:00
Concedo
f43a63235b priority adjustment for linux fixed 2023-05-03 10:16:43 +08:00
Concedo
966cd2ce91 Merge remote-tracking branch 'temp/concedo' into concedo_experimental
# Conflicts:
#	koboldcpp.py
2023-05-02 22:43:34 +08:00
Concedo
58f25dce86 added flag to increase processs priority 2023-05-02 22:26:55 +08:00
Sergey Kucher
069b3d4c37 Adds --mlock argument 2023-05-02 16:19:37 +03:00
Concedo
5a10ea50da up ver 2023-05-02 18:19:08 +08:00
Concedo
6f702f2700 fixed stop sequence crash 2023-05-02 14:56:50 +08:00
Concedo
433fa1e8b2 fix for stop sequence missing, added print for exception when loading GUI 2023-05-02 14:18:04 +08:00
Concedo
4d38795563 add UI for token unbanning 2023-05-01 12:10:21 +08:00
Concedo
f149114395 up ver 2023-04-29 19:42:21 +08:00
Concedo
7afad2b9b5 integrated the new samplers 2023-04-29 19:41:41 +08:00
Concedo
67ee2b93a7 removed bad import. 2023-04-29 09:59:16 +08:00
Concedo
f75de52b25 add short delay before exit gui 2023-04-28 15:09:17 +08:00
Concedo
e97c7099b0 created new tkinter GUI 2023-04-28 15:03:48 +08:00
Concedo
e8a389f85b updated kobold lite, added debug mode, changed streaming mode to now use the same url when launching 2023-04-28 11:41:03 +08:00
Concedo
2499632cdc up version 2023-04-27 17:27:10 +08:00
Concedo
137efe2b8f updated embedded kobold lite, force streaming mode if stream flag is used 2023-04-27 17:16:55 +08:00
Concedo
5eec5d6ed9 Added backwards compatibility to an earlier version of NeoX. 2023-04-25 20:34:18 +08:00
Concedo
bff998f871 Slight refactor of the python code: credits to @LuxF3rre 2023-04-25 19:20:14 +08:00
Concedo
59fb174678 fixed compile errors, made mmap automatic when lora is selected, added updated quantizers and quantization handling for gpt neox gpt 2 and gptj 2023-04-24 23:20:06 +08:00
Concedo
3962eb39c7 added token unbanning 2023-04-24 21:50:20 +08:00
Concedo
8e615c8245 Merge branch 'master' into concedo_experimental
# Conflicts:
#	README.md
2023-04-24 12:20:08 +08:00
Concedo
cd6c121357 reinstated the reusable buffers -> approx 10% speedup for prompt processing 2023-04-22 22:49:27 +08:00
Concedo
6e908c1792 added lora support 2023-04-22 12:29:38 +08:00
Concedo
c454f8b848 Gpt NeoX / Pythia integration completed 2023-04-22 11:23:25 +08:00
Concedo
7ba36c2c6c trying to put out penguin based fires. sorry for inconvenience 2023-04-20 23:15:07 +08:00
Concedo
49697d86d8 adjusted down the buf memory allocation now that realloc seems to work 2023-04-20 17:51:13 +08:00
Concedo
3e88616439 fixed WONKY CODE 2023-04-20 16:41:32 +08:00
Concedo
346cd68903 make linux and OSX build process equal to windows. Now it will build all applicable libraries, for a full build do make LLAMA_OPENBLAS=1 LLAMA_CLBLAST=1 2023-04-20 15:53:55 +08:00
Concedo
45ec09d31b fast forwarding for rwkv for unmodified contexts 2023-04-19 15:09:35 +08:00
Concedo
c200b674f4 updated kobold lite, work on rwkv, added exe path to model load params, added launch parameter 2023-04-18 17:36:44 +08:00
Concedo
1f4a69c051 version number api 2023-04-17 19:31:15 +08:00
Concedo
763ad172c0 arranged files, updated kobold lite, modified makefile for extra link args on linux, started RWKV implementation 2023-04-17 17:31:45 +08:00
Concedo
c757fbee1d fixes to stopper tokens, fixed BLAS mode for GPT2 and GPTJ, updated kobold lite 2023-04-16 21:54:18 +08:00
Concedo
6548d3b3fb Added prints for stopping sequences, made makefile 1% friendlier to arch linux users 2023-04-16 20:43:17 +08:00
Concedo
525184930d added a kobold API compatible implementation of stopping sequences 2023-04-16 18:37:49 +08:00
Concedo
ad5676810a merge CLBlast improvements - GPU dequant 2023-04-16 01:17:40 +08:00
Concedo
3eb1c1850e accept non positional model arg 2023-04-16 00:23:07 +08:00
Concedo
1b1c0730f5 Idk why people keep thinking its an error lol. 2023-04-14 22:58:45 +08:00
Concedo
adb4df78d6 Added SmartContext mode, a way of prompt context manipulation that avoids frequent context recalculation. 2023-04-14 21:24:16 +08:00
Concedo
ca297c190f up version 2023-04-13 14:38:38 +08:00
Concedo
1bd5992da4 clean and refactor handling of flags 2023-04-12 23:25:31 +08:00