Concedo
cd6c121357
reinstated the reusable buffers -> approx 10% speedup for prompt processing
2023-04-22 22:49:27 +08:00
Concedo
6e908c1792
added lora support
2023-04-22 12:29:38 +08:00
Concedo
c454f8b848
Gpt NeoX / Pythia integration completed
2023-04-22 11:23:25 +08:00
Concedo
7ba36c2c6c
trying to put out penguin based fires. sorry for inconvenience
2023-04-20 23:15:07 +08:00
Concedo
49697d86d8
adjusted down the buf memory allocation now that realloc seems to work
2023-04-20 17:51:13 +08:00
Concedo
3e88616439
fixed WONKY CODE
2023-04-20 16:41:32 +08:00
Concedo
346cd68903
make linux and OSX build process equal to windows. Now it will build all applicable libraries, for a full build do make LLAMA_OPENBLAS=1 LLAMA_CLBLAST=1
2023-04-20 15:53:55 +08:00
Concedo
45ec09d31b
fast forwarding for rwkv for unmodified contexts
2023-04-19 15:09:35 +08:00
Concedo
c200b674f4
updated kobold lite, work on rwkv, added exe path to model load params, added launch parameter
2023-04-18 17:36:44 +08:00
Concedo
1f4a69c051
version number api
2023-04-17 19:31:15 +08:00
Concedo
763ad172c0
arranged files, updated kobold lite, modified makefile for extra link args on linux, started RWKV implementation
2023-04-17 17:31:45 +08:00
Concedo
c757fbee1d
fixes to stopper tokens, fixed BLAS mode for GPT2 and GPTJ, updated kobold lite
2023-04-16 21:54:18 +08:00
Concedo
6548d3b3fb
Added prints for stopping sequences, made makefile 1% friendlier to arch linux users
2023-04-16 20:43:17 +08:00
Concedo
525184930d
added a kobold API compatible implementation of stopping sequences
2023-04-16 18:37:49 +08:00
Concedo
ad5676810a
merge CLBlast improvements - GPU dequant
2023-04-16 01:17:40 +08:00
Concedo
3eb1c1850e
accept non positional model arg
2023-04-16 00:23:07 +08:00
Concedo
1b1c0730f5
Idk why people keep thinking its an error lol.
2023-04-14 22:58:45 +08:00
Concedo
adb4df78d6
Added SmartContext mode, a way of prompt context manipulation that avoids frequent context recalculation.
2023-04-14 21:24:16 +08:00
Concedo
ca297c190f
up version
2023-04-13 14:38:38 +08:00
Concedo
1bd5992da4
clean and refactor handling of flags
2023-04-12 23:25:31 +08:00
Concedo
4faae0afa9
Merged upstream, fixed OSX compile errors, integrated noavx2 build into main
2023-04-12 18:08:55 +08:00
Concedo
23c675b2e6
integrated optional (experimentl) CLBlast support
2023-04-11 23:33:44 +08:00
Concedo
1f6aa47b6e
Merge branch 'master' into concedo
...
# Conflicts:
# README.md
2023-04-11 16:53:41 +08:00
Concedo
69b85f5b61
fixed a few OOM errors with larger contexts - I cannot figure out why they happen, so I am forced to increase the buffer size.
2023-04-11 00:14:57 +08:00
Concedo
f53238f570
Merged the upstream updates for model loading code, and ditched the legacy llama loaders since they were no longer needed.
2023-04-10 12:00:34 +08:00
Concedo
18a154715e
added version label, improved file type checks
2023-04-10 01:03:09 +08:00
Concedo
1543c700d8
added a missing endpoint for tavern
2023-04-09 17:41:33 +08:00
Concedo
b91abc3316
increase default blas batch size
2023-04-09 15:27:43 +08:00
Concedo
26a7933084
hide the tiny tkinter window
2023-04-09 01:01:34 +08:00
Concedo
f322a5820e
fixed positional port arg
2023-04-07 17:46:33 +08:00
Concedo
4f5faf9612
some users report that this repo is now being flagged as malicious?
...
no idea why, but I am removing all prebuilt binaries except libopenblas. windows users can still obtain it from /releases and osx and linux users can rebuild from source code.
2023-04-06 21:49:43 +08:00
Concedo
b56f872b61
update embedded kobold lite
2023-04-06 16:34:51 +08:00
Concedo
3d650d0e25
remove dependency of psutil, fixed compile error on WSL, handle exceptions when sending http response, added multiline for embedded kobold
2023-04-06 11:08:19 +08:00
Concedo
dd2abd8bc7
lower default thread threshold
2023-04-04 00:42:49 +08:00
Concedo
eb5b22dda2
rebrand to koboldcpp
2023-04-03 10:35:18 +08:00