Concedo | 49697d86d8 | adjusted down the buf memory allocation now that realloc seems to work | 2023-04-20 17:51:13 +08:00 |
Concedo | cc407f283a | messing around with memory allocation to bandaid the random ooms with various gpt2 and gptj models | 2023-04-19 20:18:55 +08:00 |
Concedo | 45ec09d31b | fast forwarding for rwkv for unmodified contexts | 2023-04-19 15:09:35 +08:00 |
Concedo | c757fbee1d | fixes to stopper tokens, fixed BLAS mode for GPT2 and GPTJ, updated kobold lite | 2023-04-16 21:54:18 +08:00 |
Concedo | 1bd5992da4 | clean and refactor handling of flags | 2023-04-12 23:25:31 +08:00 |
Concedo | 69b85f5b61 | fixed a few OOM errors with larger contexts - I cannot figure out why they happen, so I am forced to increase the buffer size. | 2023-04-11 00:14:57 +08:00 |
Concedo | 18a154715e | added version label, improved file type checks | 2023-04-10 01:03:09 +08:00 |
Concedo | d8e37bfe75 | new gpt2 format supported | 2023-04-08 17:35:36 +08:00 |