Concedo | b5d6284190 | increase initial buffer too | 2023-04-23 00:07:33 +08:00
Concedo | d2f14b2b1f | add an extra buffer to mem allocations | 2023-04-23 00:04:32 +08:00
Concedo | 4fa3dfe8bc | just doesn't work properly on windows. will leave it as a manual flag for others | 2023-04-22 10:57:38 +08:00
Concedo | 7ba36c2c6c | trying to put out penguin based fires. sorry for inconvenience | 2023-04-20 23:15:07 +08:00
Concedo | 49697d86d8 | adjusted down the buf memory allocation now that realloc seems to work | 2023-04-20 17:51:13 +08:00
Concedo | cc407f283a | messing around with memory allocation to bandaid the random ooms with various gpt2 and gptj models | 2023-04-19 20:18:55 +08:00
Concedo | 45ec09d31b | fast forwarding for rwkv for unmodified contexts | 2023-04-19 15:09:35 +08:00
Concedo | c757fbee1d | fixes to stopper tokens, fixed BLAS mode for GPT2 and GPTJ, updated kobold lite | 2023-04-16 21:54:18 +08:00
Concedo | 1bd5992da4 | clean and refactor handling of flags | 2023-04-12 23:25:31 +08:00
Concedo | 69b85f5b61 | fixed a few OOM errors with larger contexts - I cannot figure out why they happen, so I am forced to increase the buffer size. | 2023-04-11 00:14:57 +08:00
Concedo | 18a154715e | added version label, improved file type checks | 2023-04-10 01:03:09 +08:00
Concedo | d8e37bfe75 | new gpt2 format supported | 2023-04-08 17:35:36 +08:00