Concedo
|
18bb0ab127
|
up ver, support 16k ctx
|
2023-08-04 21:47:17 +08:00 |
|
Concedo
|
523fc3be52
|
fixed rwkv, standardized new ctx usage
|
2023-07-10 20:05:53 +08:00 |
|
Concedo
|
2827920044
|
fix compile errors, rwkv not working
|
2023-07-10 18:23:25 +08:00 |
|
Concedo
|
bfeb3471d7
|
fix typos
|
2023-07-03 21:36:42 +08:00 |
|
Concedo
|
86469d15c4
|
fix for yr-rocm, large gpu scratch
|
2023-06-30 12:40:08 +08:00 |
|
Concedo
|
b4698abafc
|
Wip, CUDA porting malloc improvements, gpu accel for non-llama, backport old quants
|
2023-06-28 18:20:46 +08:00 |
|
Concedo
|
8342fe81b1
|
revert the wstring tokenization. coherency was affected
|
2023-06-24 12:58:49 +08:00 |
|
Concedo
|
0485fa65a2
|
wstring convert for mpt
|
2023-06-24 11:43:42 +08:00 |
|
Concedo
|
490cf395f8
|
better alloc error
|
2023-06-23 22:51:51 +08:00 |
|
Concedo
|
f39a746089
|
bug fixes for openblas
|
2023-06-23 22:45:22 +08:00 |
|
Concedo
|
43c2891afa
|
option to not use scratch
|
2023-06-23 19:01:36 +08:00 |
|
Concedo
|
d5e4cf7ffe
|
handle ctx manip
|
2023-06-23 19:01:15 +08:00 |
|
Concedo
|
e6ddb15c3a
|
cleanup
|
2023-06-22 10:38:27 +08:00 |
|
Concedo
|
1b71752a9f
|
Implemented basic GPU offloading for MPT, GPT-2, GPT-J and GPT-NeoX
|
2023-06-22 00:43:25 +08:00 |
|
Concedo
|
9270056269
|
fixed compile error in cmake VS
|
2023-06-05 11:48:04 +08:00 |
|
Concedo
|
c1b293d31a
|
fixed MPT ooms
|
2023-06-03 18:37:13 +08:00 |
|
Concedo
|
6f82e17b7a
|
added MPT support
|
2023-06-03 16:14:08 +08:00 |
|