Commit graph

8 commits

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| Concedo | abb9ad789c | fixed other arch | 2023-05-24 00:20:43 +08:00 |
| Concedo | 72836d4eac | fixing more compile issues | 2023-05-15 20:10:54 +08:00 |
| Concedo | 6504150fac | just testing cublas | 2023-05-15 20:01:22 +08:00 |
| Concedo | 032a171867 | integrated q5 formats | 2023-04-28 12:58:39 +08:00 |
| Concedo | 69b85f5b61 | fixed a few OOM errors with larger contexts - I cannot figure out why they happen, so I am forced to increase the buffer size. | 2023-04-11 00:14:57 +08:00 |
| Concedo | d8e37bfe75 | new gpt2 format supported | 2023-04-08 17:35:36 +08:00 |
| Concedo | 1490cdd71d | change GPT-J and GPT2 KVs to use fp16 instead | 2023-04-05 15:53:07 +08:00 |
| Concedo | 14273fea7a | integrated gpt2 support | 2023-04-04 23:15:47 +08:00 |