kvcache-ai-ktransformers/ktransformers/ktransformers_ext/operators
2024-08-08 09:04:36 +00:00
..
custom_marlin/quantize Initial commit 2024-07-27 16:06:58 +08:00
llamafile 1) Linear and MLP operators support qlen>1; 2) All operators now share a single memory buffer; 3) Refactor CPUInfer submit/sync logic. 2024-08-08 09:04:36 +00:00