kvcache-ai-ktransformers/ktransformers/ktransformers_ext/examples
2024-08-28 16:11:43 +00:00
File | Last commit message | Last commit date
test_attention.py | [feature] release 0.1.3 | 2024-08-28 16:11:43 +00:00
test_linear.py | 1) Linear and MLP operators support qlen>1; 2) All operators now share a single memory buffer; 3) Refactor CPUInfer submit/sync logic. | 2024-08-08 09:04:36 +00:00
test_mlp.py | 1) Linear and MLP operators support qlen>1; 2) All operators now share a single memory buffer; 3) Refactor CPUInfer submit/sync logic. | 2024-08-08 09:04:36 +00:00
test_moe.py | 1) Linear and MLP operators support qlen>1; 2) All operators now share a single memory buffer; 3) Refactor CPUInfer submit/sync logic. | 2024-08-08 09:04:36 +00:00