kvcache-ai-ktransformers/ktransformers/operators
2024-08-12 11:41:26 +00:00
..
__init__.py Initial commit 2024-07-27 16:06:58 +08:00
attention.py Initial commit 2024-07-27 16:06:58 +08:00
base_operator.py Initial commit 2024-07-27 16:06:58 +08:00
experts.py [ADD] support multi-gpu qlen>1 q5_k 2024-08-12 11:41:26 +00:00
layer_wise_prefill.py [ADD] support multi-gpu qlen>1 q5_k 2024-08-12 11:41:26 +00:00
linear.py [ADD] support multi-gpu qlen>1 q5_k 2024-08-12 11:41:26 +00:00
RoPE.py [ADD] support multi-gpu qlen>1 q5_k 2024-08-12 11:41:26 +00:00