kvcache-ai-ktransformers/ktransformers/operators
2024-08-15 10:44:59 +08:00
__init__.py       Initial commit                                    2024-07-27 16:06:58 +08:00
attention.py      [fix] format classes and files name               2024-08-15 10:44:59 +08:00
base_operator.py  Initial commit                                    2024-07-27 16:06:58 +08:00
cpuinfer.py       [feature] experts can be injected using CPUInfer  2024-08-14 16:10:54 +08:00
experts.py        [fix] format classes and files name               2024-08-15 10:44:59 +08:00
linear.py         [fix] format classes and files name               2024-08-15 10:44:59 +08:00
models.py         [fix] format classes and files name               2024-08-15 10:44:59 +08:00
RoPE.py           [ADD] support multi-gpu qlen>1 q5_k               2024-08-12 11:41:26 +00:00