kvcache-ai-ktransformers/ktransformers/optimize/optimize_rules
2024-08-12 11:41:26 +00:00
..
DeepSeek-V2-Chat-multi-gpu-4.yaml [ADD] support multi-gpu qlen>1 q5_k 2024-08-12 11:41:26 +00:00
DeepSeek-V2-Chat-multi-gpu.yaml [ADD] support multi-gpu qlen>1 q5_k 2024-08-12 11:41:26 +00:00
DeepSeek-V2-Chat.yaml [ADD] support multi-gpu qlen>1 q5_k 2024-08-12 11:41:26 +00:00
DeepSeek-V2-Lite-Chat-multi-gpu.yaml [ADD] support multi-gpu qlen>1 q5_k 2024-08-12 11:41:26 +00:00
Mixtral.yaml [ADD] support multi-gpu qlen>1 q5_k 2024-08-12 11:41:26 +00:00
Qwen2-57B-A14B-Instruct-multi-gpu.yaml [ADD] support multi-gpu qlen>1 q5_k 2024-08-12 11:41:26 +00:00
Qwen2-57B-A14B-Instruct.yaml [ADD] support multi-gpu qlen>1 q5_k 2024-08-12 11:41:26 +00:00