kvcache-ai-ktransformers/ktransformers/ktransformers_ext/cuda
2024-08-12 12:53:12 +00:00
..
custom_gguf [feature] support q2_k & q3_k dequantize on gpu 2024-08-12 12:53:12 +00:00
gptq_marlin [ADD] support multi-gpu qlen>1 q5_k 2024-08-12 11:41:26 +00:00
binding.cpp [feature] support q2_k & q3_k dequantize on gpu 2024-08-12 12:53:12 +00:00
setup.py [ADD] support multi-gpu qlen>1 q5_k 2024-08-12 11:41:26 +00:00