kvcache-ai-ktransformers/ktransformers/ktransformers_ext/cuda
2024-09-13 08:34:23 +00:00
..
custom_gguf fix some dequant function dosen't support multi gpu bug 2024-09-13 08:34:23 +00:00
gptq_marlin [ADD] support multi-gpu qlen>1 q5_k 2024-08-12 11:41:26 +00:00
binding.cpp Support IQ4_XS dequantize 2024-09-02 09:10:19 +07:00
setup.py [ADD] support multi-gpu qlen>1 q5_k 2024-08-12 11:41:26 +00:00