kvcache-ai-ktransformers/ktransformers/ktransformers_ext/cuda
2025-02-15 15:16:00 +00:00
custom_gguf   toy support for experts on GPU, no CUDA Graph   2025-02-15 15:16:00 +00:00
gptq_marlin   [ADD] support multi-gpu qlen>1 q5_k             2024-08-12 11:41:26 +00:00
binding.cpp   Support IQ4_XS dequantize                       2024-09-02 09:10:19 +07:00
setup.py      [ADD] support multi-gpu qlen>1 q5_k             2024-08-12 11:41:26 +00:00