kvcache-ai-ktransformers/ktransformers/ktransformers_ext/cuda
Xiaodong Ye f88c05a6f1 Ensure backward compatibility with Torch 2.2
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
2025-02-24 21:55:30 +08:00
..
custom_gguf Ensure backward compatibility with Torch 2.2 2025-02-24 21:55:30 +08:00
gptq_marlin [ADD] support multi-gpu qlen>1 q5_k 2024-08-12 11:41:26 +00:00
binding.cpp Ensure backward compatibility with Torch 2.2 2025-02-24 21:55:30 +08:00
setup.py [ADD] support multi-gpu qlen>1 q5_k 2024-08-12 11:41:26 +00:00
test_dequant.py optimize gguf dequant, save mem, support Q2_K 2025-02-22 06:13:01 +00:00