kvcache-ai-ktransformers/ktransformers/ktransformers_ext/cuda
2025-03-15 12:34:03 -04:00
custom_gguf      fix rocm compilation                           2025-03-15 12:34:03 -04:00
gptq_marlin      merge main; Add torch q8 linear                2025-03-14 05:52:07 -04:00
binding.cpp      Ensure backward compatibility with Torch 2.2   2025-02-24 21:55:30 +08:00
setup.py         [ADD] support multi-gpu qlen>1 q5_k            2024-08-12 11:41:26 +00:00
test_dequant.py  optimize gguf dequant, save mem, support Q2_K  2025-02-22 06:13:01 +00:00