kvcache-ai-ktransformers

mirror of https://github.com/kvcache-ai/ktransformers.git synced 2025-09-09 13:55:27 +00:00

History

Azure-Tang 4a31237346 fix rocm compilation		2025-03-15 12:34:03 -04:00
..
custom_gguf	fix rocm compilation	2025-03-15 12:34:03 -04:00
gptq_marlin	merge main; Add torch q8 linear	2025-03-14 05:52:07 -04:00
binding.cpp	Ensure backward compatibility with Torch 2.2	2025-02-24 21:55:30 +08:00
setup.py	[ADD] support multi-gpu qlen>1 q5_k	2024-08-12 11:41:26 +00:00
test_dequant.py	optimize gguf dequant, save mem, support Q2_K	2025-02-22 06:13:01 +00:00