kvcache-ai-ktransformers/csrc/ktransformers_ext/cuda
2025-05-09 10:38:29 +00:00
..
custom_gguf Fix some build error for ROCM 2025-04-17 11:34:33 +08:00
gptq_marlin refactor folders 2025-03-31 22:45:37 +08:00
binding.cpp add balance-serve, support concurrence 2025-03-31 22:55:32 +08:00
setup.py refactor folders 2025-03-31 22:45:37 +08:00
test_dequant.py support safetensor load, delete architectures argument 2025-05-09 10:38:29 +00:00