kvcache-ai-ktransformers/ktransformers/ktransformers_ext/cuda/gptq_marlin
2024-08-12 11:41:26 +00:00
..
gptq_marlin.cu [ADD] support multi-gpu qlen>1 q5_k 2024-08-12 11:41:26 +00:00
gptq_marlin.cuh Initial commit 2024-07-27 16:06:58 +08:00
gptq_marlin_dtypes.cuh Initial commit 2024-07-27 16:06:58 +08:00
ops.h Initial commit 2024-07-27 16:06:58 +08:00