kvcache-ai-ktransformers/ktransformers/util
2025-02-23 15:37:09 +08:00
cuda_graph_runner.py    [feature] release 0.1.3                                           2024-08-28 16:11:43 +00:00
custom_gguf.py          fix bf16 load, TODO: refactor cpu dequant                         2025-02-23 15:37:09 +08:00
modeling_rope_utils.py  update rope calculation; update modeling.py; update gate for moe  2025-02-01 07:32:21 +00:00
textstream.py           Initial commit                                                    2024-07-27 16:06:58 +08:00
utils.py                optimize gguf dequant, save mem, support Q2_K                     2025-02-22 06:13:01 +00:00