kvcache-ai-ktransformers/ktransformers/util
2025-02-01 07:32:21 +00:00
..
cuda_graph_runner.py [feature] release 0.1.3 2024-08-28 16:11:43 +00:00
custom_gguf.py Support IQ4_XS dequantize 2024-09-02 09:10:19 +07:00
modeling_rope_utils.py update rope calculation; update modeling.py; update gate for moe 2025-02-01 07:32:21 +00:00
textstream.py Initial commit 2024-07-27 16:06:58 +08:00
utils.py Fix: the tokens return by prefill_and_generate 2024-09-05 05:29:23 +00:00