kvcache-ai-ktransformers/ktransformers/util
2025-10-23 17:55:03 +08:00
..
ascend fix:修复balance_server tp=1 不开图下沉报错 2025-09-22 20:52:07 +08:00
cuda_graph_runner.py [feature] release 0.1.3 2024-08-28 16:11:43 +00:00
custom_gguf.py 处理检视意见 2025-10-23 11:28:42 +08:00
custom_loader.py 合并fix some bugs 2025-10-20 12:34:36 +00:00
modeling_rope_utils.py update rope calculation; update modeling.py; update gate for moe 2025-02-01 07:32:21 +00:00
npu_graph_runner.py fix:修复balance_server tp=1 不开图下沉报错 2025-09-22 20:52:07 +08:00
textstream.py Initial commit 2024-07-27 16:06:58 +08:00
utils.py fix transformers local_chat 2025-10-23 17:51:19 +08:00
vendors.py merge main; Add torch q8 linear 2025-03-14 05:52:07 -04:00
weight_loader.py support safetensor load, delete architectures argument 2025-05-09 10:38:29 +00:00