kvcache-ai-ktransformers/ktransformers/util
Atream c5f036e8a4
Merge pull request #333 from kvcache-ai/feat_experts_gpu
toy support for experts on GPU, no CUDA Graph
2025-02-15 23:30:24 +08:00
..
cuda_graph_runner.py [feature] release 0.1.3 2024-08-28 16:11:43 +00:00
custom_gguf.py Merge pull request #333 from kvcache-ai/feat_experts_gpu 2025-02-15 23:30:24 +08:00
modeling_rope_utils.py update rope calculation; update modeling.py; update gate for moe 2025-02-01 07:32:21 +00:00
textstream.py Initial commit 2024-07-27 16:06:58 +08:00
utils.py warm_up before capture 2025-02-14 15:52:21 +00:00