Commit graph

2 commits

Author        SHA1        Message                                                 Date
Atream        c189d55bd1  toy support for experts on GPU, no CUDA Graph           2025-02-15 15:16:00 +00:00
MorphisZhang  aea4243712  Add optimization config for Deepseek V3/R1 with 4 GPUs  2025-02-13 16:32:28 +08:00