[docs]: fix and add MiniMax-M2 tutorial images. (#1752)

2026-04-26 10:50:59 +00:00 · 2025-12-25 20:14:35 +08:00 · 2025-12-25 20:14:35 +08:00 · 63796374c1
commit 63796374c1
parent be668074de
3 changed files with 2 additions and 0 deletions
--- a/doc/assets/MiniMax-M2_comparison.png
+++ b/doc/assets/MiniMax-M2_comparison.png
--- a/doc/assets/MiniMax-M2_speed.png
+++ b/doc/assets/MiniMax-M2_speed.png
--- a/doc/en/kt-kernel/MiniMax-M2.1-Tutorial.md
+++ b/doc/en/kt-kernel/MiniMax-M2.1-Tutorial.md
@ -168,6 +168,8 @@ The following benchmarks were measured with single concurrency (Prefill tps / De
 | 1 x RTX 5090 (32 GB) | 2 x AMD EPYC 9355 | PCIe 5.0 | 408 / 32.1 | 1196 / 31.4 | 2540 / 27.6 |
 | 2 x RTX 5090 (32 GB) | 2 x AMD EPYC 9355 | PCIe 5.0 | 414 / 35.9 | 1847 / 35.5 | 4007 / 33.1 |

+![Throughput in 2 x RTX 5090](../../assets/MiniMax-M2_speed.png)
+
 ### Comparison with llama.cpp

 We benchmarked KT-Kernel + Sglang against llama.cpp to demonstrate the performance advantages of our CPU-GPU heterogeneous inference approach.