Merge pull request #1436 from kvcache-ai/Atream-patch-4

Update Kimi-K2.md
2025-09-07 21:19:51 +00:00 · 2025-07-12 11:58:25 +08:00 · 2025-07-12 11:58:25 +08:00 · df19681ec4
commit df19681ec4
parent 8e2c67d655 90245d8a6b
1 changed file with 3 additions and 0 deletions
--- a/doc/en/Kimi-K2.md
+++ b/doc/en/Kimi-K2.md
@@ -5,6 +5,9 @@
 ### Overview
 We are very pleased to announce that Ktransformers now supports Kimi-K2.

+On a single-socket CPU with one consumer-grade GPU, the Q4_K_M model runs at roughly 10 tokens per second (TPS) and requires about 600 GB of DRAM (system memory, not GPU VRAM).  
+With a dual-socket CPU and sufficient system memory, enabling NUMA optimizations increases performance to about 14 TPS.
+
 ### Model & Resource Links

 - Official Kimi-K2 Release: