From 90245d8a6b2264c3ab3bc2acdbec8cc69386421f Mon Sep 17 00:00:00 2001 From: Atream <80757050+Atream@users.noreply.github.com> Date: Sat, 12 Jul 2025 11:57:51 +0800 Subject: [PATCH] Update Kimi-K2.md --- doc/en/Kimi-K2.md | 3 +++ 1 file changed, 3 insertions(+) diff --git a/doc/en/Kimi-K2.md b/doc/en/Kimi-K2.md index b5f546a..1839ede 100644 --- a/doc/en/Kimi-K2.md +++ b/doc/en/Kimi-K2.md @@ -5,6 +5,9 @@ ### Overview We are very pleased to announce that Ktransformers now supports Kimi-K2. +On a single-socket CPU with one consumer-grade GPU, running the Q4_K_M model yields roughly 10 TPS and requires about 600 GB of VRAM. +With a dual-socket CPU and sufficient system memory, enabling NUMA optimizations increases performance to about 14 TPS. + ### Model & Resource Links - Official Kimi-K2 Release: