Merge pull request #1436 from kvcache-ai/Atream-patch-4

Update Kimi-K2.md
Atream 2025-07-12 11:58:25 +08:00 committed by GitHub
commit df19681ec4
GPG key ID: B5690EEEBB952194


@@ -5,6 +5,9 @@
### Overview
We are very pleased to announce that KTransformers now supports Kimi-K2.
On a single-socket CPU with one consumer-grade GPU, running the Q4_K_M model yields roughly 10 TPS (tokens per second) and requires about 600 GB of DRAM (system memory).
With a dual-socket CPU and sufficient system memory, enabling NUMA optimizations increases performance to about 14 TPS.
### Model & Resource Links
- Official Kimi-K2 Release: