mirror of
https://github.com/kvcache-ai/ktransformers.git
synced 2025-09-09 05:54:06 +00:00
Merge pull request #1436 from kvcache-ai/Atream-patch-4
Update Kimi-K2.md
This commit is contained in:
commit
df19681ec4
1 changed files with 3 additions and 0 deletions
|
@ -5,6 +5,9 @@
|
||||||
### Overview
|
### Overview
|
||||||
We are very pleased to announce that Ktransformers now supports Kimi-K2.
|
We are very pleased to announce that Ktransformers now supports Kimi-K2.
|
||||||
|
|
||||||
|
On a single-socket CPU with one consumer-grade GPU, running the Q4_K_M model yields roughly 10 TPS and requires about 600 GB of VRAM.
|
||||||
|
With a dual-socket CPU and sufficient system memory, enabling NUMA optimizations increases performance to about 14 TPS.
|
||||||
|
|
||||||
### Model & Resource Links
|
### Model & Resource Links
|
||||||
|
|
||||||
- Official Kimi-K2 Release:
|
- Official Kimi-K2 Release:
|
||||||
|
|
Loading…
Add table
Add a link
Reference in a new issue