From 6b551a6ee3166e0f6c71cd75df6c4b0bbb83ebec Mon Sep 17 00:00:00 2001
From: hybcloud <43987901+hybcloud@users.noreply.github.com>
Date: Wed, 5 Mar 2025 11:09:29 +0800
Subject: [PATCH] fix minor typo

---
 doc/en/DeepseekR1_V3_tutorial.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/doc/en/DeepseekR1_V3_tutorial.md b/doc/en/DeepseekR1_V3_tutorial.md
index fb26b77..9c031dc 100644
--- a/doc/en/DeepseekR1_V3_tutorial.md
+++ b/doc/en/DeepseekR1_V3_tutorial.md
@@ -86,7 +86,7 @@ Memory: standard DDR5-4800 server DRAM (1 TB), each socket with 8×DDR5-4800
 
 #### Change Log
 - Longer Context (from 4K to 8K for 24GB VRAM) and Slightly Faster Speed (+15%):
 Integrated the highly efficient Triton MLA Kernel from the fantastic sglang project, enable much longer context length and slightly faster prefill/decode speed
-- We suspect that some of the improvements come from the change of hardwre platform (4090D->4090)
+- We suspect that some of the improvements come from the change of hardware platform (4090D->4090)
 
 #### Benchmark Results