update Go DeepSeek request estimates for cache pricing changes (#24575)

2026-04-29 13:09:46 +00:00 · 2026-04-27 13:06:51 +08:00 · 2026-04-27 13:06:51 +08:00 · 1e191ba815
commit 1e191ba815
parent f19d863689
19 changed files with 108 additions and 90 deletions
--- a/packages/web/src/content/docs/it/go.mdx
+++ b/packages/web/src/content/docs/it/go.mdx
@ -106,17 +106,18 @@ La tabella seguente fornisce una stima del conteggio delle richieste in base a p
 | MiniMax M2.5       | 6,300                | 15,900                | 31,800            |
 | Qwen3.6 Plus       | 3,300                | 8,200                 | 16,300            |
 | Qwen3.5 Plus       | 10,200               | 25,200                | 50,500            |
-| DeepSeek V4 Pro    | 1,300                | 3,250                 | 6,500             |
-| DeepSeek V4 Flash  | 7,450                | 18,600                | 37,300            |
+| DeepSeek V4 Pro    | 3,450                | 8,550                 | 17,150            |
+| DeepSeek V4 Flash  | 5,750                | 14,350                | 28,650            |

 Le stime si basano sui pattern medi di richieste osservati:

 - GLM-5/5.1 — 700 di input, 52.000 in cache, 150 token di output per richiesta
 - Kimi K2.5/K2.6 — 870 di input, 55.000 in cache, 200 token di output per richiesta
- DeepSeek V4 Pro/Flash — 700 input, 52,000 cached, 150 output tokens per request
+- DeepSeek V4 Pro — 750 di input, 82.000 in cache, 290 token di output per richiesta
+- DeepSeek V4 Flash — 790 di input, 68.000 in cache, 280 token di output per richiesta
 - MiniMax M2.7/M2.5 — 300 di input, 55.000 in cache, 125 token di output per richiesta
- Qwen3.5 Plus — 410 input, 47,000 cached, 140 output tokens per request
- Qwen3.6 Plus — 500 input, 57,000 cached, 190 output tokens per request
+- Qwen3.5 Plus — 410 di input, 47.000 in cache, 140 token di output per richiesta
+- Qwen3.6 Plus — 500 di input, 57.000 in cache, 190 token di output per richiesta
 - MiMo-V2-Pro — 350 di input, 41.000 in cache, 250 token di output per richiesta
 - MiMo-V2-Omni — 1000 di input, 60.000 in cache, 140 token di output per richiesta
 - MiMo-V2.5-Pro — 350 di input, 41.000 in cache, 250 token di output per richiesta