Merge pull request #1069 from kvcache-ai/qiyuxinlin-patch-4

Update balance-serve.md
wang jiahao 2025-04-07 19:20:00 +08:00 committed by GitHub
commit f4ae7c85ed

@@ -112,7 +112,7 @@ Use our optimized configuration for constrained VRAM:
 ```bash
 python ktransformers/server/main.py \
-  --port 10002
+  --port 10002 \
   --model_path <path_to_safetensor_config> \
   --gguf_path <path_to_gguf_files> \
   --optimize_config_path ktransformers/optimize/optimize_rules/DeepSeek-V3-Chat-serve.yaml \