mirror of
https://github.com/kvcache-ai/ktransformers.git
synced 2025-09-11 15:54:37 +00:00
Merge pull request #1069 from kvcache-ai/qiyuxinlin-patch-4
Update balance-serve.md
This commit is contained in:
commit
f4ae7c85ed
1 changed files with 1 additions and 1 deletions
|
@ -112,7 +112,7 @@ Use our optimized configuration for constrained VRAM:
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
python ktransformers/server/main.py \
|
python ktransformers/server/main.py \
|
||||||
--port 10002
|
--port 10002 \
|
||||||
--model_path <path_to_safetensor_config> \
|
--model_path <path_to_safetensor_config> \
|
||||||
--gguf_path <path_to_gguf_files> \
|
--gguf_path <path_to_gguf_files> \
|
||||||
--optimize_config_path ktransformers/optimize/optimize_rules/DeepSeek-V3-Chat-serve.yaml \
|
--optimize_config_path ktransformers/optimize/optimize_rules/DeepSeek-V3-Chat-serve.yaml \
|
||||||
|
|
Loading…
Add table
Add a link
Reference in a new issue