fix: improve /model --fast description clarity and prevent accidental activation (#3077)

Replace vague "background tasks" with specific "prompt suggestions and speculative execution" in the --fast flag description across all i18n locales, docs, and VS Code schema. Update example model name from qwen3.5-flash to qwen3-coder-flash. Also fix completion logic to require a non-empty partial arg before suggesting --fast, preventing Tab+Enter from accidentally entering fast model mode. Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-28 11:41:04 +00:00 · 2026-04-10 12:09:46 +08:00 · 2026-04-10 12:09:46 +08:00 · 5482044e59
commit 5482044e59
parent 746f67f436
12 changed files with 41 additions and 37 deletions
--- a/docs/users/configuration/settings.md
+++ b/docs/users/configuration/settings.md
@ -204,9 +204,9 @@ The `extra_body` field allows you to add custom parameters to the request body s

 #### fastModel

-| Setting     | Type   | Description                                                                                                                                                                                                                                           | Default |
-| ----------- | ------ | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | ------- |
-| `fastModel` | string | Model for background tasks ([suggestion generation](../features/followup-suggestions), speculation). Leave empty to use the main model. A smaller/faster model (e.g., `qwen3.5-flash`) reduces latency and cost. Can also be set via `/model --fast`. | `""`    |
+| Setting     | Type   | Description                                                                                                                                                                                                                                                      | Default |
+| ----------- | ------ | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | ------- |
+| `fastModel` | string | Model used for generating [prompt suggestions](../features/followup-suggestions) and speculative execution. Leave empty to use the main model. A smaller/faster model (e.g., `qwen3-coder-flash`) reduces latency and cost. Can also be set via `/model --fast`. | `""`    |

 #### context

--- a/docs/users/features/commands.md
+++ b/docs/users/features/commands.md
@ -56,21 +56,21 @@ Commands specifically for controlling interface and output language.

 Commands for managing AI tools and models.

-| Command          | Description                                       | Usage Examples                                |
-| ---------------- | ------------------------------------------------- | --------------------------------------------- |
-| `/mcp`           | List configured MCP servers and tools             | `/mcp`, `/mcp desc`                           |
-| `/tools`         | Display currently available tool list             | `/tools`, `/tools desc`                       |
-| `/skills`        | List and run available skills                     | `/skills`, `/skills <name>`                   |
-| `/plan`          | Switch to plan mode or exit plan mode             | `/plan`, `/plan <task>`, `/plan exit`         |
-| `/approval-mode` | Change approval mode for tool usage               | `/approval-mode <mode (auto-edit)> --project` |
-| →`plan`          | Analysis only, no execution                       | Secure review                                 |
-| →`default`       | Require approval for edits                        | Daily use                                     |
-| →`auto-edit`     | Automatically approve edits                       | Trusted environment                           |
-| →`yolo`          | Automatically approve all                         | Quick prototyping                             |
-| `/model`         | Switch model used in current session              | `/model`                                      |
-| `/model --fast`  | Set or select the fast model for background tasks | `/model --fast qwen3.5-flash`                 |
-| `/extensions`    | List all active extensions in current session     | `/extensions`                                 |
-| `/memory`        | Manage AI's instruction context                   | `/memory add Important Info`                  |
+| Command          | Description                                   | Usage Examples                                |
+| ---------------- | --------------------------------------------- | --------------------------------------------- |
+| `/mcp`           | List configured MCP servers and tools         | `/mcp`, `/mcp desc`                           |
+| `/tools`         | Display currently available tool list         | `/tools`, `/tools desc`                       |
+| `/skills`        | List and run available skills                 | `/skills`, `/skills <name>`                   |
+| `/plan`          | Switch to plan mode or exit plan mode         | `/plan`, `/plan <task>`, `/plan exit`         |
+| `/approval-mode` | Change approval mode for tool usage           | `/approval-mode <mode (auto-edit)> --project` |
+| →`plan`          | Analysis only, no execution                   | Secure review                                 |
+| →`default`       | Require approval for edits                    | Daily use                                     |
+| →`auto-edit`     | Automatically approve edits                   | Trusted environment                           |
+| →`yolo`          | Automatically approve all                     | Quick prototyping                             |
+| `/model`         | Switch model used in current session          | `/model`                                      |
+| `/model --fast`  | Set a lighter model for prompt suggestions    | `/model --fast qwen3-coder-flash`             |
+| `/extensions`    | List all active extensions in current session | `/extensions`                                 |
+| `/memory`        | Manage AI's instruction context               | `/memory add Important Info`                  |

 ### 1.5 Built-in Skills

--- a/docs/users/features/followup-suggestions.md
+++ b/docs/users/features/followup-suggestions.md
@ -49,7 +49,7 @@ By default, suggestions use the same model as your main conversation. For faster
 ### Via command

 ```
-/model --fast qwen3.5-flash
+/model --fast qwen3-coder-flash
 ```

 Or use `/model --fast` (without a model name) to open a selection dialog.
@ -58,11 +58,11 @@ Or use `/model --fast` (without a model name) to open a selection dialog.

 ```json
 {
-  "fastModel": "qwen3.5-flash"
+  "fastModel": "qwen3-coder-flash"
 }
 ```

-The fast model is used for background tasks like suggestion generation. When not configured, the main conversation model is used as fallback.
+The fast model is used for prompt suggestions and speculative execution. When not configured, the main conversation model is used as fallback.

 Thinking/reasoning mode is automatically disabled for all background tasks (suggestion generation and speculation), regardless of your main model's thinking configuration. This avoids wasting tokens on internal reasoning that isn't needed for these tasks.

@ -75,13 +75,13 @@ These settings can be configured in `settings.json`:
 | `ui.enableFollowupSuggestions` | boolean | `true`  | Enable or disable followup suggestions                             |
 | `ui.enableCacheSharing`        | boolean | `true`  | Use cache-aware forked queries to reduce cost (experimental)       |
 | `ui.enableSpeculation`         | boolean | `false` | Speculatively execute suggestions before submission (experimental) |
-| `fastModel`                    | string  | `""`    | Model for background tasks (suggestion generation, speculation)    |
+| `fastModel`                    | string  | `""`    | Model for prompt suggestions and speculative execution             |

 ### Example

 ```json
 {
-  "fastModel": "qwen3.5-flash",
+  "fastModel": "qwen3-coder-flash",
  "ui": {
    "enableFollowupSuggestions": true,
    "enableCacheSharing": true