Commit graph

3801 commits

Author SHA1 Message Date
DennisYu07
1ce8502ebf Merge branch 'main' into feat/hook_sessionstart_sessionend 2026-03-17 20:41:08 -07:00
DennisYu07
b236e4152f Merge branch 'main' into feat/hook_sessionstart_sessionend 2026-03-17 20:34:13 -07:00
tanzhenxin
080271031d
Merge pull request #2400 from QwenLM/feat/system-prompt-sdk
feat: add system prompt customization options in SDK and CLI
2026-03-18 11:29:21 +08:00
tanzhenxin
a60fadd822
Merge pull request #2403 from simon100500/fix/duplicate-finish-chunk-tool-calls
fix(pipeline): handle duplicate finish_reason chunks from OpenRouter
2026-03-18 11:20:12 +08:00
tanzhenxin
22f0437369 chore: bump version to 0.13.0
Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
2026-03-18 10:41:32 +08:00
qqqys
3a92be09e0 test(cli): remove promptTokens prop from LoadingIndicator tests 2026-03-18 00:22:35 +08:00
qqqys
476d6bc4fc test(file-handler): enhance tests for FileMessageHandler with fuzzy search and path filtering 2026-03-18 00:20:11 +08:00
qqqys
617874f152 fix(ui): handle optional metrics in Composer component 2026-03-17 21:37:02 +08:00
qqqys
7a554b1226 refactor(file-handler): improve file watcher management and cache clearing 2026-03-17 21:21:53 +08:00
qqqys
ebeb7ed690 refactor(completion): enhance trigger detection logic for completion suggestions 2026-03-17 20:55:12 +08:00
tanzhenxin
61347577ce refactor(core): centralize tool output truncation logic
Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>

- Add truncateToolOutput helper in truncation.ts to centralize threshold reading, file saving, and telemetry logging

- Refactor shell.ts to use the new helper, removing duplicate code

- Add truncation support for MCP tool output while preserving non-text content (images, audio, resources)

- Refactor getDisplayFromParts to work on transformed Part[] instead of raw MCP response

This reduces code duplication and ensures consistent truncation behavior across shell and MCP tools.
2026-03-17 20:24:20 +08:00
qqqys
03e59256c4 feat(ui): enhance LoadingIndicator to display token counts and improve formatting
- Added candidatesTokens prop to LoadingIndicator for displaying token counts.
- Updated formatting to show elapsed time and token counts inline.
- Refactored tests to validate new token display functionality and formatting changes.
- Introduced formatTokenCount utility for consistent token count representation.

This improves user feedback during loading states by providing clearer information on token usage.
2026-03-17 20:10:54 +08:00
LaZzyMan
28149e0cc4 fix test ci 2026-03-17 19:15:58 +08:00
qwen-code-ci-bot
ac30c98a26
chore: bump version to 0.12.6 (#2442)
Some checks failed
Qwen Code CI / Lint (push) Failing after 5s
Qwen Code CI / CodeQL (push) Failing after 5s
Qwen Code CI / Test (push) Has been skipped
Qwen Code CI / Test-1 (push) Has been skipped
Qwen Code CI / Test-2 (push) Has been skipped
Qwen Code CI / Test-3 (push) Has been skipped
Qwen Code CI / Test-4 (push) Has been skipped
Qwen Code CI / Test-5 (push) Has been skipped
Qwen Code CI / Test-6 (push) Has been skipped
Qwen Code CI / Test-7 (push) Has been skipped
Qwen Code CI / Test-8 (push) Has been skipped
E2E Tests / E2E Test (Linux) - sandbox:none (push) Failing after 3s
E2E Tests / E2E Test (Linux) - sandbox:docker (push) Failing after 4s
Qwen Code CI / Post Coverage Comment (push) Has been skipped
E2E Tests / E2E Test - macOS (push) Has been cancelled
Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
2026-03-17 19:00:26 +08:00
LaZzyMan
8722dc9dd6 fix remove useless output 2026-03-17 18:53:42 +08:00
Mingholy
de29a8d987
Merge pull request #2438 from QwenLM/fix-max-token-limit
fix: improve max_tokens handling with conservative defaults
2026-03-17 18:45:48 +08:00
LaZzyMan
0897ddd75c i18n: add auth command translations for all 6 languages 2026-03-17 18:28:32 +08:00
mingholy.lmh
f300e3ab32 fix: respect user-configured max_tokens for unknown Anthropic models
Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
2026-03-17 18:16:21 +08:00
LaZzyMan
9a3041335f feat: add auth command 2026-03-17 18:11:22 +08:00
mingholy.lmh
3a22ba9659 test: fix test expectations for new DEFAULT_OUTPUT_TOKEN_LIMIT (32000)
- Update anthropic tests: 32768 → 32000
- Update dashscope tests: 32768 → 32000
- Update unknown model test to respect user config (40000 preserved)

Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
2026-03-17 18:10:59 +08:00
mingholy.lmh
ec292ec581 feat: distinguish known/unknown models for output token limit handling
- Add hasExplicitOutputLimit() to detect models with defined output limits
- For known models: cap user max_tokens to model limit (avoid API errors)
- For unknown models (deployment aliases, self-hosted): respect user config
- Update tests to cover new behavior

Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
2026-03-17 18:04:33 +08:00
mingholy.lmh
45495e44b1 chore: reduce DEFAULT_OUTPUT_TOKEN_LIMIT from 32768 to 32000 for legacy model support
Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
2026-03-17 17:47:26 +08:00
tanzhenxin
78faa365cb feat(tools): allow read-file access to OS temp directory
- Add os.tmpdir() to allowed paths in read-file tool
- Add tests for reading files from OS temp directory
- Add terminal capture scenario for PR review testing

This supports the PR review workflow which saves context to temp files.

Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
2026-03-17 17:13:23 +08:00
mingholy.lmh
4f58306a15 fix: improve max_tokens handling with conservative defaults
- Increase DEFAULT_OUTPUT_TOKEN_LIMIT from 16K to 32K
- Remove auto-detection from modelsConfig, apply at provider level
- Use conservative default (min of model limit and 32K) when user hasn't configured max_tokens
- Respect user configuration but cap at model's max output limit to avoid API errors

Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
2026-03-17 17:01:41 +08:00
tanzhenxin
1a977b62f3 refactor(skills): improve PR review workflow for better agent coordination
- Checkout PR branch instead of remote viewing for full file access
- Save PR context to temp file to avoid repeating in agent prompts
- Add guidance to prevent 4x diff duplication across agents
- Include environment restoration step after review

This enables agents to read files directly and use git diff against base branch,
improving review quality and reducing prompt bloat.

Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
2026-03-17 16:50:25 +08:00
qqqys
1788be9c57 refactor(search): implement backend fuzzy search and improve file handling
- Removed client-side filtering for search queries; fuzzy search is now handled by the backend.
- Enhanced file search initialization and caching mechanisms in FileMessageHandler.
- Added file watchers for cache invalidation on file system changes.
- Updated completion trigger logic to prioritize '@' over '/' for path-like queries.
- Reset last query on file selection to ensure fresh search results.

This refactor improves search efficiency and maintains accurate file references in the application.
2026-03-17 16:38:14 +08:00
tanzhenxin
e133627e8a feat(core): execute task tools concurrently for improved performance
Task tools spawn independent sub-agents with no shared mutable state,
making them safe to run in parallel. This change executes all task
tools concurrently while keeping other tools sequential to preserve
any implicit ordering the model may rely on.

Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
2026-03-17 15:45:17 +08:00
tanzhenxin
12293033b4 refactor(agents): remove outputFile from tool result events
Remove unused outputFile property from AgentToolResultEvent and its
associated test case. This property is not needed for agent tool
result handling.

Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
2026-03-17 14:29:02 +08:00
LaZzyMan
2506276ae5 fix test ci 2026-03-17 14:16:53 +08:00
DragonnZhang
7886ec6c8d fix(keypress): handle unsupported Kitty CSI-u keys and recover plain text
- Add helper functions for better code organization (createPrintableKey,
  getCompleteCsiSequenceLength, parsePlainTextPrefix)
- Drop unsupported Kitty CSI-u keys without blocking subsequent input
- Recover plain text that arrives in same chunk after unsupported CSI-u keys
- Add comprehensive tests for edge cases (CAPS_LOCK, event metadata variants)

Improves robustness of Kitty keyboard protocol parsing by gracefully
handling unsupported key codes and ensuring plain text input is not lost.

Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
2026-03-17 14:02:41 +08:00
tanzhenxin
edd8388b27 Merge branch 'main' into feature/arena-agent-collaboration 2026-03-17 14:00:47 +08:00
tanzhenxin
dbfa5b3e8e
Merge pull request #2423 from QwenLM/test/shell-and-encoding-utilities
Some checks failed
Qwen Code CI / Lint (push) Failing after 3s
Qwen Code CI / CodeQL (push) Failing after 3s
Qwen Code CI / Test (push) Has been skipped
Qwen Code CI / Test-1 (push) Has been skipped
Qwen Code CI / Test-2 (push) Has been skipped
Qwen Code CI / Test-3 (push) Has been skipped
Qwen Code CI / Test-4 (push) Has been skipped
Qwen Code CI / Test-5 (push) Has been skipped
Qwen Code CI / Test-6 (push) Has been skipped
Qwen Code CI / Test-7 (push) Has been skipped
Qwen Code CI / Test-8 (push) Has been skipped
E2E Tests / E2E Test (Linux) - sandbox:docker (push) Failing after 4s
E2E Tests / E2E Test (Linux) - sandbox:none (push) Failing after 3s
Qwen Code CI / Post Coverage Comment (push) Has been skipped
E2E Tests / E2E Test - macOS (push) Has been cancelled
fix(shell): resolve Windows encoding issues for non-ASCII output
2026-03-16 23:07:40 +08:00
tanzhenxin
17939baa66 feat(core): auto-detect UTF-8 BOM for PowerShell scripts on Windows
- Add needsUtf8Bom() to detect when UTF-8 BOM is needed based on file
  extension and system code page
- PowerShell 5.1 on non-UTF-8 Windows systems (e.g. GBK) requires BOM
  to read scripts correctly
- Remove default UTF8 encoding; undefined now triggers auto-detection
- Add tests for needsUtf8Bom() covering Windows/non-Windows scenarios

This ensures PowerShell scripts are written with UTF-8 BOM on systems
that need it, fixing character encoding issues for non-ASCII content.

Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
2026-03-16 22:44:53 +08:00
zach
46b9c75f83 fix(cli): show newest-first history for Ctrl+R command search 2026-03-16 14:16:03 +00:00
tanzhenxin
82e0064871 refactor(core): use dynamic terminal dimensions for replay
- Remove hardcoded REPLAY_TERMINAL_COLS/ROWS/SCROLLBACK constants
- Pass actual terminal dimensions to replayTerminalOutput()
- Increase scrollback buffer to 10000 for better output capture

This ensures terminal replay uses the actual terminal size instead of
fixed dimensions, improving output accuracy.

Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
2026-03-16 21:16:20 +08:00
tanzhenxin
922fca51af test(services): simplify os module mock in fileSystemService tests
Refactor the vi.mock for 'os' to use a simpler direct mock object
instead of the importOriginal pattern, making the test setup more concise.

Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
2026-03-16 20:35:48 +08:00
tanzhenxin
9b1bd731d7 refactor(core): improve platform-specific encoding and shell utilities
- Make CRLF conversion for .bat/.cmd files Windows-only
- Extract PowerShell UTF-8 prefix into reusable function
- Replace custom UTF-8 validation with Node.js built-in isUtf8()

This ensures .bat/.cmd files are only converted on Windows where cmd.exe
actually requires CRLF, and reduces code duplication for shell encoding.

Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
2026-03-16 20:29:21 +08:00
tanzhenxin
08c1ce94c0 chore(shell): remove Codex CLI reference from comment
This removes an unnecessary external reference from the codebase.

Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
2026-03-16 19:21:10 +08:00
qwen-code-ci-bot
bcbd82d2d4
chore: bump version to 0.12.5 (#2422)
Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
2026-03-16 19:05:05 +08:00
tanzhenxin
d8dab9acd7 docs(encoding): clarify detectEncodingFromBuffer responsibility
Update JSDoc to make clear that detectEncodingFromBuffer only performs
chardet statistical detection and returns null on failure. Callers like
getCachedEncodingForBuffer are responsible for providing fallback logic.

Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
2026-03-16 18:57:01 +08:00
tanzhenxin
f93e5f0d46 refactor(encoding): consolidate system encoding fallback logic
Move system encoding fallback from detectEncodingFromBuffer into
getCachedEncodingForBuffer for clearer responsibility. Remove unused
WINDOWS_UTF8_CODE_PAGE export and inline the value.

Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
2026-03-16 18:53:51 +08:00
tanzhenxin
3dd0bacb8d fix(shell): force UTF-8 output for PowerShell on Windows
- Prefix PowerShell commands with [Console]::OutputEncoding=UTF8
- Re-detect encoding on full buffer after streaming completes
- Move system encoding fallback into detectEncodingFromBuffer

This ensures CJK and other non-ASCII characters are correctly decoded
on Windows systems with non-UTF-8 system codepages (e.g., GBK/CP936).

Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
2026-03-16 17:39:47 +08:00
Mingholy
f9016165c7
Merge pull request #2356 from netbrah/fix/auto-detect-max-output-tokens
fix: auto-detect max_tokens from model when not set by provider
2026-03-16 17:34:35 +08:00
Mingholy
b4b0041a34
Merge pull request #2411 from QwenLM/fix-default-output-limit
Increase DEFAULT_OUTPUT_TOKEN_LIMIT from 8K to 16K
2026-03-16 17:34:22 +08:00
mingholy.lmh
6f67b12446 fix: lint error 2026-03-16 17:21:32 +08:00
mingholy.lmh
7f0942066b fix(models): improve max_tokens auto-detection source tracking and add tests
Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>

- Fix generationConfigSources to preserve existing source info when auto-detecting max_tokens

- Add unit tests for max_tokens fallback logic
2026-03-16 17:21:04 +08:00
netbrah
6e34b3102b fix: auto-detect max_tokens from model when not set by provider
When modelProviders config does not specify samplingParams.max_tokens,
requests to non-Qwen models (Claude, GPT, Gemini, etc.) omit max_tokens
entirely. Many APIs default to a small value (e.g., Anthropic via
VertexAI defaults to 4096), causing long responses to be truncated
mid-generation — often breaking tool call parameters.

Fix: apply tokenLimit(model, 'output') as a fallback in
applyResolvedModelDefaults(), following the same pattern already used
for contextWindowSize and modalities auto-detection.

Output limits from tokenLimits.ts:
  - Claude Opus 4.6: 128K
  - Claude Sonnet 4.6 / fallback: 64K
  - GPT-5.x: 128K
  - Gemini 3.x: 64K
  - Qwen 3.5: 64K

Made-with: Cursor
2026-03-16 17:21:04 +08:00
tanzhenxin
9b822958dc refactor(encoding): try UTF-8 first in buffer encoding detection
- Rename outputEncoding to detectedEncoding for clarity
- Add isValidUtf8 helper using TextDecoder in fatal mode
- Restructure detection: UTF-8 → chardet → system encoding
- Update tests to use non-UTF-8 bytes for accurate testing

This prevents misclassifying UTF-8 output as legacy codepages on systems
where the system encoding (e.g., GBK) could also decode those bytes.

Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
2026-03-16 16:15:06 +08:00
tanzhenxin
dca3ea1c95 refactor(shell): remove Windows encoding wrapper logic
Remove wrapCommandForWindowsEncoding and forceUtf8Output parameter,
relying solely on getCachedEncodingForBuffer for encoding detection.

This simplifies the shell execution flow by removing the chcp 65001
command prefixing approach.

Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
2026-03-16 15:32:35 +08:00
DennisYu07
4c7694daa8
Merge pull request #1904 from Sakuranda/fix/1892-windows-path-env
fix(core): normalize Windows PATH-like env keys for shell execution
2026-03-16 14:34:23 +08:00