qwen-code

mirror of https://github.com/QwenLM/qwen-code.git synced 2026-04-29 12:11:09 +00:00

Author	SHA1	Message	Date
wenshao	cb359dc996	fix(review): default remote for PR numbers + fork-aware autofix push 1. Pure integer PR numbers default to origin (no URL-based remote matching available). URL-based PRs use the matched remote. 2. Autofix push uses matched remote instead of hardcoded origin. Push failure is expected for fork PRs where user lacks push access — handled gracefully with informative message. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 07:25:48 +08:00
wenshao	f357517dd2	fix(review): temp file cleanup in cross-repo + base-branch remote 1. Cross-repo lightweight mode: no longer skips Step 11 entirely. Worktree removal is skipped (none created) but temp files are still cleaned up. 2. Step 2 base-branch fetch: use the matched remote from Step 1 (e.g., upstream for forks) instead of hardcoded origin. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 07:23:47 +08:00
wenshao	cbf2aa06ac	feat(review): add Java and C/C++ support for deterministic analysis Step 3 now supports: - Java: mvn compile, checkstyle, spotbugs, pmd (Maven); gradle compileJava, checkstyleMain (Gradle) - C/C++: clang-tidy (when compile_commands.json available) Agent 5 build/test precedence now includes Maven and Gradle before Makefile to avoid duplicate builds. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 07:19:56 +08:00
wenshao	b2ec7b068e	fix(review): check all git remotes for cross-repo detection Previously compared URL owner/repo against gh repo view (current repo only). This caused fork-based workflows to be wrongly classified as cross-repo (e.g., local clone of wenshao/jdk reviewing openjdk/jdk PR). Now checks all git remotes (git remote -v) for a match. If any remote points to the URL's repo, proceeds with full worktree mode using that remote for fetch. Only falls back to lightweight mode if NO remote matches. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 07:14:20 +08:00
wenshao	3825962a8b	fix(review): Step 9 owner/repo must respect cross-repo mode Step 9 hardcoded gh repo view to get owner/repo, which returns the current repo — wrong for cross-repo reviews. Now explicitly branches: same-repo uses gh repo view, cross-repo uses URL-extracted owner/repo. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 05:07:32 +08:00
wenshao	720796e699	fix(review): safe branch cleanup + cross-repo agent count - git branch -D: add 2>/dev/null \|\| true to both cleanup sites (Step 1 stale cleanup + Step 11) to prevent abort if ref missing - Cross-repo doc: clarify Agents 1-4 only (Agent 5 build/test requires local codebase, not available in cross-repo mode) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 05:02:13 +08:00
wenshao	6596362342	fix(review): 4 issues found in self-audit 1. Line 22: "Step 2" → "Step 4" (agents are Step 4, rules are Step 2) 2. PR section: clarify "same-repo URL" to avoid ambiguity with cross-repo lightweight path 3. Cross-file analysis: add "same-repo only" qualifier (no local files in cross-repo mode for grep_search) 4. Cross-repo lightweight mode: add existing PR comment fetching to avoid duplicating human feedback Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 04:24:45 +08:00
wenshao	875944544c	fix(review): explicitly require parallel agent launch in single batch Strengthen the parallel instruction from "five parallel review agents" to explicitly requiring all 5 task tool calls in one response. Without this, the LLM may serialize agents (wait for each to finish), losing the wall-clock time benefit of parallel execution. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 04:22:34 +08:00
wenshao	559f2efd42	fix(review): fix cross-repo mode and add documentation SKILL.md: - Step 9 must use owner/repo from URL (not gh repo view) for cross-repo - Step 2 (project rules) skipped in cross-repo mode (no local files) User doc: add Cross-repo PR Review section with same-repo vs cross-repo capability comparison table. DESIGN.md: add "Why cross-repo uses lightweight mode" section explaining CLI tools are inherently repo-local and our approach is best available. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 04:18:55 +08:00
wenshao	0e78ced15a	fix(review): support cross-repo PR URLs in lightweight mode Instead of rejecting cross-repo PR URLs, run in lightweight mode: use gh pr diff <url> to get diff directly (no worktree needed), skip deterministic analysis and build/test, and review diff text only with LLM agents. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 04:16:06 +08:00
wenshao	afbede1d34	fix(review): detect and reject cross-repo PR URLs When a PR URL points to a different repo (e.g., other-org/other-repo), the review would silently operate on the wrong PR in the current repo. Now verifies URL owner/repo matches current repo before proceeding. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 04:15:40 +08:00
wenshao	af4ae2cb83	fix: update stale Step 1.5 reference to Step 3 in DESIGN.md Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 04:13:28 +08:00
wenshao	01c105b5e7	refactor(review): renumber steps from sub-steps to sequential 1-11 Replace confusing sub-step numbering (1, 1.1, 1.5, 2, 2.5, 2.6, 3, 3.5, 4, 4.5, 5) with clean sequential numbering (1-11). Mapping: 1→1, 1.1→2, 1.5→3, 2→4, 2.5→5, 2.6→6, 3→7, 3.5→8, 4→9, 4.5→10, 5→11 Updated all cross-references in SKILL.md, user docs, and PR description. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 04:12:45 +08:00
wenshao	b4ae04ec29	fix(review): capture headRefOid before autofix to prevent line drift If autofix pushes a new commit in Step 3.5, the PR HEAD changes. Step 4's inline comments would then reference the autofix commit where line numbers may have shifted, causing comments on wrong lines. Fix: capture headRefOid in Step 1 (before autofix) and reuse in Step 4. Also fix stale Step 5 comment about worktree/commit SHA. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 03:57:14 +08:00
wenshao	e5127e6fc0	fix(review): shell safety for no-findings path, fix DESIGN.md table - No-findings LGTM path now uses write_file + --body-file instead of inline --body (consistent with shell safety guidance) - DESIGN.md: remove duplicate Agent 5 row from token table - PR description: add DESIGN.md to scope list Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 03:54:57 +08:00
wenshao	fd3d2a8359	docs: add Fork Subagent as future optimization in DESIGN.md Current 7 LLM calls send ~350K input tokens (50K system prompt × 7 independent subagents). Fork Subagent would share prompt cache across all forks, reducing to ~120K effective tokens (~65% savings). Documented as future optimization pending Fork Subagent platform feature implementation. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 03:48:35 +08:00
wenshao	09c95c44c8	fix(review): address 5 Copilot comments - Add error handling for git fetch and gh pr view failures in Step 1 - Skip worktree cleanup on autofix commit/push failure (preserve uncommitted fixes for manual recovery) - Fix Agent 5 counting: it's 1 of the 5 LLM agents (not a separate zero-cost stage). Remove misleading "zero LLM" annotation and duplicate row from token efficiency table. - Reverse audit skip-verification already implemented (comment #53 was stale) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 03:42:20 +08:00
wenshao	ccaab1779b	docs: add /review design document with architecture rationale Records the WHY behind key decisions: - Why 5 agents (not 1 like Copilot): dimensional coverage - Why batch verification (not N agents): O(1) not O(N) cost - Why reverse audit is separate: different cognitive task - Why worktree (not stash/checkout): eliminates a class of bugs - Why "silence > noise": Copilot's 60M review data - Why base-branch rules: security against PR injection - Why follow-up tips (not blocking prompts): UX consistency - Why 7 fixed LLM calls: coverage vs cost balance - Rejected alternatives table with reasoning Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 03:40:17 +08:00
wenshao	6d7afafe3c	perf(review): skip verification for reverse audit findings Reverse audit agent already has full context (all confirmed findings + entire diff), so its findings don't need a second opinion. This brings the actual LLM call count to 7 (5 review + 1 verify + 1 reverse), matching the documented claim. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 03:38:49 +08:00
wenshao	d6b9b35350	fix(review): handle interrupted review cleanup If a previous review was interrupted (Ctrl+C, crash), stale worktree and local ref would block the next review. Now Step 1 checks for and cleans up stale .qwen/tmp/review-pr-<N> worktree and qwen-review/pr-<N> ref before creating new ones. Step 5 also cleans up the local ref alongside the worktree. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 03:35:00 +08:00
wenshao	6b28920a07	docs: fix autofix section redundancy and add pre-fix verdict note - Remove duplicate worktree commit+push bullet (lines 107 vs 109) - Add note that PR submission uses pre-fix verdict since remote isn't updated until autofix push completes Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 03:31:48 +08:00
wenshao	ce47f64ae6	docs: replace Mermaid with plain-text pipeline diagram Mermaid only renders on GitHub; shows as raw code on Nextra, Docusaurus, VS Code preview, and offline viewing. Plain-text ASCII diagram is universally compatible and includes LLM call cost annotations on each stage. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 03:30:34 +08:00
wenshao	1db1dec517	docs: replace step list with Mermaid flowchart in How It Works Visual pipeline diagram showing: - Sequential flow from scope detection to cleanup - 5 parallel agents subgraph - Decision branches for autofix and PR comments - Zero-LLM-cost stages marked - GitHub renders Mermaid natively Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 03:29:20 +08:00
wenshao	295e907d25	fix(review): address 4 Copilot comments on worktree and verification - Step 4.5: use absolute paths for reports/cache in worktree mode (relative paths would land in worktree and be deleted) - Step 1: fetch into qwen-review/pr-<N> ref to avoid clobbering existing local branches - Step 2.6: reverse audit findings use batch verification (not one-per-finding), consistent with Step 2.5 - Doc: clarify reverse audit findings are also batch-verified Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 03:25:24 +08:00
wenshao	9b9bccd27d	docs: update user doc with token efficiency, fix follow-up table - Add Token Efficiency section showing fixed 7 LLM calls breakdown - Fix follow-up table: "fix these issues" is local-only (worktree cleaned up after PR review) - Update PR description with worktree, batch verification, cross-model review, PR comment dedup, and expanded test plan Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 03:22:54 +08:00
wenshao	e65e5bd353	perf(review): replace N verification agents with single batch verification Previously, each finding got its own independent verification agent (N findings = N LLM calls). Now a single verification agent receives all findings at once and verifies them in one pass. Token cost: 6+N variable calls → 7 fixed calls (5 review + 1 verify + 1 reverse audit) Quality: minimal impact — batch verification has fuller cross-finding context Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 03:19:23 +08:00
wenshao	08a797cf76	fix(review): address 4 Copilot comments - Add model attribution to no-findings LGTM path - Handle empty string from getModel() with .trim() \|\| 'unknown' - Add tests for {{model}} with args and empty model ID - Fix doc contradiction: PR autofix pushes automatically from worktree Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 03:13:23 +08:00
wenshao	a5cc2c38cb	fix(review): fix 5 worktree issues found in audit 1. Remove gh pr checkout --detach (modifies working tree, defeats worktree purpose). Use git fetch only. 2. Add dependency installation step (npm ci etc.) in worktree — without it, all TS/JS linting/building fails. 3. Cache and reports written to main project dir, not worktree (would be deleted in Step 5). 4. "fix these issues" tip only for local reviews — worktree is cleaned up after PR review, so interactive fixing not possible. 5. Autofix push uses explicit remote branch name from Step 1. 6. Move incremental check before dependency install to avoid wasting time when no new changes. 7. Fix Step 3 reference: "from Steps 2.5 and 2.6" (includes reverse audit findings). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 03:11:16 +08:00
wenshao	dd2de17de5	feat(review): use ephemeral worktree for PR reviews Replace the stash + checkout + restore flow with an isolated git worktree for PR reviews. This eliminates: - Stash orphan risks (multiple early exit paths) - Wrong-branch risks (Step 5 restore failures) - Build cache pollution (worktree has its own state) - All stash-related error handling complexity New flow: - Step 1: git worktree add .qwen/tmp/review-pr-<number> - All agents operate in the worktree directory - Autofix commits and pushes from the worktree - Step 5: git worktree remove (--force for dirty worktrees) User's working tree is never modified during PR reviews. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 03:07:14 +08:00
wenshao	e502deee28	fix(review): add ms values to Step 1.5 timeout specification run_shell_command expects timeout in milliseconds. Without explicit ms values, implementations may pass 120/60 and time out immediately. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 02:58:45 +08:00
wenshao	5effbb696f	feat(review): read existing PR comments to avoid duplicate feedback For PR reviews, fetch existing inline and general comments via gh api before launching agents. A summary of already-discussed issues is passed to agents so they don't re-report problems that humans or other tools have already flagged. Added to Exclusion Criteria: "Issues already discussed in existing PR comments." Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 01:07:35 +08:00
wenshao	ba8a3a741f	fix(review): fix section numbering, nav entry, and step hierarchy - commands.md: renumber 1.6→1.7→1.8→1.9 after inserting 1.5 Built-in Skills - SKILL.md: promote Reverse audit from ### to ## Step 2.6 for consistent step hierarchy - _meta.ts: add code-review to Features navigation sidebar Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 01:04:21 +08:00
wenshao	757bd9865a	feat(review): model-aware incremental cache for cross-model review The incremental review cache now stores modelId alongside commitSha. When the same PR is re-reviewed with a different model: - Cache detects model change → runs full review (not skipped) - Informs user: "Previous review used X. Running full review with Y for a second opinion." Same SHA + same model still skips as before. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 01:01:52 +08:00
wenshao	50d25733d7	feat(review): add reverse audit step to find coverage gaps Add Step 2.6: after all findings are verified and aggregated, a single reverse audit agent reviews the diff with full knowledge of what was already found, specifically looking for important issues that all previous agents missed. - Only reports Critical/Suggestion level gaps (not Nice to have) - Findings go through the same verification as other agents - Single agent call — minimal cost overhead - If nothing is found, initial review had strong coverage This formalizes the "multi-round undirected audit" pattern that proved effective during the development of this PR (14 rounds, 40+ issues). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 00:56:08 +08:00
wenshao	95a62da039	docs: fix review doc accuracy and remove non-existent /simplify code-review.md: - Add PR URL support to Quick Start - Add "no changes" behavior note - Fix copilot-instructions.md precedence (prefer .github/, not both) - Fix "automatically gitignored" → user must ensure .gitignore coverage - Clarify reports directory is project-relative - Add "What's NOT Flagged" section (exclusion criteria) commands.md: - Replace non-existent /simplify with actual bundled skills (/loop, /qc-helper) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 00:50:26 +08:00
wenshao	ac179b0e02	docs: add /review user documentation Add comprehensive user documentation for the /review command covering: - Quick start examples for all modes (local, PR, file, --comment) - Pipeline overview with all steps explained - Review agents table (5 agents + their focus areas) - Deterministic analysis (supported languages and tools) - Severity levels and PR comment filtering rules - Autofix workflow - PR inline comments (what gets posted vs terminal-only) - Follow-up actions (fix/post comments/commit) - Project review rules (.qwen/review-rules.md etc.) - Incremental review and caching - Review report persistence - Cross-file impact analysis - Design philosophy Also add /review and /simplify to the commands reference page under a new "Built-in Skills" section with link to full docs. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 00:47:31 +08:00
wenshao	fe83b82dab	feat(review): add "fix these issues" follow-up tip After review with unfixed findings (autofix declined/partial/N/A), suggest "type fix these issues" so the LLM can interactively fix each finding using the edit tool without re-running the review. Follow-up tips now cover the complete post-review flow: - Unfixed findings → "fix these issues" - PR with findings → "post comments" - Local all clear → "commit" Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 00:38:15 +08:00
wenshao	5ff22dcd4f	feat(review): add commit tip for local reviews with Approve verdict After local review with no critical issues, suggest "type commit to commit your changes" — the follow-up system picks this up as ghost text so users can Tab to commit. PR reviews keep the existing "post comments" tip. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 00:36:38 +08:00
wenshao	0b290e7031	fix(review): fix comment template, build precedence, and timeout unit - Comment template: replace [{severity}] with {prefix} placeholder so auto-fixed prefix is not dropped - Agent 5: run exactly one build + one test command using precedence order to avoid duplicates (e.g., Makefile wrapping npm) - Clarify timeout as 120000ms for run_shell_command Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 00:35:04 +08:00
wenshao	1d185a19a9	fix(review): avoid re-running review when posting PR comments Changed the tip from "/review <number> --comment" (which re-runs the full review) to "post comments" (which reuses existing findings in the same conversation and jumps directly to Step 4). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 00:30:45 +08:00
wenshao	e4b8d194db	fix(review): use actual PR number in --comment tip The tip placeholder <number> was being output literally, causing the follow-up suggestion system to generate commands with wrong PR IDs. Now explicitly instructs the LLM to substitute the real PR number. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 00:29:11 +08:00
wenshao	d03a7aaf1a	fix(review): clarify PR comment prefix template format Replace ambiguous {prefix} placeholder with concrete examples showing the full Markdown bold + severity tag format for normal and auto-fixed findings. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 00:07:06 +08:00
wenshao	687d0357fc	fix(review): restrict low-confidence findings to terminal only Low-confidence findings now appear only in the terminal "Needs Human Review" section and are never posted as PR inline comments. This resolves the contradiction between "silence > noise" and posting uncertain findings on PRs. Also clarified that "confirmed (low confidence)" is for issues likely real but needing human judgment, not vague suspicions (those should be rejected). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 00:00:56 +08:00
wenshao	a3b3cbe156	fix(review): exclude deleted files from linter target list Deleted/renamed files in the diff would cause per-file linters to fail on non-existent paths, producing false deterministic failures. Now uses --diff-filter=d to exclude deletions from the changed files list. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-06 23:54:46 +08:00
wenshao	71733b5dcb	fix(review): fix misleading test name and comment Rename "should use unknown when model is not available" to "when getModel returns undefined" — the mock config does define getModel, it just returns undefined. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-06 23:47:59 +08:00
wenshao	3ac54581a8	fix(review): address Copilot comments and fix CI build failure - Copilot-instructions.md precedence: prefer .github/ path, do not load both when both exist - Simplify getModel() call: remove unnecessary typeof guard since Config already defines getModel() - Fix TS2352 type error in test: use proper mock cast pattern - Add getModel to base mockConfig for test consistency Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-06 23:31:42 +08:00
wenshao	fab4dc5949	feat(review): add model attribution to PR review summary Add {{model}} template variable support in BundledSkillLoader. When a skill body contains {{model}}, it is replaced with the runtime model ID from config.getModel(). Only skills that use the variable are affected. The /review skill now appends a model attribution footer to PR review summaries: "Reviewed by {model} via Qwen Code /review" This enables cross-model review workflows (e.g., develop with model A, review with model B) with accurate attribution in PR comments. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-06 23:21:44 +08:00
wenshao	fe3d596d72	fix(review): include both staged and unstaged files in local review Step 1.5 said "the diff output" (singular) but local reviews produce two diffs (git diff + git diff --staged). Changed files list now explicitly takes the union of both. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-06 23:12:23 +08:00
wenshao	48340c3acf	fix(review): preserve highest severity during dedup merge When deduplication merges findings with different severities (e.g., a Critical typecheck error with a Suggestion from LLM review), the merged finding now uses the highest severity. Deterministic severity is treated as authoritative and cannot be downgraded. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-06 23:01:59 +08:00
wenshao	ab0259c1a1	fix(review): restore environment on early exit paths When PR checkout fails or incremental review finds no new changes, restore the environment (checkout original branch, pop stash) before stopping. Previously these early exits left the stash orphaned. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-06 22:52:10 +08:00

1 2 3 4 5 ...

5241 commits