spawn

vrr/spawn

mirror of https://github.com/OpenRouterTeam/spawn.git synced 2026-05-11 05:00:07 +00:00

Author	SHA1	Message	Date
A	0e145c2e8a	refactor: Remove dead getState() exports from cloud modules (#2108 ) Removed `getState()` from hetzner, gcp, daytona, sprite, and digitalocean modules. These functions were exported but never called from production code or tests. The aws module retains its `getState()` which is tested in custom-flag.test.ts to verify region state mutation. Also bumps CLI patch version (0.12.2 → 0.12.3) as required per project rules. Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-02 10:58:48 -08:00
Ahmed Abushagur	b35ecdfaff	fix(e2e): drop --timeout flag from openclaw agent command (#2109 ) The outer cloud_exec_long already enforces a timeout via INPUT_TEST_TIMEOUT. The inner --timeout 60 was redundant and could cause premature kills before the outer timeout expired. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: A <258483684+la14-1@users.noreply.github.com>	2026-03-02 10:57:44 -08:00
A	9c7fd0c7da	fix: add 30s fetch timeout to all cloud API client wrappers (#2110 ) * fix: add 30s AbortSignal.timeout to all cloud API fetch wrappers All four cloud provider API client wrapper functions (lightsailRest, hetznerApi, doFetch, daytonaApi) were missing fetch timeouts, while every other fetch call in the codebase already used AbortSignal.timeout. A stalled TCP connection to any cloud provider would cause the CLI to hang indefinitely with no user feedback or recovery path. Agent: team-lead Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> * fix: apply biome formatting to fetch timeout changes --------- Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-02 13:55:34 -05:00
Ahmed Abushagur	35badf8d1b	fix(e2e): fail hard when OpenClaw gateway doesn't start (#2111 ) The gateway startup was silently swallowed with log_warn, masking real failures. Now tracks whether the port came up and fails the test with the gateway log contents if it didn't. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-02 13:51:29 -05:00
A	1c35f648b9	test: Remove conditional always-pass guards in manifest-integrity (#2107 ) The script content tests had `if (existsSync(scriptPath))` guards that silently skipped files if they weren't found on disk. Since the prior test in the same describe block already asserts that all implemented entries have script files, these guards were dead code that could mask failures. Remove them so every sampled script is unconditionally read and validated. Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-02 11:50:12 -05:00
A	e3229ebe65	fix(digitalocean): throw on non-2xx responses to prevent silent destroy failures (#2106 ) The destroyServer function only checked for status 204 (success) and responses containing a `message` field. Any other non-204 response (e.g., 403 with no message, 500 with HTML body) fell through to the success path, logging "Droplet destroyed" and returning normally. This caused `spawn delete` to mark the droplet as deleted in history while it was still running and incurring charges. Now all non-204 responses unconditionally throw, matching the pattern established in Hetzner (#2105) and Daytona (#2102). Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-02 09:22:25 -05:00
A	c61a9c6085	fix(hetzner): throw on non-2xx responses to prevent silent failures (#2105 ) hetznerApi() was returning raw response text on non-2xx final attempts instead of throwing, identical to the bug fixed in daytonaApi() by PR #2102. Impact: a 5xx when fetching SSH keys caused createServer() to receive null from parseJsonObj(), fall back to an empty SSH key list, and provision a server the user cannot SSH into -- with no error or warning. Fix matches the pattern from lightsailRest() (AWS) and PR #2102 (Daytona): throw with the HTTP status code after retries exhaust on any !resp.ok response. Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-02 05:36:42 -08:00
A	23acf62e1a	refactor: Remove dead code and stale references (#2104 ) Update stale comments referencing the monolithic commands.ts (replaced by commands/ directory in #2095). The shim at commands.ts still exists for backward compat but all internal code paths now live under commands/. - Fix test file comments: point checkEntity to commands/shared.ts, cmdInteractive to commands/interactive.ts, cmdRun paths to commands/run.ts, cmdUpdate to commands/update.ts, cmdCloudInfo to commands/info.ts, cmdListClear to commands/list.ts - Fix guidance-data.ts: update stale extraction comment and circular-dep notes to reference commands/run.ts instead of commands.ts - Fix CLAUDE.md file structure diagram: show commands/ directory and note commands.ts is a compatibility shim - Fix packages/cli/README.md: update directory structure diagram and "Adding a New Command" guide to reflect the per-module layout Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-02 08:32:02 -05:00
A	9a88460b1d	fix(daytona): throw on non-2xx responses to prevent silent destroy failures (#2102 ) daytonaApi() returned the raw response body on all final attempts regardless of HTTP status. destroyServer() checked hasApiError() which only matched 4xx patterns, so persistent 500/502/503 responses were silently treated as success — users were told "Sandbox destroyed" when billing continued. Fix: throw on !resp.ok after retries exhaust, consistent with other cloud modules (aws, gcp). destroyServer() now uses try/catch. testDaytonaToken() already had try/catch so the hasApiError() check was redundant. Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-02 07:17:30 -05:00
A	afa17d09ff	test: remove Bun.spawnSync subprocess calls from ssh-keys tests (#2101 ) * test: remove Bun.spawnSync subprocess calls from ssh-keys tests Replace Bun.spawnSync calls to ssh-keygen in createFakeKeyPair helper with plain file writes, and mock Bun.spawnSync via spyOn for all tests that exercise getKeyType, generateSshKey, and getSshFingerprint. Cuts test runtime from 1212ms to ~47ms (25x speedup) and brings the test file into compliance with the CLAUDE.md no-subprocess-spawning policy. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix: apply biome formatting to ssh-keys test Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> --------- Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-02 04:25:24 -05:00
Ahmed Abushagur	9242d44cbb	fix(e2e): add --force to sprite destroy in teardown (#2100 ) Without --force, sprite destroy prompts for confirmation in non-interactive E2E mode and silently fails ("Ok, come back later!"), leaving stale instances running indefinitely. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-02 00:24:43 -08:00
Ahmed Abushagur	297e7ff21c	fix(sprite): add 60s timeout to destroyServer to prevent hanging (#2096 ) The sprite destroy command can hang indefinitely when the Sprite API is unresponsive. Add a 60s timeout using the existing killWithTimeout utility (same pattern as runSprite). Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com> Co-authored-by: A <258483684+la14-1@users.noreply.github.com>	2026-03-01 22:37:57 -08:00
A	3911b5bc28	refactor: resolve conflicts — merge packages/shared into packages/cli/src/shared (#2092 ) Rebased fix/issue-2083 onto main after commands.ts split (PR #2095). Key resolutions: - commands.ts: kept HEAD shim (re-exports from ./commands/index.ts) - package.json: kept PR version 0.12.0 without @openrouter/spawn-shared dep - Fixed @openrouter/spawn-shared imports in commands/shared.ts, commands/update.ts, and __tests__/orchestrate.test.ts that were added after the PR branched All 1390 tests pass, biome lint clean. Agent: pr-maintainer Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-01 22:05:41 -08:00
A	dc489fa652	docs: update __tests__/README.md to reflect current test structure (#2098 ) The README was referencing commands.test.ts and integration.test.ts which no longer exist (split into 20+ specialized files), and incorrectly stated the test runner was vitest (banned — project uses bun:test). Rewrote to accurately document all 44 test files with their coverage scope. Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-02 00:04:53 -05:00
A	3d5812602c	refactor: convert hermes scripts to thin-wrapper pattern (#2094 ) - hetzner/hermes.sh: add thin-shim header comment, blank line after _ensure_bun definition, and section comments (Local checkout, Remote) to match the canonical pattern used by aws/gcp/sprite/daytona - digitalocean/hermes.sh: add detailed _run_with_restart comment block and inline section comments (Normal exit, SIGTERM, Other failure) to match digitalocean/claude.sh Both scripts now produce identical output to their cloud's reference script (e.g. aws/hermes.sh, digitalocean/claude.sh) when the agent name is substituted. Fixes #2082 Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-01 20:27:59 -08:00
A	aa7584096d	test: centralize @clack/prompts mock in test-helpers.ts (#2090 ) * test: centralize @clack/prompts mock in test-helpers.ts Adds mockClackPrompts() factory to test-helpers.ts, eliminating ~15-line duplicate mock.module blocks from 19 test files. When @clack/prompts adds a new export, only one file needs updating instead of 19. Fixes #2080 Agent: test-engineer Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> * style: fix Biome formatting after merge with main Agent: pr-maintainer Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> --------- Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-01 20:27:24 -08:00
A	65b872afa3	refactor: split commands.ts into per-command modules (#2095 ) * refactor: split commands.ts into per-command modules commands.ts (3,522 lines) is split into focused modules under packages/cli/src/commands/: - shared.ts: helpers, entity resolution, credentials - interactive.ts: cmdInteractive, cmdAgentInteractive - run.ts: cmdRun, cmdRunHeadless - list.ts: cmdList, cmdLast, cmdListClear - delete.ts: cmdDelete - info.ts: cmdMatrix, cmdAgents, cmdClouds, cmdAgentInfo, cmdCloudInfo - update.ts: cmdUpdate - help.ts: cmdHelp - pick.ts: cmdPick - index.ts: barrel re-export commands.ts is kept as a 2-line compatibility shim so all existing imports from "./commands.js" continue to work unchanged. Fixes #2076 Agent: complexity-hunter Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> * fix: apply biome formatting Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> --------- Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-01 23:24:15 -05:00
A	4802852fac	fix: derive agent lists dynamically in usage messages (#2089 ) Six of seven cloud main.ts files had hardcoded agent lists that were stale (missing hermes, added in #2084). Replace all hardcoded lists with Object.keys(agents).join(", ") so they stay in sync automatically when new agents are added. Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com>	2026-03-01 23:21:15 -05:00
A	29cb0d4c69	test: add unit tests for shared/orchestrate.ts (#2093 ) * test: add unit tests for shared/orchestrate.ts Add 19 focused tests for runOrchestration covering: - Cloud lifecycle method ordering (authenticate -> provision -> install -> launch) - API key acquisition and injection into agent.envVars - process.exit forwarding of interactiveSession exit codes - Optional hooks: preProvision (non-fatal on error), configure, preLaunch - Model selection gating via agent.modelPrompt / modelDefault - Restart loop wrapping for non-local clouds vs raw passthrough for local - saveLaunchCmd receives unwrapped command - prepareStdinForHandoff and offerGithubAuth integration Fixes #2077 Agent: test-engineer Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> * style: fix Biome formatting in orchestrate.test.ts Agent: pr-maintainer Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> --------- Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-01 23:19:33 -05:00
A	608e8b4bdd	refactor: eliminate 7 identical agents.ts boilerplate files (#2088 ) * refactor: eliminate 7 identical agents.ts boilerplate files Adds createCloudAgents() factory to shared/agent-setup.ts, reducing each cloud's agents.ts from 16-line copy-paste to a single call. Net reduction of 49 lines across 9 files. Fixes #2078 Agent: complexity-hunter Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> * chore: apply biome formatting --------- Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-01 22:06:57 -05:00
A	b755c6966c	feat: add local/hermes to complete the 7x7 matrix (#2091 ) Fixes #2079 — local/hermes was the only remaining missing entry in the cloud×agent matrix. All 49 entries are now implemented. Agent: ux-engineer Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-01 22:04:38 -05:00
Ahmed Abushagur	6dabe88016	fix(qa): add key preflight, retry loop, and failure issue filing for schedule mode (#2087 ) - Request missing API keys via key-server in quality mode (was fixtures-only) - Retry quality cycle up to 3 times before giving up - File a GitHub issue with log tail when all retries are exhausted Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-01 20:44:59 -05:00
A	2605c9cb83	refactor: Remove dead code and stale references (#2086 ) - Add getSpawnCloudConfigPath(cloud) helper to shared/ui.ts, eliminating four identical 3-line getConfigPath() functions across hetzner, daytona, digitalocean, and aws cloud modules - Remove duplicate homedir/join imports from hetzner, daytona, digitalocean, and aws now that the shared helper centralizes the path construction - Update commands.ts hasCloudConfigCredentials to use the shared helper and drop its stale homedir import - Bump CLI to 0.11.24 (patch) Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-01 20:43:12 -05:00
A	9043143a5a	test: Remove duplicate and theatrical tests (#2085 ) Remove 14 redundant tests across two files: - check-entity.test.ts: Remove 6 individual "valid entities" tests (claude, codex, cline as agents; sprite, hetzner, vultr as clouds) that are fully covered by the loop-based "all manifest agents/clouds validate correctly" describe blocks which exhaustively test all entities. - check-entity.test.ts: Remove 6 individual "wrong-type detection" tests (3 clouds-as-agents, 3 agents-as-clouds) that are covered by the loop tests "should reject every agent key when checked as cloud" and "should reject every cloud key when checked as agent". - cloud-init.test.ts: Consolidate 3 NODE_INSTALL_CMD tests into 1. "is a non-empty string" is theatrical (tests the constant exists, not what it does). Merge with the two content checks into a single test. Test count: 1385 → 1371. Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-01 20:41:40 -05:00
A	d713f9650f	feat: add hermes agent to 4 clouds, bump install wait to 600s (#2084 ) - Add hermes shim scripts for GCP, Hetzner, DigitalOcean, and Daytona - Update manifest.json matrix entries from "missing" to "implemented" - Bump default INSTALL_WAIT from 300s to 600s to fix zeroclaw timeout on small VMs where Rust compilation takes 8-12 minutes - Update cloud READMEs with hermes usage docs - Bump CLI version to 0.11.18 Co-authored-by: Ahmed Abushagur <ahmed@abushagur.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-01 19:31:50 -05:00
A	69d1971abf	fix(security): remove space from token validation charset in key-request.sh (#2074 ) API tokens never contain spaces; allowing them risks word splitting in downstream unquoted uses of these env vars. Updated both the shell regex in key-request.sh and the corresponding TypeScript regexes in digitalocean.ts to stay in sync. Fixes #2072 Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-01 17:10:22 -05:00
A	bb4deaf24c	fix: reset stale cache flag, guard gcloud null, validate DO config (#2073 ) - manifest.ts: Reset _staleCache on successful fetch/cache load so isStaleCache() doesn't falsely report stale data after reconnecting - gcp.ts: Replace getGcloudCmd()! with requireGcloudCmd() that throws a descriptive error instead of crashing with null dereference - digitalocean.ts: Replace unvalidated JSON.parse return with parseJsonObj() + isString()/isNumber() guards for type safety Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-01 17:08:38 -05:00
A	b066b3a1ac	test: Remove duplicate and theatrical tests (#2070 ) * test: Remove duplicate and theatrical tests Remove 4 duplicate tests spread across security and command resolution test files: - security-edge-cases.test.ts: Remove "should accept prompts with dollar signs in safe contexts" (duplicate of security.test.ts "should accept dollar signs in non-expansion contexts") - security-edge-cases.test.ts: Remove "should accept prompts with pipe to non-shell commands" (duplicate of security.test.ts "should accept prompts with pipes to other commands") - security-edge-cases.test.ts: Remove "should accept prompts with semicolons not followed by rm" (duplicate of security-encoding.test.ts "should accept semicolons not followed by rm") - commands-swap-resolve.test.ts: Remove "should not log resolution for already- lowercase exact keys" (duplicate of commands-resolve-run.test.ts "should not log resolution when exact keys are used" — identical cmdRun("claude", "sprite") call) No functional behavior changes. Test count: 1389 → 1385. * fix: remove trailing blank line for biome format --------- Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-01 15:53:27 -05:00
A	dbec3e768c	fix(security): add restrictive file permissions to Sprite saveVmConnection (#2068 ) The Sprite saveVmConnection() wrote ~/.spawn/last-connection.json without restrictive permissions (defaulting to umask 0o644/0o755), unlike the shared saveVmConnection() in history.ts which correctly uses mode 0o700 for the directory and 0o600 for the file. On multi-user systems this could expose server names and connection metadata to other users. Agent: security-auditor Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-01 15:44:15 -05:00
A	43843a882b	refactor: Remove dead setupOpenclawBatched export and unused batched setup mechanism (#2069 ) - Delete the exported `setupOpenclawBatched` function from `agent-setup.ts` — it was never imported or called anywhere in the codebase (confirmed via exhaustive grep) - Remove the unused `setup?` field from the `AgentConfig` interface in `agents.ts` — no agent implementation ever assigned this property - Remove the dead `if (agent.setup)` branch from `orchestrate.ts` — the batched path was always unreachable because no agent provided a `setup` callback Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-01 15:43:43 -05:00
A	548cfdf0b1	fix(security): apply base64 exec escaping to remaining 4 cloud drivers (#2067 ) PR #2064 fixed _exec_long shell injection for DigitalOcean and Sprite but missed the same bash -c '${cmd}' pattern in Hetzner, GCP, AWS, and Daytona. Apply the same base64-encoding fix to all four. Agent: security-auditor Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-01 11:50:33 -08:00
A	8025376ee6	fix: use ignore stdin for SSH commands to prevent deadlock on Hetzner and DigitalOcean (#2066 ) runServer and runServerCapture on Hetzner and DigitalOcean used stdio:["pipe",...] for stdin but called proc.stdin!.end() AFTER await proc.exited. If a remote SSH command reads from stdin (apt prompts, read calls), the process deadlocks until the 5-minute timeout fires. AWS and GCP correctly use stdio:["ignore",...]. Fix: change stdin from "pipe" to "ignore" in runServer and runServerCapture for both Hetzner and DigitalOcean, removing the now-unnecessary stdin.end() calls. Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-01 18:48:33 +00:00
A	862030b776	fix(security): escape cmd args in _exec_long to prevent shell injection (#2064 ) Base64-encode the command before embedding it in bash -c to prevent single-quote breakout in _sprite_exec_long and _digitalocean_exec_long. Fixes #2063 Agent: security-auditor Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-01 12:42:27 -05:00
A	631722151c	fix(hetzner): add SPAWN_CUSTOM guard to promptServerType (#2065 ) Every other cloud provider (GCP, DO, Daytona) gates their size/type picker behind SPAWN_CUSTOM !== "1" so users get a fast default launch. Hetzner's promptLocation had the guard but promptServerType was missing it, causing an unexpected interactive picker on the cheapest/most-used cloud when running without --custom. Bump CLI to 0.11.19. Agent: team-lead Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-01 12:41:32 -05:00
A	902f3091d3	test: Remove duplicate and theatrical tests (#2061 ) * test: Remove duplicate and theatrical tests - Remove 3 duplicate/always-pass tests from commands-update-download.test.ts: "should reject script without shebang via validateScriptContent" (already covered in download-and-failure.test.ts and cmdrun-happy-path.test.ts), "should reject script with dangerous pattern" (duplicate + always-pass or-chain), "should show script-not-found message when both URLs 404" (duplicate of existing 404 test) - Remove 5 theatrical tests from custom-flag.test.ts that only verify constant arrays have entries with defined id/label fields (SERVER_TYPES, LOCATIONS, DROPLET_SIZES, DO_REGIONS, SANDBOX_SIZES) — these test constant existence, not behavior, and fail due to @openrouter/spawn-shared import error - Bump CLI version to 0.11.18 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix: Remove trailing blank lines in custom-flag.test.ts for biome format Agent: pr-maintainer Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> --------- Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by: B <6723574+louisgv@users.noreply.github.com>	2026-03-01 09:09:29 -08:00
A	add83bbd6c	refactor: Remove dead code and stale references (#2062 ) Remove orphaned sh/test/fixtures/ directory. These shell fixture files (_shared_agent_assertions.sh, hetzner/_env.sh, hetzner/_api_assertions.sh, digitalocean/_env.sh, digitalocean/_api_assertions.sh) were part of a mock test harness (mock.sh) that was removed from the repository. The fixture files reference `assert_api_called` and `MOCK_LOG` variables that are never defined anywhere, confirming they are unreachable dead code. Scan results: - Dead code (sh/test/fixtures/): 5 orphaned fixture files removed - Dead code (sh/shared, packages/cli/src/): none found - Stale references to non-existent files: none found - Python usage (python3 -c / python -c): none found - Duplicate utilities across cloud modules: loadTokenFromConfig pattern exists in hetzner/daytona/digitalocean but reads from different cloud- specific config paths — cannot be consolidated (confirmed intentional) - Stale comments: none found beyond those already fixed in prior PRs -- qa/code-quality Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-01 11:45:24 -05:00
A	3fa0c82c91	refactor: Remove dead code and stale references (#2059 ) * refactor: Remove dead code and stale references Fix stale path comment in sh/shared/key-request.sh that referenced the wrong location for loadTokenFromConfig (cli/src/ instead of packages/cli/src/). Also updated wording from "Must match" to "Keep in sync with" to more accurately describe the relationship. Scan results (no other issues found): - Dead code (sh/shared, packages/cli/src): none found - Stale references to non-existent files: none found - Python usage (python3 -c / python -c): none found - Duplicate utilities across cloud modules: none (cloud-specific config loading functions share the same pattern but read from different paths and cannot be consolidated) - Stale comments: one stale path in key-request.sh (fixed) -- qa/code-quality Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * refactor: Remove dead code and stale references Remove duplicate `log_step` function from `sh/shared/github-auth.sh`. `log_step` was identical to `log_info` (same printf format, same output stream) and had no semantic distinction. All 6 call sites are updated to use `log_info` directly. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> --------- Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-01 08:26:10 -05:00
A	cef14ce9ea	fix(sprite): pass timeoutSecs through to runSprite, add kill-on-timeout (#2060 ) runSprite was wired as CloudRunner.runServer but silently dropped the timeoutSecs parameter. All other clouds (Hetzner, DO, AWS, GCP, Daytona) implement kill-on-timeout via setTimeout+killWithTimeout; Sprite had zero timeout protection, so a hung agent install (e.g. ZeroClaw's 600s Rust compile, Claude Code's 300s install) would hang forever on Sprite. Matches the pattern used by every other cloud provider. Agent: team-lead Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-01 08:21:26 -05:00
A	40e14c2b6b	test: Remove duplicate and theatrical tests (#2058 ) - manifest-type-contracts.test.ts: Replace 42 per-agent/per-cloud silently-skipping tests (if field === undefined { return }) with 6 aggregate tests that filter to entries that actually have the field and assert the field count > 0 so the test can't pass vacuously. Affected: pre_launch, config_files, notes (agents); defaults, notes, icon (clouds). - history.test.ts: Remove always-pass test "throws for SPAWN_HOME pointing to /root when user home is different" — it silently returns early whenever the CI environment runs as root (which it always does), providing zero signal. The adjacent "throws for SPAWN_HOME outside home directory" test already covers this semantic. Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-01 08:20:44 -05:00
Ahmed Abushagur	45caf4b96b	fix(sprite): fix all 6 Sprite agent installs for E2E (#2057 ) * fix(sprite): fix all 6 Sprite agent installs for E2E - Use `npm install -g --prefix` instead of `npm config set prefix` to avoid creating .npmrc that conflicts with nvm on Sprite VMs - Fix shell environment setup to only modify .bash_profile (not .bashrc) so non-interactive bash -c commands retain PATH config - Add $HOME/.cargo/bin to PATH for zeroclaw (Sprite has no ~/.cargo/env) - Add $HOME/.local/bin to PATH config for Sprite shell environment - Add sprite E2E cloud driver with org detection, config corruption fix, direct command embedding (not $1 positional), and retry logic - Fix provision.sh to kill full process tree after timeout (prevents orphaned sprite exec sessions from corrupting config) - Fix verify.sh zeroclaw check to not rely on ~/.cargo/env existing Tested: 6/6 Sprite agents pass E2E (claude, codex, openclaw, zeroclaw, opencode, kilocode). Hermes is not in the Sprite manifest. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: biome format - collapse runSprite call to single line Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-01 07:15:09 -05:00
A	66bd992198	refactor: Remove dead code and stale references (#2055 ) Fix stale path comment in sh/shared/key-request.sh that referenced the wrong location for loadTokenFromConfig (cli/src/ instead of packages/cli/src/). Also updated wording from "Must match" to "Keep in sync with" to more accurately describe the relationship. Scan results (no other issues found): - Dead code (sh/shared, packages/cli/src): none found - Stale references to non-existent files: none found - Python usage (python3 -c / python -c): none found - Duplicate utilities across cloud modules: none (cloud-specific config loading functions share the same pattern but read from different paths and cannot be consolidated) - Stale comments: one stale path in key-request.sh (fixed) -- qa/code-quality Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-01 04:45:54 -05:00
A	41adfbdb0a	test: Remove duplicate and theatrical tests (#2054 ) * test: Remove duplicate and theatrical tests - check-entity.test.ts: Remove 'kind parameter consistency' describe block (9 tests) that fully duplicated coverage already provided by 'valid entities', 'wrong-type detection: cloud given as agent', and 'wrong-type detection: agent given as cloud' describes. Also remove redundant loop assertions ('should return true for all three agent keys' etc.) that repeated what the individual named tests already covered. - manifest-cache-lifecycle.test.ts: Replace Record<string, any> with Record<string, AgentDef> and Record<string, CloudDef> for type safety. 1401 tests pass, 0 fail. Lint clean. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix: remove extra blank line to pass Biome format check Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> * test: Remove duplicate and theatrical tests Remove redundant if-guards around always-present agent metadata fields in manifest-type-contracts.test.ts. All 12 metadata fields (creator, repo, license, created, added, github_stars, stars_updated, language, runtime, category, tagline, tags) are present on all 7 agents, making the if (agent.X !== undefined) guards always-truthy dead code that misleads readers into thinking tests might be skipped. Restructure into proper per-agent describe blocks to make the test structure honest and clear. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix: Apply Biome formatting to array literal Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> * test: Remove duplicate and theatrical tests Fix always-pass anti-pattern in manifest-type-contracts.test.ts where optional field type tests were gated by `if (field !== undefined)` OUTSIDE the `it()` block. When no agent/cloud had the field, zero tests registered, giving false confidence. Changes: - Agent optional field types: move condition inside `it()`, test always runs - Cloud optional field types: same fix, tests always register for all clouds - Interactive prompts structure: consolidate filtered loop into one `it()` that iterates internally, avoiding silently-absent test registrations - Config files structure: same consolidation pattern Before: 551 pass, 64 fail (optional field tests only registered per-agent) After: 566 pass, 64 fail (optional field tests register for every agent/cloud) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * style: fix biome lint errors - add block statements to early returns Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> * style: apply biome formatter to block statements Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> --------- Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-01 04:16:05 -05:00
A	210519a590	fix(security): document PKCE migration path for DigitalOcean OAuth (#2056 ) Adds explicit monitoring obligation and step-by-step migration checklist to the DO_CLIENT_SECRET comment. Tracks when PKCE was last verified unsupported (2026-03) and what to do when it becomes available, addressing the technical debt tracking request from #2041. Fixes #2041 Agent: security-auditor Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-01 04:11:40 -05:00
A	84133fb036	fix(security): replace validateLaunchCmd blocklist with allowlist (#2053 ) * fix(security): replace validateLaunchCmd blocklist with allowlist The blocklist pattern />\\s\\// (redirection to absolute path) matched 2>/dev/null, which appears in every valid launch command generated by agent-setup.ts. This caused mergeLastConnection() to reject and discard all connection data, breaking the spawn list → "Enter agent" reconnect flow and spawn last. Replace the blocklist with a strict allowlist: each semicolon-separated segment must match one of: - source ~/.<rc-file> [2>/dev/null] - export PATH=<safe-path> - <binary> [simple-args] This simultaneously fixes the false-positive and closes the latent injection gap (the old blocklist only blocked '; rm' but not arbitrary '; <other-cmd>'). Fixes #2052 Agent: issue-fixer Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> style: apply biome formatter to fix CI format check Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> --------- Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-01 03:12:27 -05:00
A	9be0c9597d	fix: spawn last reconnects to existing VM instead of always reprovisioning (#2051 ) `cmdLast()` was always calling `cmdRun()`, creating a brand-new VM every time. Wire it into `handleRecordAction()` instead, which already contains the reconnect-vs-rerun logic used by `spawn list`: if the latest history record has a live connection (IP + server ID), the user is offered options to enter the agent or SSH in; only if no connection info exists (or the user chooses "Spawn a new VM") does it provision a fresh instance. Also bumps CLI version 0.11.13 → 0.11.14. Fixes #2050 Agent: issue-fixer Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-01 01:49:36 -05:00
A	e1ef024981	test: Remove duplicate and theatrical tests (#2047 ) * test: Remove duplicate and theatrical tests - check-entity.test.ts: Remove 'kind parameter consistency' describe block (9 tests) that fully duplicated coverage already provided by 'valid entities', 'wrong-type detection: cloud given as agent', and 'wrong-type detection: agent given as cloud' describes. Also remove redundant loop assertions ('should return true for all three agent keys' etc.) that repeated what the individual named tests already covered. - manifest-cache-lifecycle.test.ts: Replace Record<string, any> with Record<string, AgentDef> and Record<string, CloudDef> for type safety. 1401 tests pass, 0 fail. Lint clean. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix: remove extra blank line to pass Biome format check Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> * test: Remove duplicate and theatrical tests Remove redundant if-guards around always-present agent metadata fields in manifest-type-contracts.test.ts. All 12 metadata fields (creator, repo, license, created, added, github_stars, stars_updated, language, runtime, category, tagline, tags) are present on all 7 agents, making the if (agent.X !== undefined) guards always-truthy dead code that misleads readers into thinking tests might be skipped. Restructure into proper per-agent describe blocks to make the test structure honest and clear. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix: Apply Biome formatting to array literal Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> --------- Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-01 00:11:48 -05:00
Ahmed Abushagur	2155a36a1f	fix(e2e): use explicit PATH for hermes, codex, and kilocode binary checks (#2049 ) Non-interactive SSH sessions don't source .bashrc or .zshrc, so binaries installed to ~/.local/bin (hermes via uv) or ~/.npm-global/bin (codex, kilocode via npm) were not found during verification. Fix all three verify functions and the codex input test to use explicit PATH with the known install directories, matching the pattern already used by openclaw and claude. Verified: AWS 7/7, Hetzner 6/6 implemented, GCP 6/6 implemented. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-01 00:07:33 -05:00
Ahmed Abushagur	cd758589c3	fix(e2e): robust DigitalOcean teardown with retry and deletion confirmation (#2046 ) The teardown was doing a single DELETE without --max-time, so connection timeouts caused HTTP 000 and the droplet was never deleted. When running 6 agents in batches of 3, batch 1's stale droplet caused batch 2 to fail with "will exceed your droplet limit." Fix: - Add --max-time 30 to prevent curl hangs - Retry DELETE up to 3 times on failure - Poll the API after DELETE to confirm the droplet is actually gone (up to 60s) - Remove -f flag from curl so %{http_code} is always captured Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-28 22:09:15 -05:00
Ahmed Abushagur	d8dbf952c2	fix(e2e): fix openclaw input test (PATH, CLI flags, gateway restart) (#2045 ) The openclaw e2e input test was failing for three independent reasons: 1. PATH missing ~/.npm-global/bin — openclaw installs via npm with a custom prefix, but verify_openclaw and input_test_openclaw didn't include that directory in PATH 2. Wrong CLI invocation — used `openclaw -p` which doesn't exist. The correct command is `openclaw agent --message "..." --session-id` 3. Gateway not running — the openclaw gateway (port 18789) can die between provisioning and verification. Now the input test ensures the gateway is running before sending the prompt. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-02-28 20:41:48 -05:00
A	708326a693	refactor: Remove dead exports across TypeScript modules (#2044 ) Remove `export` from functions that are only used internally within their own file and never imported elsewhere. Affected modules: - `history.ts`: `mergeLastConnection` (only called internally by `getActiveServers`/`filterHistory`) - `update-check.ts`: `isUpdateBackedOff` (only called internally by `checkForUpdates`) - `aws/aws.ts`: `waitForSsh` (only called internally by `waitForCloudInit`) - `gcp/gcp.ts`: `waitForSsh` (only called internally by `waitForCloudInit`) - `daytona/daytona.ts`: `waitForSsh` (only called internally by `waitForCloudInit`) - `shared/agent-setup.ts`: 11 implementation helpers (`installAgent`, `uploadConfigFile`, `installClaudeCode`, `setupClaudeCodeConfig`, `promptGithubAuth`, `setupCodexConfig`, `setupOpenclawConfig`, `startGateway`, `setupZeroclawConfig`, `ensureSwapSpace`, `openCodeInstallCmd`) — all only used within `createAgents()` All 1410 tests pass, biome lint clean (0 errors). Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-02-28 20:39:38 -05:00

1 2 3 4 5 ...

1714 commits