spawn

vrr/spawn

mirror of https://github.com/OpenRouterTeam/spawn.git synced 2026-05-19 16:39:50 +00:00

Author	SHA1	Message	Date
A	bb4deaf24c	fix: reset stale cache flag, guard gcloud null, validate DO config (#2073 ) - manifest.ts: Reset _staleCache on successful fetch/cache load so isStaleCache() doesn't falsely report stale data after reconnecting - gcp.ts: Replace getGcloudCmd()! with requireGcloudCmd() that throws a descriptive error instead of crashing with null dereference - digitalocean.ts: Replace unvalidated JSON.parse return with parseJsonObj() + isString()/isNumber() guards for type safety Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-01 17:08:38 -05:00
A	b066b3a1ac	test: Remove duplicate and theatrical tests (#2070 ) * test: Remove duplicate and theatrical tests Remove 4 duplicate tests spread across security and command resolution test files: - security-edge-cases.test.ts: Remove "should accept prompts with dollar signs in safe contexts" (duplicate of security.test.ts "should accept dollar signs in non-expansion contexts") - security-edge-cases.test.ts: Remove "should accept prompts with pipe to non-shell commands" (duplicate of security.test.ts "should accept prompts with pipes to other commands") - security-edge-cases.test.ts: Remove "should accept prompts with semicolons not followed by rm" (duplicate of security-encoding.test.ts "should accept semicolons not followed by rm") - commands-swap-resolve.test.ts: Remove "should not log resolution for already- lowercase exact keys" (duplicate of commands-resolve-run.test.ts "should not log resolution when exact keys are used" — identical cmdRun("claude", "sprite") call) No functional behavior changes. Test count: 1389 → 1385. * fix: remove trailing blank line for biome format --------- Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-01 15:53:27 -05:00
A	dbec3e768c	fix(security): add restrictive file permissions to Sprite saveVmConnection (#2068 ) The Sprite saveVmConnection() wrote ~/.spawn/last-connection.json without restrictive permissions (defaulting to umask 0o644/0o755), unlike the shared saveVmConnection() in history.ts which correctly uses mode 0o700 for the directory and 0o600 for the file. On multi-user systems this could expose server names and connection metadata to other users. Agent: security-auditor Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-01 15:44:15 -05:00
A	43843a882b	refactor: Remove dead setupOpenclawBatched export and unused batched setup mechanism (#2069 ) - Delete the exported `setupOpenclawBatched` function from `agent-setup.ts` — it was never imported or called anywhere in the codebase (confirmed via exhaustive grep) - Remove the unused `setup?` field from the `AgentConfig` interface in `agents.ts` — no agent implementation ever assigned this property - Remove the dead `if (agent.setup)` branch from `orchestrate.ts` — the batched path was always unreachable because no agent provided a `setup` callback Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-01 15:43:43 -05:00
A	548cfdf0b1	fix(security): apply base64 exec escaping to remaining 4 cloud drivers (#2067 ) PR #2064 fixed _exec_long shell injection for DigitalOcean and Sprite but missed the same bash -c '${cmd}' pattern in Hetzner, GCP, AWS, and Daytona. Apply the same base64-encoding fix to all four. Agent: security-auditor Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-01 11:50:33 -08:00
A	8025376ee6	fix: use ignore stdin for SSH commands to prevent deadlock on Hetzner and DigitalOcean (#2066 ) runServer and runServerCapture on Hetzner and DigitalOcean used stdio:["pipe",...] for stdin but called proc.stdin!.end() AFTER await proc.exited. If a remote SSH command reads from stdin (apt prompts, read calls), the process deadlocks until the 5-minute timeout fires. AWS and GCP correctly use stdio:["ignore",...]. Fix: change stdin from "pipe" to "ignore" in runServer and runServerCapture for both Hetzner and DigitalOcean, removing the now-unnecessary stdin.end() calls. Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-01 18:48:33 +00:00
A	862030b776	fix(security): escape cmd args in _exec_long to prevent shell injection (#2064 ) Base64-encode the command before embedding it in bash -c to prevent single-quote breakout in _sprite_exec_long and _digitalocean_exec_long. Fixes #2063 Agent: security-auditor Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-01 12:42:27 -05:00
A	631722151c	fix(hetzner): add SPAWN_CUSTOM guard to promptServerType (#2065 ) Every other cloud provider (GCP, DO, Daytona) gates their size/type picker behind SPAWN_CUSTOM !== "1" so users get a fast default launch. Hetzner's promptLocation had the guard but promptServerType was missing it, causing an unexpected interactive picker on the cheapest/most-used cloud when running without --custom. Bump CLI to 0.11.19. Agent: team-lead Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-01 12:41:32 -05:00
A	902f3091d3	test: Remove duplicate and theatrical tests (#2061 ) * test: Remove duplicate and theatrical tests - Remove 3 duplicate/always-pass tests from commands-update-download.test.ts: "should reject script without shebang via validateScriptContent" (already covered in download-and-failure.test.ts and cmdrun-happy-path.test.ts), "should reject script with dangerous pattern" (duplicate + always-pass or-chain), "should show script-not-found message when both URLs 404" (duplicate of existing 404 test) - Remove 5 theatrical tests from custom-flag.test.ts that only verify constant arrays have entries with defined id/label fields (SERVER_TYPES, LOCATIONS, DROPLET_SIZES, DO_REGIONS, SANDBOX_SIZES) — these test constant existence, not behavior, and fail due to @openrouter/spawn-shared import error - Bump CLI version to 0.11.18 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix: Remove trailing blank lines in custom-flag.test.ts for biome format Agent: pr-maintainer Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> --------- Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by: B <6723574+louisgv@users.noreply.github.com>	2026-03-01 09:09:29 -08:00
A	add83bbd6c	refactor: Remove dead code and stale references (#2062 ) Remove orphaned sh/test/fixtures/ directory. These shell fixture files (_shared_agent_assertions.sh, hetzner/_env.sh, hetzner/_api_assertions.sh, digitalocean/_env.sh, digitalocean/_api_assertions.sh) were part of a mock test harness (mock.sh) that was removed from the repository. The fixture files reference `assert_api_called` and `MOCK_LOG` variables that are never defined anywhere, confirming they are unreachable dead code. Scan results: - Dead code (sh/test/fixtures/): 5 orphaned fixture files removed - Dead code (sh/shared, packages/cli/src/): none found - Stale references to non-existent files: none found - Python usage (python3 -c / python -c): none found - Duplicate utilities across cloud modules: loadTokenFromConfig pattern exists in hetzner/daytona/digitalocean but reads from different cloud- specific config paths — cannot be consolidated (confirmed intentional) - Stale comments: none found beyond those already fixed in prior PRs -- qa/code-quality Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-01 11:45:24 -05:00
A	3fa0c82c91	refactor: Remove dead code and stale references (#2059 ) * refactor: Remove dead code and stale references Fix stale path comment in sh/shared/key-request.sh that referenced the wrong location for loadTokenFromConfig (cli/src/ instead of packages/cli/src/). Also updated wording from "Must match" to "Keep in sync with" to more accurately describe the relationship. Scan results (no other issues found): - Dead code (sh/shared, packages/cli/src): none found - Stale references to non-existent files: none found - Python usage (python3 -c / python -c): none found - Duplicate utilities across cloud modules: none (cloud-specific config loading functions share the same pattern but read from different paths and cannot be consolidated) - Stale comments: one stale path in key-request.sh (fixed) -- qa/code-quality Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * refactor: Remove dead code and stale references Remove duplicate `log_step` function from `sh/shared/github-auth.sh`. `log_step` was identical to `log_info` (same printf format, same output stream) and had no semantic distinction. All 6 call sites are updated to use `log_info` directly. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> --------- Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-01 08:26:10 -05:00
A	cef14ce9ea	fix(sprite): pass timeoutSecs through to runSprite, add kill-on-timeout (#2060 ) runSprite was wired as CloudRunner.runServer but silently dropped the timeoutSecs parameter. All other clouds (Hetzner, DO, AWS, GCP, Daytona) implement kill-on-timeout via setTimeout+killWithTimeout; Sprite had zero timeout protection, so a hung agent install (e.g. ZeroClaw's 600s Rust compile, Claude Code's 300s install) would hang forever on Sprite. Matches the pattern used by every other cloud provider. Agent: team-lead Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-01 08:21:26 -05:00
A	40e14c2b6b	test: Remove duplicate and theatrical tests (#2058 ) - manifest-type-contracts.test.ts: Replace 42 per-agent/per-cloud silently-skipping tests (if field === undefined { return }) with 6 aggregate tests that filter to entries that actually have the field and assert the field count > 0 so the test can't pass vacuously. Affected: pre_launch, config_files, notes (agents); defaults, notes, icon (clouds). - history.test.ts: Remove always-pass test "throws for SPAWN_HOME pointing to /root when user home is different" — it silently returns early whenever the CI environment runs as root (which it always does), providing zero signal. The adjacent "throws for SPAWN_HOME outside home directory" test already covers this semantic. Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-01 08:20:44 -05:00
Ahmed Abushagur	45caf4b96b	fix(sprite): fix all 6 Sprite agent installs for E2E (#2057 ) * fix(sprite): fix all 6 Sprite agent installs for E2E - Use `npm install -g --prefix` instead of `npm config set prefix` to avoid creating .npmrc that conflicts with nvm on Sprite VMs - Fix shell environment setup to only modify .bash_profile (not .bashrc) so non-interactive bash -c commands retain PATH config - Add $HOME/.cargo/bin to PATH for zeroclaw (Sprite has no ~/.cargo/env) - Add $HOME/.local/bin to PATH config for Sprite shell environment - Add sprite E2E cloud driver with org detection, config corruption fix, direct command embedding (not $1 positional), and retry logic - Fix provision.sh to kill full process tree after timeout (prevents orphaned sprite exec sessions from corrupting config) - Fix verify.sh zeroclaw check to not rely on ~/.cargo/env existing Tested: 6/6 Sprite agents pass E2E (claude, codex, openclaw, zeroclaw, opencode, kilocode). Hermes is not in the Sprite manifest. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: biome format - collapse runSprite call to single line Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-01 07:15:09 -05:00
A	66bd992198	refactor: Remove dead code and stale references (#2055 ) Fix stale path comment in sh/shared/key-request.sh that referenced the wrong location for loadTokenFromConfig (cli/src/ instead of packages/cli/src/). Also updated wording from "Must match" to "Keep in sync with" to more accurately describe the relationship. Scan results (no other issues found): - Dead code (sh/shared, packages/cli/src): none found - Stale references to non-existent files: none found - Python usage (python3 -c / python -c): none found - Duplicate utilities across cloud modules: none (cloud-specific config loading functions share the same pattern but read from different paths and cannot be consolidated) - Stale comments: one stale path in key-request.sh (fixed) -- qa/code-quality Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-01 04:45:54 -05:00
A	41adfbdb0a	test: Remove duplicate and theatrical tests (#2054 ) * test: Remove duplicate and theatrical tests - check-entity.test.ts: Remove 'kind parameter consistency' describe block (9 tests) that fully duplicated coverage already provided by 'valid entities', 'wrong-type detection: cloud given as agent', and 'wrong-type detection: agent given as cloud' describes. Also remove redundant loop assertions ('should return true for all three agent keys' etc.) that repeated what the individual named tests already covered. - manifest-cache-lifecycle.test.ts: Replace Record<string, any> with Record<string, AgentDef> and Record<string, CloudDef> for type safety. 1401 tests pass, 0 fail. Lint clean. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix: remove extra blank line to pass Biome format check Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> * test: Remove duplicate and theatrical tests Remove redundant if-guards around always-present agent metadata fields in manifest-type-contracts.test.ts. All 12 metadata fields (creator, repo, license, created, added, github_stars, stars_updated, language, runtime, category, tagline, tags) are present on all 7 agents, making the if (agent.X !== undefined) guards always-truthy dead code that misleads readers into thinking tests might be skipped. Restructure into proper per-agent describe blocks to make the test structure honest and clear. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix: Apply Biome formatting to array literal Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> * test: Remove duplicate and theatrical tests Fix always-pass anti-pattern in manifest-type-contracts.test.ts where optional field type tests were gated by `if (field !== undefined)` OUTSIDE the `it()` block. When no agent/cloud had the field, zero tests registered, giving false confidence. Changes: - Agent optional field types: move condition inside `it()`, test always runs - Cloud optional field types: same fix, tests always register for all clouds - Interactive prompts structure: consolidate filtered loop into one `it()` that iterates internally, avoiding silently-absent test registrations - Config files structure: same consolidation pattern Before: 551 pass, 64 fail (optional field tests only registered per-agent) After: 566 pass, 64 fail (optional field tests register for every agent/cloud) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * style: fix biome lint errors - add block statements to early returns Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> * style: apply biome formatter to block statements Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> --------- Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-01 04:16:05 -05:00
A	210519a590	fix(security): document PKCE migration path for DigitalOcean OAuth (#2056 ) Adds explicit monitoring obligation and step-by-step migration checklist to the DO_CLIENT_SECRET comment. Tracks when PKCE was last verified unsupported (2026-03) and what to do when it becomes available, addressing the technical debt tracking request from #2041. Fixes #2041 Agent: security-auditor Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-01 04:11:40 -05:00
A	84133fb036	fix(security): replace validateLaunchCmd blocklist with allowlist (#2053 ) * fix(security): replace validateLaunchCmd blocklist with allowlist The blocklist pattern />\\s\\// (redirection to absolute path) matched 2>/dev/null, which appears in every valid launch command generated by agent-setup.ts. This caused mergeLastConnection() to reject and discard all connection data, breaking the spawn list → "Enter agent" reconnect flow and spawn last. Replace the blocklist with a strict allowlist: each semicolon-separated segment must match one of: - source ~/.<rc-file> [2>/dev/null] - export PATH=<safe-path> - <binary> [simple-args] This simultaneously fixes the false-positive and closes the latent injection gap (the old blocklist only blocked '; rm' but not arbitrary '; <other-cmd>'). Fixes #2052 Agent: issue-fixer Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> style: apply biome formatter to fix CI format check Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> --------- Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-01 03:12:27 -05:00
A	9be0c9597d	fix: spawn last reconnects to existing VM instead of always reprovisioning (#2051 ) `cmdLast()` was always calling `cmdRun()`, creating a brand-new VM every time. Wire it into `handleRecordAction()` instead, which already contains the reconnect-vs-rerun logic used by `spawn list`: if the latest history record has a live connection (IP + server ID), the user is offered options to enter the agent or SSH in; only if no connection info exists (or the user chooses "Spawn a new VM") does it provision a fresh instance. Also bumps CLI version 0.11.13 → 0.11.14. Fixes #2050 Agent: issue-fixer Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-01 01:49:36 -05:00
A	e1ef024981	test: Remove duplicate and theatrical tests (#2047 ) * test: Remove duplicate and theatrical tests - check-entity.test.ts: Remove 'kind parameter consistency' describe block (9 tests) that fully duplicated coverage already provided by 'valid entities', 'wrong-type detection: cloud given as agent', and 'wrong-type detection: agent given as cloud' describes. Also remove redundant loop assertions ('should return true for all three agent keys' etc.) that repeated what the individual named tests already covered. - manifest-cache-lifecycle.test.ts: Replace Record<string, any> with Record<string, AgentDef> and Record<string, CloudDef> for type safety. 1401 tests pass, 0 fail. Lint clean. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix: remove extra blank line to pass Biome format check Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> * test: Remove duplicate and theatrical tests Remove redundant if-guards around always-present agent metadata fields in manifest-type-contracts.test.ts. All 12 metadata fields (creator, repo, license, created, added, github_stars, stars_updated, language, runtime, category, tagline, tags) are present on all 7 agents, making the if (agent.X !== undefined) guards always-truthy dead code that misleads readers into thinking tests might be skipped. Restructure into proper per-agent describe blocks to make the test structure honest and clear. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix: Apply Biome formatting to array literal Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> --------- Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-01 00:11:48 -05:00
Ahmed Abushagur	2155a36a1f	fix(e2e): use explicit PATH for hermes, codex, and kilocode binary checks (#2049 ) Non-interactive SSH sessions don't source .bashrc or .zshrc, so binaries installed to ~/.local/bin (hermes via uv) or ~/.npm-global/bin (codex, kilocode via npm) were not found during verification. Fix all three verify functions and the codex input test to use explicit PATH with the known install directories, matching the pattern already used by openclaw and claude. Verified: AWS 7/7, Hetzner 6/6 implemented, GCP 6/6 implemented. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-01 00:07:33 -05:00
Ahmed Abushagur	cd758589c3	fix(e2e): robust DigitalOcean teardown with retry and deletion confirmation (#2046 ) The teardown was doing a single DELETE without --max-time, so connection timeouts caused HTTP 000 and the droplet was never deleted. When running 6 agents in batches of 3, batch 1's stale droplet caused batch 2 to fail with "will exceed your droplet limit." Fix: - Add --max-time 30 to prevent curl hangs - Retry DELETE up to 3 times on failure - Poll the API after DELETE to confirm the droplet is actually gone (up to 60s) - Remove -f flag from curl so %{http_code} is always captured Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-28 22:09:15 -05:00
Ahmed Abushagur	d8dbf952c2	fix(e2e): fix openclaw input test (PATH, CLI flags, gateway restart) (#2045 ) The openclaw e2e input test was failing for three independent reasons: 1. PATH missing ~/.npm-global/bin — openclaw installs via npm with a custom prefix, but verify_openclaw and input_test_openclaw didn't include that directory in PATH 2. Wrong CLI invocation — used `openclaw -p` which doesn't exist. The correct command is `openclaw agent --message "..." --session-id` 3. Gateway not running — the openclaw gateway (port 18789) can die between provisioning and verification. Now the input test ensures the gateway is running before sending the prompt. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-02-28 20:41:48 -05:00
A	708326a693	refactor: Remove dead exports across TypeScript modules (#2044 ) Remove `export` from functions that are only used internally within their own file and never imported elsewhere. Affected modules: - `history.ts`: `mergeLastConnection` (only called internally by `getActiveServers`/`filterHistory`) - `update-check.ts`: `isUpdateBackedOff` (only called internally by `checkForUpdates`) - `aws/aws.ts`: `waitForSsh` (only called internally by `waitForCloudInit`) - `gcp/gcp.ts`: `waitForSsh` (only called internally by `waitForCloudInit`) - `daytona/daytona.ts`: `waitForSsh` (only called internally by `waitForCloudInit`) - `shared/agent-setup.ts`: 11 implementation helpers (`installAgent`, `uploadConfigFile`, `installClaudeCode`, `setupClaudeCodeConfig`, `promptGithubAuth`, `setupCodexConfig`, `setupOpenclawConfig`, `startGateway`, `setupZeroclawConfig`, `ensureSwapSpace`, `openCodeInstallCmd`) — all only used within `createAgents()` All 1410 tests pass, biome lint clean (0 errors). Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-02-28 20:39:38 -05:00
A	dc27f456ea	fix(e2e): add sh/aws/hermes.sh and mark aws/hermes as implemented (#2042 ) Hermes agent was fully implemented in shared/agent-setup.ts (createAgents includes hermes with install, envVars, and launchCmd) but the convenience shell script sh/aws/hermes.sh was missing and the matrix showed "missing". - Add sh/aws/hermes.sh (matching pattern of all other aws agent scripts) - Update manifest.json: "aws/hermes" -> "implemented" - Update sh/aws/README.md with Hermes Agent install command Discovered during QA E2E sweep: E2E suite lists hermes in ALL_AGENTS and would attempt to provision it; without the matrix entry and script the agent was silently untracked as a missing implementation gap. Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-28 20:38:26 -05:00
A	cdab002a13	fix: correct install URLs in README to use openrouter.ai/labs/spawn/cli/ (#2040 ) The bash install URL was missing the /cli/ path segment, and the PowerShell URL was pointing to raw.githubusercontent.com instead of the openrouter.ai domain. Fixed all three occurrences. Closes #2032 Co-authored-by: Claude <claude@anthropic.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-02-28 18:40:12 -05:00
A	ad38a66c96	fix: extend HOME hardening to local.ts and commands.ts (missed by #2026/#2036) (#2037 ) When HOME is unset (containers, systemd, cron, some CI), two files still used `process.env.HOME \|\| ""` which produces broken paths: - local/local.ts:38 — uploadFile() expands ~ to "", writing config files to filesystem root (e.g. /.openclaw/openclaw.json) instead of ~/.openclaw/ - commands.ts:898 — hasCloudConfigCredentials() checks "" + .config/spawn/ resolving to /.config/spawn/{cloud}.json, silently failing credential detection and causing false "Missing credentials" warnings on every run Fix: add `import { homedir } from "node:os"` to both files and change `process.env.HOME \|\| ""` to `process.env.HOME \|\| homedir()`. Completes the HOME hardening series started in #2026 and #2036. Agent: team-lead Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-28 15:27:29 -08:00
A	0904e53c85	fix: surface OAuth denial error immediately instead of waiting 120s (#2039 ) When a user denies OAuth access on OpenRouter or DigitalOcean, the CLI now immediately shows a clear error message and falls back to manual key entry, instead of silently waiting the full 120s poll timeout. Changes: - OpenRouter OAuth: check for `error` query param on callback, set `oauthDenied` flag, show denial-specific HTML page in browser, break polling loop early, and log a clear terminal error - DigitalOcean OAuth: add `oauthDenied` flag (error detection already existed but the polling loop still waited 120s), break loop early Fixes #2038 Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-28 14:42:32 -08:00
A	44f67462ed	fix: extend HOME hardening to ssh-keys, sprite, gcp (3 files missed by #2026 ) (#2036 ) When HOME is unset (containers, systemd, cron), process.env.HOME produces literal "undefined" in path strings: - ssh-keys.ts: SSH discovery/generation writes to "undefined/.ssh/" - sprite.ts: CLI detection misses ~/.local/bin, PATH update corrupted - gcp.ts: gcloud detection misses ~/google-cloud-sdk/bin, PATH corrupted Same fix as #2026: use `process.env.HOME \|\| homedir()` via `join()` for robust OS-level fallback when HOME is unset. Agent: team-lead Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-28 13:51:09 -08:00
A	8975c3c986	fix(openclaw): add pre-launch tip for sequential channel setup (#2035 ) Display a hint before launching `openclaw tui` warning users to set up channels one at a time. Concurrent token pastes trigger a race condition inside OpenClaw's TUI that causes setup to hang. Adds an optional `preLaunchMsg` field to `AgentConfig` so any agent can surface a user-visible tip just before its interactive session starts. OpenClaw sets this to advise sequential channel onboarding. Fixes #2030 Agent: issue-fixer Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-28 13:11:56 -08:00
A	8a1643a73d	fix(assets): add missing Hermes agent icon and .sources.json entry (#2034 ) Fixes #2027 PR #2023 added Hermes to manifest.json with an icon URL pointing to assets/agents/hermes.png, but the file and the .sources.json entry were never committed. This caused a 404 on the icon URL. - Download hermes.png from Nous Research GitHub org avatar - Add hermes entry to assets/agents/.sources.json Agent: issue-fixer Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-28 13:10:27 -08:00
A	e5157b5264	fix: correct install URLs in README.md (#2033 ) - Add missing /cli/ path segment to main bash install URL - Replace raw.githubusercontent.com with openrouter.ai domain for PowerShell install - Replace raw.githubusercontent.com with openrouter.ai domain for troubleshooting install Fixes #2032 Agent: issue-fixer Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-28 13:09:17 -08:00
A	ab08476a63	refactor: Remove dead code and stale references (#2028 ) - Add `hermes` to ALL_AGENTS in sh/e2e/lib/common.sh (stale: hermes added to manifest.json in #2023 but never added to the e2e agent list) - Add verify_hermes() and input_test_hermes() to sh/e2e/lib/verify.sh and wire them into verify_agent/run_input_test dispatch tables - Remove dead log_warn() from sh/shared/github-auth.sh (defined but never called) - Remove dead get_cloud_env_vars() from sh/shared/key-request.sh (no callers outside file) - Remove dead invalidate_cloud_key() from sh/shared/key-request.sh (no callers anywhere) Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-02-28 12:55:09 -08:00
A	7003a8ad40	fix: replace module-level process.env.HOME with homedir() in config paths (#2026 ) Fixes #2025 Silent credential loss in Docker/CI when HOME is unset. Use node:os homedir() which has OS-level fallbacks and matches history.ts pattern. Prefer process.env.HOME when set to respect test sandboxing overrides. Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-28 12:54:12 -08:00
A	6f6b2a7895	feat(spa): redesign tool footer — show latest tool, stats line, expandable history (#2031 ) Replace the continuously-appending tool list with a cleaner 3-part footer: 1. Latest tool call (swapped, not appended) — shows current tool + hint 2. Compact stats line — "1× Bash, 4× Read, 5× Grep, 8× Glob" 3. Expandable attachment — full ordered tool history (Slack auto-collapses) Also adds toolHint field to SlackSegment, extracts formatToolStats and formatToolHistory as tested helpers, and adds 19 new unit tests. Co-authored-by: Claude <claude@anthropic.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-02-28 15:43:29 -05:00
A	a8b7bb7fb9	test: consolidate wasteful one-per-flag tests in unknown-flags suite (#2029 ) Remove 18 redundant/theatrical tests from unknown-flags.test.ts: - Removed duplicate 'should detect --verbose as unknown' test (same name, same assertion, nearly identical inputs as the test 28 lines above it) - Consolidated 14 individual 'allows known flags' tests — each called findUnknownFlag([flag]) with a single flag and expected null — into one data-driven loop over all 17 flags; same coverage, 13 fewer test cases - Removed 'should contain --name flag' which is fully subsumed by the immediately following 'should contain all expected flags' test that already verifies --name along with 22 other flags Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-28 20:42:04 +00:00
A	912d8305c5	fix: add missing hermes agent to createAgents() and update sprite agents list (#2024 ) The hermes agent was added to manifest.json and sh/sprite/hermes.sh in feat #2023, but createAgents() in shared/agent-setup.ts was not updated. This caused sh/sprite/hermes.sh to throw "Unknown agent: hermes" when resolveAgent() was called. - Add hermes entry to createAgents() with correct install cmd, envVars, and launchCmd - Update sprite/main.ts usage error message to include hermes Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-28 12:08:18 -05:00
A	187cd6aafc	feat(agent): add Hermes Agent (Nous Research) (#2023 ) Implements Hermes Agent on Sprite cloud. Hermes is a persistent AI agent by Nous Research with multi-platform messaging (Telegram, Discord, Slack, CLI), memory across sessions, tool use, and native OpenRouter support. - Add hermes agent entry to manifest.json with env config - Add matrix entries for all 7 clouds (sprite implemented, rest missing) - Create sh/sprite/hermes.sh thin bash shim Closes #1952 Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-28 06:53:13 -08:00
A	111512220f	fix(security): replace eval with safe export parser in provision.sh (#2022 ) Replace eval "${cloud_env}" with a while-read loop that only processes lines matching ^export VAR="VALUE" — arbitrary shell commands in cloud driver output are silently ignored. Also removes the now-unused cloud_env local variable since the while-read loop calls cloud_headless_env directly. Fixes #2019 Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-28 09:43:29 -05:00
A	54c7764d03	fix(security): prevent cmd injection in sprite exec via positional args (#2021 ) Replace bash -c "${cmd}" with bash -c '$1' _ "${cmd}" so the command is passed as a positional argument, not interpolated into the shell string. Same pattern applied to the timeout wrapper. Fixes #2018 Agent: security-auditor Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-28 09:42:13 -05:00
A	ba06e49d97	test: Remove duplicate and theatrical tests (#2020 ) Remove 162 lines of duplicate/theatrical test code across 5 files: - manifest-integrity.test.ts: Drop 5 crude toBeTruthy() field checks superseded by manifest-type-contracts.test.ts with precise type assertions - manifest-type-contracts.test.ts: Remove dead deps block (0 agents have deps) and Dotenv configuration describe block (0 agents have dotenv) — both generated zero tests at runtime - commands-error-paths.test.ts: Remove 5-test "swapped arguments detection" block duplicated by the more thorough commands-swap-resolve.test.ts - run-path-credential-display.test.ts: Remove 5-test "implementation checks" block that retested getImplementedClouds/Agents already covered by commands-exported-utils.test.ts; also remove now-unused imports - manifest.test.ts: Remove redundant toBeDefined() assertion after an already-present toHaveProperty() check on the same field Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-28 07:49:55 -05:00
A	ae3f4001cc	refactor: Remove dead code and stale references (#2017 ) Remove stale comments in test files that referenced deleted test files (commands-untested.test.ts, commands-helpers.test.ts) and remove "Agent: X" metadata annotations that became obsolete after the theatrical test cleanup. All 1424 tests pass, biome lint clean. Co-authored-by: spawn-qa-bot <qa@openrouter.ai>	2026-02-28 02:07:48 -08:00
Ahmed Abushagur	d5461adc16	feat: SPAWN_CLI_DIR env var to force local source in e2e (#2015 ) * feat: SPAWN_CLI_DIR env var to force local source in e2e and shell scripts When SPAWN_CLI_DIR is set, the entire toolchain uses local TypeScript source instead of downloading pre-bundled scripts from GitHub releases: - e2e.sh: auto-sets SPAWN_CLI_DIR to repo root when running locally - provision.sh: exports SPAWN_CLI_DIR into the headless subshell - commands.ts: reads local shell scripts instead of fetching from CDN - All 36 cloud/agent shell scripts: exec local main.ts when set This enables e2e tests to validate local changes before they're released. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix(security): add path traversal defense to SPAWN_CLI_DIR script loading Canonicalize the path via realpathSync and verify it stays inside the resolved CLI directory before reading. Prevents SPAWN_CLI_DIR from being used to read arbitrary files via ../ traversal. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix(security): harden SPAWN_CLI_DIR path traversal defense - Validate cloud/agent names don't contain '..', '/' or '\' before constructing file paths - Fix root-directory edge case in prefix check by handling trailing separator correctly Agent: pr-maintainer Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: B <6723574+louisgv@users.noreply.github.com>	2026-02-28 04:14:36 -05:00
A	ee83e46d9b	test: Remove duplicate and theatrical tests (#2016 ) Remove 3 duplicate tests from security-edge-cases.test.ts that were already covered in security.test.ts: - validateIdentifier: "reject 65-char identifier" (duplicate of security.test.ts "should reject overly long identifiers") - validateScriptContent: "accept wget\|sh" (duplicate of security.test.ts "should accept scripts with wget\|bash") - validatePrompt: "accept prompts at exactly the max length" (duplicate of security.test.ts "should accept prompts at the size limit") The edge-cases file retains unique tests: 64-char boundary check, single char identifiers, mkfs with multiple filesystems, extra-whitespace pipe detection, etc. Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-02-28 04:11:48 -05:00
Ahmed Abushagur	c1e605c884	fix(e2e): increase server sizes and install timeouts (#2014 ) E2E tests were failing because agent installs didn't complete within the default 120s timeout, and small VMs ran out of memory during builds. - INSTALL_WAIT: 120s → 300s (with per-cloud override via cloud_install_wait) - AWS: nano_3_0 → medium_3_0 (all agents need 4GB for reliable installs) - DigitalOcean: s-1vcpu-512mb-10gb → s-2vcpu-2gb, cap at 3 parallel - GCP: e2-medium → e2-standard-2 - Hetzner: cap at 5 parallel (primary IP limit) - Sprite: 300s install wait (slower exec than SSH) Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-02-28 00:25:36 -08:00
Ahmed Abushagur	c595c90dc4	fix(e2e): prevent multi-cloud name and file collisions (#2013 ) When multiple clouds run in parallel, they generate the same app name (e.g. e2e-claude-TIMESTAMP) and write to the same temp files (.exit/.stdout/.stderr), causing data corruption. - Include ACTIVE_CLOUD in make_app_name: e2e-gcp-claude-TIMESTAMP - Use ${app_name} instead of ${agent} for provision temp files Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-02-28 03:12:05 -05:00
A	f0cadc2758	test: Remove duplicate and theatrical tests (#2009 ) * test: Remove duplicate and theatrical tests Found and removed three categories of test anti-patterns: 1. Conditional always-pass guard (`with-retry-result.test.ts`): The `if (!result.ok)` wrapper around `expect(result.error).toBeInstanceOf(Error)` was silently skippable if the condition ever evaluates false. Replaced with a type-narrowing early return (`if (result.ok) { return; }`) so the assertions always execute when the code path is reached. 2. Duplicate `loadManifest` tests (`manifest.test.ts`): Five tests covering stale-cache fallback, no-cache-network-fail, invalid-fetch-fallback, fetch timeout, and cached-instance reuse were exact duplicates of tests already in `manifest-cache-lifecycle.test.ts` (which covers these scenarios more thoroughly). Removed the duplicates; kept the three tests unique to `manifest.test.ts` (fetch URL validation, fresh-cache skips network, forceRefresh behavior). 3. Duplicate manifest structural checks (`manifest-type-contracts.test.ts`): The "Cross-referential consistency" (matrix coverage, no-invalid-refs, valid status values) and "Display name uniqueness" (unique agent/cloud names, no key collisions) describe blocks duplicated tests already present in `manifest-integrity.test.ts`. Removed the 6 redundant tests; the unique type validation tests (per-field `typeof` checks) remain. Net result: -11 tests, 0 new failures, lint clean. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix: biome format issues in test files Remove trailing blank line in manifest.test.ts and expand single-line if block in with-retry-result.test.ts to satisfy biome formatter. Agent: pr-maintainer Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> --------- Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by: B <6723574+louisgv@users.noreply.github.com>	2026-02-28 00:08:57 -08:00
A	31b52e92c3	fix(daytona): handle daytona-sandbox sentinel in cmdConnect (#2011 ) When a user selects a Daytona sandbox in `spawn list` and chooses "SSH into VM", cmdConnect was missing the daytona-sandbox sentinel handler. It fell through to the generic SSH path and tried to run `ssh daytona@daytona-sandbox`, which fails with a cryptic error. Added the handler mirroring the existing pattern in cmdEnterAgent. Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-27 22:47:58 -08:00
Ahmed Abushagur	b4b1b149e9	fix(gcp): remove invalid --subnet-region flag from instance creation (#2012 ) `gcloud compute instances create` doesn't accept `--subnet-region`. The subnet region is automatically inferred from `--zone`. This flag causes all GCP provisioning to fail. Bump CLI to 0.11.3. Signed-off-by: Ahmed Abushagur <ahmed@abushagur.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-28 01:13:35 -05:00
A	87978b424d	refactor: Remove dead code and stale references (#2010 ) Dead code removed: - `cleanup_stale_apps` function in `sh/e2e/lib/cleanup.sh` — defined but never called; `e2e.sh` calls `cloud_cleanup_stale` directly instead - `generateEnvConfig` and `AgentConfig` re-exports from all 7 cloud-specific `agents.ts` modules (aws, hetzner, gcp, digitalocean, daytona, local, sprite) — nothing imported these from the cloud modules; they were already available via `@openrouter/spawn-shared` and `../shared/agents` All 1435 tests pass, biome lint is clean (0 errors). Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-28 00:18:20 -05:00

1 2 3 4 5 ...

1688 commits