spawn

vrr/spawn

mirror of https://github.com/OpenRouterTeam/spawn.git synced 2026-05-20 01:11:18 +00:00

Author	SHA1	Message	Date
A	548cfdf0b1	fix(security): apply base64 exec escaping to remaining 4 cloud drivers (#2067 ) PR #2064 fixed _exec_long shell injection for DigitalOcean and Sprite but missed the same bash -c '${cmd}' pattern in Hetzner, GCP, AWS, and Daytona. Apply the same base64-encoding fix to all four. Agent: security-auditor Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-01 11:50:33 -08:00
A	862030b776	fix(security): escape cmd args in _exec_long to prevent shell injection (#2064 ) Base64-encode the command before embedding it in bash -c to prevent single-quote breakout in _sprite_exec_long and _digitalocean_exec_long. Fixes #2063 Agent: security-auditor Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-01 12:42:27 -05:00
Ahmed Abushagur	45caf4b96b	fix(sprite): fix all 6 Sprite agent installs for E2E (#2057 ) * fix(sprite): fix all 6 Sprite agent installs for E2E - Use `npm install -g --prefix` instead of `npm config set prefix` to avoid creating .npmrc that conflicts with nvm on Sprite VMs - Fix shell environment setup to only modify .bash_profile (not .bashrc) so non-interactive bash -c commands retain PATH config - Add $HOME/.cargo/bin to PATH for zeroclaw (Sprite has no ~/.cargo/env) - Add $HOME/.local/bin to PATH config for Sprite shell environment - Add sprite E2E cloud driver with org detection, config corruption fix, direct command embedding (not $1 positional), and retry logic - Fix provision.sh to kill full process tree after timeout (prevents orphaned sprite exec sessions from corrupting config) - Fix verify.sh zeroclaw check to not rely on ~/.cargo/env existing Tested: 6/6 Sprite agents pass E2E (claude, codex, openclaw, zeroclaw, opencode, kilocode). Hermes is not in the Sprite manifest. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: biome format - collapse runSprite call to single line Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-01 07:15:09 -05:00
Ahmed Abushagur	2155a36a1f	fix(e2e): use explicit PATH for hermes, codex, and kilocode binary checks (#2049 ) Non-interactive SSH sessions don't source .bashrc or .zshrc, so binaries installed to ~/.local/bin (hermes via uv) or ~/.npm-global/bin (codex, kilocode via npm) were not found during verification. Fix all three verify functions and the codex input test to use explicit PATH with the known install directories, matching the pattern already used by openclaw and claude. Verified: AWS 7/7, Hetzner 6/6 implemented, GCP 6/6 implemented. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-01 00:07:33 -05:00
Ahmed Abushagur	cd758589c3	fix(e2e): robust DigitalOcean teardown with retry and deletion confirmation (#2046 ) The teardown was doing a single DELETE without --max-time, so connection timeouts caused HTTP 000 and the droplet was never deleted. When running 6 agents in batches of 3, batch 1's stale droplet caused batch 2 to fail with "will exceed your droplet limit." Fix: - Add --max-time 30 to prevent curl hangs - Retry DELETE up to 3 times on failure - Poll the API after DELETE to confirm the droplet is actually gone (up to 60s) - Remove -f flag from curl so %{http_code} is always captured Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-28 22:09:15 -05:00
Ahmed Abushagur	d8dbf952c2	fix(e2e): fix openclaw input test (PATH, CLI flags, gateway restart) (#2045 ) The openclaw e2e input test was failing for three independent reasons: 1. PATH missing ~/.npm-global/bin — openclaw installs via npm with a custom prefix, but verify_openclaw and input_test_openclaw didn't include that directory in PATH 2. Wrong CLI invocation — used `openclaw -p` which doesn't exist. The correct command is `openclaw agent --message "..." --session-id` 3. Gateway not running — the openclaw gateway (port 18789) can die between provisioning and verification. Now the input test ensures the gateway is running before sending the prompt. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-02-28 20:41:48 -05:00
A	ab08476a63	refactor: Remove dead code and stale references (#2028 ) - Add `hermes` to ALL_AGENTS in sh/e2e/lib/common.sh (stale: hermes added to manifest.json in #2023 but never added to the e2e agent list) - Add verify_hermes() and input_test_hermes() to sh/e2e/lib/verify.sh and wire them into verify_agent/run_input_test dispatch tables - Remove dead log_warn() from sh/shared/github-auth.sh (defined but never called) - Remove dead get_cloud_env_vars() from sh/shared/key-request.sh (no callers outside file) - Remove dead invalidate_cloud_key() from sh/shared/key-request.sh (no callers anywhere) Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-02-28 12:55:09 -08:00
A	111512220f	fix(security): replace eval with safe export parser in provision.sh (#2022 ) Replace eval "${cloud_env}" with a while-read loop that only processes lines matching ^export VAR="VALUE" — arbitrary shell commands in cloud driver output are silently ignored. Also removes the now-unused cloud_env local variable since the while-read loop calls cloud_headless_env directly. Fixes #2019 Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-28 09:43:29 -05:00
A	54c7764d03	fix(security): prevent cmd injection in sprite exec via positional args (#2021 ) Replace bash -c "${cmd}" with bash -c '$1' _ "${cmd}" so the command is passed as a positional argument, not interpolated into the shell string. Same pattern applied to the timeout wrapper. Fixes #2018 Agent: security-auditor Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-28 09:42:13 -05:00
Ahmed Abushagur	d5461adc16	feat: SPAWN_CLI_DIR env var to force local source in e2e (#2015 ) * feat: SPAWN_CLI_DIR env var to force local source in e2e and shell scripts When SPAWN_CLI_DIR is set, the entire toolchain uses local TypeScript source instead of downloading pre-bundled scripts from GitHub releases: - e2e.sh: auto-sets SPAWN_CLI_DIR to repo root when running locally - provision.sh: exports SPAWN_CLI_DIR into the headless subshell - commands.ts: reads local shell scripts instead of fetching from CDN - All 36 cloud/agent shell scripts: exec local main.ts when set This enables e2e tests to validate local changes before they're released. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix(security): add path traversal defense to SPAWN_CLI_DIR script loading Canonicalize the path via realpathSync and verify it stays inside the resolved CLI directory before reading. Prevents SPAWN_CLI_DIR from being used to read arbitrary files via ../ traversal. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix(security): harden SPAWN_CLI_DIR path traversal defense - Validate cloud/agent names don't contain '..', '/' or '\' before constructing file paths - Fix root-directory edge case in prefix check by handling trailing separator correctly Agent: pr-maintainer Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: B <6723574+louisgv@users.noreply.github.com>	2026-02-28 04:14:36 -05:00
Ahmed Abushagur	c1e605c884	fix(e2e): increase server sizes and install timeouts (#2014 ) E2E tests were failing because agent installs didn't complete within the default 120s timeout, and small VMs ran out of memory during builds. - INSTALL_WAIT: 120s → 300s (with per-cloud override via cloud_install_wait) - AWS: nano_3_0 → medium_3_0 (all agents need 4GB for reliable installs) - DigitalOcean: s-1vcpu-512mb-10gb → s-2vcpu-2gb, cap at 3 parallel - GCP: e2-medium → e2-standard-2 - Hetzner: cap at 5 parallel (primary IP limit) - Sprite: 300s install wait (slower exec than SSH) Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-02-28 00:25:36 -08:00
Ahmed Abushagur	c595c90dc4	fix(e2e): prevent multi-cloud name and file collisions (#2013 ) When multiple clouds run in parallel, they generate the same app name (e.g. e2e-claude-TIMESTAMP) and write to the same temp files (.exit/.stdout/.stderr), causing data corruption. - Include ACTIVE_CLOUD in make_app_name: e2e-gcp-claude-TIMESTAMP - Use ${app_name} instead of ${agent} for provision temp files Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-02-28 03:12:05 -05:00
A	87978b424d	refactor: Remove dead code and stale references (#2010 ) Dead code removed: - `cleanup_stale_apps` function in `sh/e2e/lib/cleanup.sh` — defined but never called; `e2e.sh` calls `cloud_cleanup_stale` directly instead - `generateEnvConfig` and `AgentConfig` re-exports from all 7 cloud-specific `agents.ts` modules (aws, hetzner, gcp, digitalocean, daytona, local, sprite) — nothing imported these from the cloud modules; they were already available via `@openrouter/spawn-shared` and `../shared/agents` All 1435 tests pass, biome lint is clean (0 errors). Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-28 00:18:20 -05:00
Ahmed Abushagur	627026a26b	feat(e2e): multi-cloud test suite with cloud driver pattern (#2004 ) * feat(e2e): multi-cloud test suite with cloud driver pattern Scale the E2E test suite from AWS-only to all 6 infrastructure clouds (aws, hetzner, digitalocean, gcp, daytona, sprite) with parallel execution support. Architecture: - Cloud driver pattern: each cloud implements _cloudname_func() functions - load_cloud_driver() wires cloud-specific functions to generic names (cloud_exec, cloud_teardown, etc.) - Shared orchestration stays in one place, cloud details are isolated New files: - sh/e2e/e2e.sh — unified entry point with --cloud flag - sh/e2e/lib/clouds/{aws,hetzner,digitalocean,gcp,daytona,sprite}.sh Refactored: - common.sh — removed AWS constants, added load_cloud_driver() - provision.sh — cloud-agnostic via cloud_headless_env/cloud_provision_verify - verify.sh — replaced aws_ssh with cloud_exec/cloud_exec_long - teardown.sh/cleanup.sh — delegate to cloud driver functions - aws-e2e.sh — thin wrapper: exec e2e.sh --cloud aws Usage: e2e.sh --cloud aws # Single cloud e2e.sh --cloud aws --cloud hetzner # Multiple clouds in parallel e2e.sh --cloud all --parallel 3 # All clouds, 3 agents parallel Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix(e2e): prevent subshell EXIT trap inheritance and single-cloud early exit - Reset EXIT trap in multi-cloud subshells to prevent LOG_DIR deletion before the main process reads log files - Use `\|\| true` for single-cloud run_agents_for_cloud to prevent set -e from skipping the summary on env validation failure Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * feat: default to parallel agent provisioning in e2e tests All agents within a cloud now run in parallel by default instead of sequentially. Use --sequential to restore the old behavior. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: cap sprite parallelism, 4GB for openclaw, remove stderr suppression - Sprite: add _sprite_max_parallel (cap 2 concurrent agents) to avoid CLI rate limiting that caused all 6 agents to fail - AWS: use medium_3_0 (4GB) bundle for openclaw which needs more RAM - Input tests: remove 2>/dev/null from agent commands so failures produce visible error output instead of empty responses - Add cloud_max_parallel to driver interface, respected by e2e.sh Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: use bash instead of sh for exec_long across all cloud drivers Ubuntu's /bin/sh is dash, which doesn't support bash-specific PATH sourcing from .spawnrc/.cargo/env. This caused codex and zeroclaw input tests to fail with "command not found" even though verify passed. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: codex input test uses positional prompt, not -q flag codex CLI takes prompt as positional arg: `codex "PROMPT"`. The -q flag doesn't exist, causing "Usage:" error output. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: use codex exec -q for non-interactive input test codex requires `exec` subcommand for non-interactive mode. Plain `codex PROMPT` expects a TTY (stdin is not a terminal). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: codex exec takes no -q flag, just positional prompt Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: use cx23 instead of deprecated cx22 for Hetzner e2e tests Hetzner deprecated server type cx22 (ID 104). The default now uses cx23. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-02-27 19:28:08 -08:00
A	edcdf78ba8	fix(e2e): correct env var names passed to AWS CLI in provision.sh (#1985 ) The provision.sh was setting wrong env var names that the TypeScript CLI does not read: - AWS_LIGHTSAIL_INSTANCE_NAME → LIGHTSAIL_SERVER_NAME (read by aws.ts:getServerName) - AWS_REGION → AWS_DEFAULT_REGION (read by aws.ts:authenticate/promptRegion) - AWS_BUNDLE → LIGHTSAIL_BUNDLE (read by aws.ts:promptBundle) Without the correct names, each provisioning run created an instance with a random generated name instead of app_name, causing the post-provision existence check to fail every time. Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-27 01:47:41 -08:00
A	d04096a15b	feat!: remove Fly.io cloud provider support (#1979 ) * feat!: remove Fly.io cloud provider support Drop Fly.io as a supported cloud provider. Sprite (which uses Fly.io infrastructure internally) is retained. - Delete packages/cli/src/fly/ module, sh/fly/ scripts, fixtures/fly/ - Remove fly cloud entry and 6 fly matrix entries from manifest.json - Remove fly imports, destroy cases, and connection handlers from commands.ts - Remove fly-ssh sentinel from security.ts - Port E2E test suite from Fly.io to AWS Lightsail (fly-e2e.sh → aws-e2e.sh) - Update README (7 clouds, 42 combinations), CLAUDE.md, and skill prompts - Clean up fly references in build config, gitignore, icon sources - Bump CLI version to 0.11.0 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * chore: restore Docker image build under sh/docker/ Move openclaw Dockerfile from sh/fly/docker/ to sh/docker/ and rename workflow from fly-docker.yml to docker.yml with updated paths. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * style: fix extra blank lines in commands.ts Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: spawn-bot <spawn-bot@openrouter.ai> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-02-27 00:06:32 -05:00
A	b6f021ecf2	fix(security): clarify base64+single-quote pattern in fly_ssh (#1937 ) Fixes #1933. The comments incorrectly implied base64 encoding alone prevents injection. Safety relies on the combination of base64 output (no single quotes in alphabet) + single-quote wrapping. Made this explicit in all 7 affected comments. Agent: security-auditor Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-25 18:44:51 -05:00
A	b09295131a	fix(security): replace eval with pipe-to-sh in fly_ssh functions (#1928 ) Eliminates nested quote eval pattern in favor of direct pipe to sh, removing potential injection surface in fly_ssh and fly_ssh_long. Fixes #1927 Agent: security-auditor Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-25 14:50:24 -05:00
A	f9c1568f9c	fix(security): use explicit exports in provision.sh subshell (#1926 ) Replace inline env-var prefix pattern (VAR=value command) with explicit export statements inside the subshell. While the inline prefix is POSIX-compliant and not a real injection vector, explicit exports are clearer about intent, eliminate the fragile backslash-continuation chain, and prevent future copy-paste of the pattern into unsafe contexts. Fixes #1924 Agent: security-auditor Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-25 12:54:58 -05:00
A	cac06b2706	fix(security): base64-encode INPUT_TEST_PROMPT in E2E verify.sh to prevent injection (#1923 ) Fixes #1921 Agent: security-auditor Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-25 10:26:18 -05:00
A	638250e744	fix(security): use base64 encoding in fly_ssh to prevent command injection (#1918 ) Replace single-quote escaping (which only handled ' but not other shell metacharacters like $(), backticks, ;, \|\|, &&, \|) with base64 encoding. Base64 output contains only [A-Za-z0-9+/=] characters, completely eliminating shell metacharacter injection risks regardless of command content. Compatible with both GNU coreutils (Linux) and BSD (macOS). Fixes #1912 Agent: security-auditor Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-25 03:25:31 -08:00
A	4994c28594	fix(security): harden shell scripts - fix sed portability, curl HTTPS enforcement, token expiry (#1917 ) - MEDIUM: Validate flyctl auth status before empty FLY_API_TOKEN fallback in provision.sh (fail fast instead of silent failure) - LOW: Fix sed -i portability in qa.sh (use sed -i.bak for macOS compat) - LOW: Increase FLY_API_TOKEN expiry from 2h to 8h in common.sh - LOW: Add --proto '=https' to all curl -L calls in digitalocean scripts (6 files) to prevent HTTP downgrade on redirects Fixes #1913 Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-25 03:23:32 -08:00
A	154112fb41	feat: add live input/output E2E verification for agents (#1886 ) * feat: add live input/output E2E verification for agents The E2E suite previously only verified static artifacts (binaries, config files, env vars). An agent with a broken API key or crash-on-launch bug would pass all checks. This adds an input test phase that sends a real prompt to each agent and verifies the response contains a marker string. - Add fly_ssh_long() with configurable timeout for long-running commands - Add per-agent input test functions (claude -p, codex -q, openclaw -p, zeroclaw agent -p; opencode/kilocode skip as TUI-only) - Add run_input_test() dispatcher with SKIP_INPUT_TEST env var support - Add --skip-input-test CLI flag to fly-e2e.sh - Chain input test after verify in run_single_agent() pipeline - Add INPUT_TEST_TIMEOUT constant (default 120s, env-overridable) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: format p.text({ message }) to multi-line for biome Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: spawn-bot <spawn-bot@openrouter.ai> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 15:16:30 -05:00
A	80c044dbb3	fix: correct CLI entry point path in E2E provision script (#1887 ) * fix: correct CLI entry point path in E2E provision script The path resolution went up 2 levels (../../) from sh/e2e/lib/ which landed in sh/ instead of the repo root. After the monorepo restructure, packages/cli/src/index.ts is at the repo root — need 3 levels (../../..). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: format p.text({ message }) to multi-line for biome Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: spawn-bot <spawn-bot@openrouter.ai> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 15:16:07 -05:00
A	65f6f1be32	feat: Bun workspace monorepo — packages/cli + packages/shared (#1853 ) Restructure the repo as a Bun workspace monorepo: - Move cli/ → packages/cli/ - Create packages/shared/ (@openrouter/spawn-shared) with type-guards and parse utilities - Add root package.json with workspace configuration - Update all CLI imports to use @openrouter/spawn-shared - Deduplicate toRecord/toObjectArray helpers from 4 cloud modules - Update SPA (slack-bot) to use shared package instead of local toObj() - Update 48 agent shell scripts for new packages/cli/ path - Update install.sh, install.ps1, e2e, and test scripts - Update all GitHub workflows, .gitignore, pre-commit hooks - Update CLAUDE.md, README.md, and skill prompt references - Pin all dependency versions (no ^ ranges) - Bump CLI version 0.9.1 → 0.10.0 All 1908 tests pass. Lint clean. All 8 cloud bundles build. Co-authored-by: Claude <claude@anthropic.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-23 22:07:05 -08:00
A	b84adfb74e	refactor: move all shell scripts to /sh directory (#1843 ) Reorganizes the project so all shell scripts live under a dedicated /sh directory, enabling the OpenRouter rewrite URL to point at /sh/ instead of the repository root. Moves: - cli/install.sh → sh/cli/install.sh - shared/.sh → sh/shared/.sh - {cloud}/{agent}.sh → sh/{cloud}/{agent}.sh (48 scripts) - {cloud}/README.md → sh/{cloud}/README.md - e2e/.sh → sh/e2e/.sh - test/macos-compat.sh → sh/test/macos-compat.sh - test/fixtures/*/.sh → sh/test/fixtures/*/.sh Updates all references: - RAW_BASE path construction in commands.ts, update-check.ts - GitHub auth URL in agent-setup.ts - Self-referencing URLs in install.sh, github-auth.sh - CI workflow paths in lint.yml, cli-release.yml - Test file paths in install-script-validation, manifest-integrity - Documentation in README.md, cli/README.md, CLAUDE.md - QA scripts in .claude/skills/ Co-authored-by: lab <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-23 21:14:54 -08:00

26 commits