spawn

vrr/spawn

mirror of https://github.com/OpenRouterTeam/spawn.git synced 2026-05-12 14:20:17 +00:00

Author	SHA1	Message	Date
A	0ea0d5bb61	test: add coverage for retryOrQuit and skipCloudInit auto-detection (#2810 ) Both functions were added in recent commits but had zero test coverage: - retryOrQuit (`ed127cf`): non-interactive mode now verified to throw - skipCloudInit (`2280550`): 4 cases verify correct tier/cloud/mode conditions 1468 tests pass, 0 failures. Agent: test-engineer Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-19 23:45:04 -07:00
A	69b6f8aa66	fix(test): fix 7 failing tests — GCP mock gaps and sandbox pollution (#2816 ) - GCP coverage tests (6 failures): getServerIp, listServers, and authenticate tests did not mock the `which gcloud` spawnSync call inside requireGcloudCmd(), causing "gcloud CLI not found" errors. Add mockSpawnSyncWithGcloud/mockWhichGcloud helpers that satisfy the gcloud discovery call before the test-specific mock. - Sandbox guardrail test (1 failure): cmd-uninstall-cov deletes ~/.spawn and other sandbox directories but never re-creates them. Since Bun runs test files in the same process, the fs-sandbox test then fails. Add afterEach restoration of sandbox dirs. - Add coverageThreshold to bunfig.toml with correct syntax (coverageThreshold under [test], not [test.coverage]) Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-19 23:43:13 -07:00
A	18c7834d24	fix: restore packages/cli/bunfig.toml for preload when running from subdir (#2813 ) The pre-merge hook and `cd packages/cli && bun test` need a local bunfig.toml so the preload path resolves correctly for the sandbox. Co-authored-by: Claude <claude@anthropic.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 22:57:03 -07:00
A	9ae3525030	feat: enforce CI coverage thresholds + colocate billing guidance (#2811 ) - Move bunfig.toml to repo root with valid coverageThreshold syntax (line=80%, function=0 to avoid per-file false positives) - Add --coverage flag to CI test step - Delete packages/cli/bunfig.toml (superseded by root config) - Add tests for packages/shared (type-guards, parse, result) - Colocate billing config into each cloud directory (aws/billing.ts, gcp/billing.ts, hetzner/billing.ts, digitalocean/billing.ts) - Refactor billing-guidance.ts: BillingConfig interface replaces cloud-string-keyed Record maps - Bump CLI version to 0.25.1 Co-authored-by: Claude <claude@anthropic.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-19 22:52:45 -07:00
Ahmed Abushagur	aa4b2a23d6	feat: auto-reconnect on SSH drops during interactive session (#2806 ) When SSH exits with code 255 (connection dropped/timed out), retry up to 5 times with 3s delay between attempts. Clean exits (0), Ctrl+C (130), and agent crashes exit immediately without retrying. Only applies to remote clouds — local sessions skip reconnect logic. Signed-off-by: L <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-19 22:28:10 -07:00
A	72c3f23364	test: add comprehensive code coverage tests (#2802 ) * test: add comprehensive coverage tests (67% → 85% lines) Add 27 new test files with ~565 tests covering all major modules: Shared modules: - ui-cov: logging, prompts, validation, shellQuote, withRetry, loadApiToken - ssh-cov: spawnInteractive, killWithTimeout, startSshTunnel, waitForSsh - ssh-keys-cov: generateSshKey edge cases, key sorting, fingerprint - oauth-cov: PKCE flow, code verifier/challenge, key management - orchestrate-cov: provisioning flow, enabled steps, model preferences - agent-setup-cov: wrapSshCall, createCloudAgents, GitHub auth Commands: - connect, status, uninstall, pick, delete, update, fix, interactive - link, run, list (with formatRelativeTime, filters, actions) Cloud providers: - aws, gcp, digitalocean, hetzner, sprite (auth, CRUD, SSH ops) Remaining: - picker, unicode-detect, history, manifest, update-check Also fixes: - do-payment-warning.test.ts: use spyOn instead of mock.module for shared/ui to prevent cross-test contamination - preflight-credentials.test.ts: resilient to @clack/prompts mock replacement by other test files Coverage: 74% → 90% functions, 67% → 85% lines Tests: 1467 → 2032, 0 failures Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * test: expand coverage tests for commands, oauth, orchestrate, and link Add 65+ new tests across 7 test files: - cmd-list-cov: handleRecordAction branches (rerun, fix, no-connection), resolveListFilters with cloud filter, footer and empty message paths - cmd-run-cov: showDryRunPreview edge cases, getScriptFailureGuidance for all exit codes, getSignalGuidance, cmdRun validation - cmd-pick-cov: flag edge cases (missing values, multiple flags) - cmd-link-cov: IP generation, detection spinner, invalid IP - cmd-fix-cov: additional fix paths - oauth-cov: non-standard key confirmation, null config handling - orchestrate-cov: tunnel support, checkAccountReady, tarball, SPAWN_NAME, preLaunch, restart loop, step validation Coverage: 90.50% functions, 85.13% lines (2097 tests, 0 failures) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * test: add coverage thresholds (80% lines, 90% functions) Configure bun test coverage thresholds in bunfig.toml to enforce minimum coverage levels and prevent regressions. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude <claude@anthropic.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 22:24:54 -07:00
A	646faf66e2	test: remove duplicate config_files test in manifest-type-contracts (#2809 ) Consolidated two overlapping describe blocks that both iterated over the same config_files data: - 'Agent optional field types' had a test checking config_files keys were strings with length > 0 - 'Config files structure' had a separate describe checking the same keys match a path regex and values are non-null objects Merged into a single test within 'Agent optional field types' that checks all constraints: key is string, key is non-empty, key matches path regex (/[/~./]), and value is a non-null object. Removed the now-redundant 'Config files structure' describe block. -- qa/dedup-scanner Co-authored-by: spawn-qa-bot <qa@openrouter.ai>	2026-03-19 22:05:41 -07:00
Ahmed Abushagur	ed127cf592	feat: never-give-up resilience layer (#2807 ) Some checks failed CLI Release / Build and release CLI (push) Failing after 5s Details Lint / Biome Lint (push) Failing after 4s Details Lint / macOS Compatibility (push) Successful in 15s Details Lint / ShellCheck (push) Successful in 59s Details * feat: never-give-up resilience layer — retry every failure instead of exiting Add retryOrQuit() helper to shared/ui.ts that prompts "Try again? (Y/n)" after any recoverable failure. Wrap all fatal exit points with retry loops: - Cloud auth (Hetzner, DigitalOcean, AWS, GCP): retry after 3 failed tokens - API key acquisition: retry after 3 failed OAuth+manual attempts - Server creation: retry on any createServer failure (both fast & sequential) - SSH readiness: retry on waitForReady timeout - Agent install: retry on install failure - Pre-launch hooks: retry on preLaunch failure Non-interactive mode (SPAWN_NON_INTERACTIVE=1) still throws immediately. Ctrl+C at any retry prompt exits cleanly. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat(e2e): add AI-driven interactive test harness Add --interactive mode to the E2E test framework. Instead of running spawn in headless mode (SPAWN_NON_INTERACTIVE=1), this spawns the CLI in a real PTY and uses Claude Haiku to respond to prompts like a human user would. New files: - sh/e2e/interactive-harness.ts — Bun script that drives the PTY + AI loop - sh/e2e/lib/interactive.sh — Bash integration with the E2E framework Usage: e2e.sh --cloud hetzner claude --interactive Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat(qa): wire interactive E2E into scheduled QA pipeline - Add `e2e-interactive` option to workflow_dispatch in qa.yml - Add `e2e-interactive` run mode to qa.sh (loads cloud creds + ANTHROPIC_API_KEY) - Runs `e2e.sh --cloud hetzner claude --interactive` directly (no Claude Code needed) - Defaults to hetzner (cheapest), overridable via E2E_INTERACTIVE_CLOUD/AGENT env vars Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat(qa): schedule interactive E2E daily at 6am UTC Runs one agent (claude) on one cloud (hetzner) with AI-driven prompts. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix(qa): offset soak cron to avoid GitHub Actions schedule dedup GitHub Actions deduplicates overlapping cron schedules into one run, making `github.event.schedule` unpredictable. The soak test at `0 3 * * 1` was getting absorbed by the `0 /4 * ` quality sweep and never firing as reason=soak. Move soak to `30 1 * 1` (Monday 1:30am UTC) — safely between the 0am and 4am quality sweep slots. Interactive E2E at `0 6 * * ` is already safe (between the 4am and 8am slots). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> fix(qa): add e2e-interactive to trigger server valid reasons The trigger server validates reason query params against an allowlist. Without this, the `e2e-interactive` dispatch returns 400. Also note: `soak` is already in VALID_REASONS in the repo but the running service on the QA VM is stale — needs a restart to pick up both soak and e2e-interactive reasons. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 17:33:22 -07:00
Ahmed Abushagur	2280550c18	perf: skip cloud-init for minimal-tier agents with tarballs/snapshots (#2804 ) * perf: skip cloud-init for minimal-tier agents with tarballs/snapshots Ubuntu 24.04 base images already have curl + git, so minimal-tier agents (claude, opencode, zeroclaw, hermes) don't need the cloud-init package install step when using tarballs or snapshots. Adds skipCloudInit flag to CloudOrchestrator — set automatically when (tarball \|\| snapshot) && tier === "minimal". Each cloud's waitForReady checks this flag and calls waitForSshOnly instead of waitForCloudInit. Saves ~30-60s on minimal-tier agent deploys with --fast or --beta tarball. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: add --fast mode and updated beta features to README Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: remove timing table from README Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-19 16:14:49 -07:00
A	1d0349cc23	test: add SPAWN_FAST fast-mode coverage to orchestrate (#2801 ) Add 6 test cases verifying the Promise.allSettled parallel orchestration path introduced in #2796. Tests cover: happy path, server boot failure propagation, API key failure propagation, tarball fallback to agent.install, local cloud exclusion from fast mode, and non-fatal preProvision/checkAccountReady failures. Agent: test-engineer Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-19 13:16:02 -07:00
Ahmed Abushagur	5efbcf9ee7	feat: add --fast flag for parallel server boot + setup (#2796 ) * feat: add --fast flag for parallel server boot + setup Adds `--fast` flag that runs server creation concurrently with API key prompt, account check, pre-provision hooks, tarball download, and env config generation. Once SSH is up, uploads tarball and applies config. --fast implies --beta tarball and --beta images, enabling snapshots and pre-built tarballs automatically. Flow without --fast (sequential): auth → API key → preProvision → size → create → boot → install → configure Flow with --fast (parallel): auth → size → [create+boot \| API key \| preProvision \| tarball download \| accountCheck] → upload tarball → inject env → configure Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: add --beta parallel as standalone opt-in for parallel setup --beta parallel enables the parallel orchestration without implying tarball/images. --fast still implies all three (tarball + images + parallel). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 10:26:54 -07:00
A	6772ed1cd7	fix(cli): validate agentKey in buildFixScript and fixSpawn before manifest lookup (#2792 ) Some checks failed Lint / ShellCheck (push) Successful in 1m5s Details CLI Release / Build and release CLI (push) Failing after 18s Details Lint / Biome Lint (push) Failing after 4s Details Lint / macOS Compatibility (push) Successful in 14s Details Add validateIdentifier() calls to buildFixScript() and fixSpawn() to ensure agent keys from spawn history match [a-z0-9_-]+ before using them to index manifest.agents. This prevents potential prototype pollution or unexpected behavior from tampered history files. Agent: security-auditor Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-19 06:36:06 -07:00
A	787087144c	fix(cli): bump version to 0.23.2 for missed patch releases (#2787 ) Some checks failed CLI Release / Build and release CLI (push) Failing after 5s Details Lint / Biome Lint (push) Failing after 4s Details Lint / macOS Compatibility (push) Successful in 17s Details Lint / ShellCheck (push) Successful in 57s Details Two CLI changes landed after the last version bump (0.23.1) without incrementing the version: - `d9575acd`: fix(cli): exit with code 1 on spawn fix error paths - `148cc9e7`: refactor: extract duplicate waitForSshSnapshotBoot to shared/ssh.ts The CLI has auto-update enabled — without a version bump, users won't pick up these fixes on next run. Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-19 01:00:10 -07:00
A	148cc9e7ee	refactor: extract duplicate waitForSshSnapshotBoot to shared/ssh.ts (#2783 ) The waitForSshOnly function was identically duplicated in hetzner.ts and digitalocean.ts. Extract the shared logic into waitForSshSnapshotBoot() in shared/ssh.ts and replace the duplicate cloud implementations with thin wrappers that resolve module-local state before delegating. -- qa/code-quality Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-18 22:10:25 -07:00
A	d9575acd43	fix(cli): exit with code 1 on `spawn fix` error paths (#2781 ) cmdFix error paths (spawn not found, non-interactive with multiple servers, picker mismatch) previously returned without setting a non-zero exit code. Scripts checking $? would incorrectly see success. Now exits with code 1 on all error paths in cmdFix. fixSpawn() is unchanged since it is also called from the list picker where returning to loop is correct behavior. Agent: ux-engineer Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-18 20:43:31 -07:00
A	15a62a9ad0	fix(cli): use tryCatch for JSON.parse in loadPreferredModel (#2782 ) tryCatchIf(isFileError) only catches filesystem errors (ENOENT, EACCES), but JSON.parse throws SyntaxError on corrupted preferences.json. This was the same bug fixed in `16a2f180` across 4 files, but orchestrate.ts was missed. A corrupted ~/.spawn/preferences.json would crash the CLI instead of gracefully falling back to no preferred model. Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-18 20:15:17 -07:00
Ahmed Abushagur	7289f3ef36	feat(hetzner): add snapshot support + Packer image builds (#2774 ) Some checks failed CLI Release / Build and release CLI (push) Failing after 31s Details Lint / ShellCheck (push) Successful in 40s Details Lint / Biome Lint (push) Failing after 14s Details Lint / macOS Compatibility (push) Successful in 18s Details CLI changes: - Add findSpawnSnapshot() to query Hetzner /images?type=snapshot API for pre-built spawn-{agent}-* images (matches by description prefix) - Add waitForSshOnly() for snapshot boots (skips cloud-init polling) - Update createServer() to accept optional snapshotId — boots from snapshot instead of ubuntu-24.04, skips cloud-init userdata - Wire up orchestrator with skipAgentInstall flag Packer changes: - Add packer/hetzner.pkr.hcl using hcloud plugin, mirroring the DO template (tier scripts, agent install, cleanup, manifest) - Unify packer-snapshots.yml to build both DO and Hetzner in a single workflow with cloud×agent matrix and per-cloud cleanup steps Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-18 16:46:48 -07:00
A	04eb54b409	test: consolidate repetitive validateLaunchCmd and validatePreLaunchCmd valid-input tests (#2771 ) 7 agent-specific it() blocks for validateLaunchCmd (all calling .not.toThrow() on trivially different inputs) collapsed into one data-driven loop. Similarly, 6 individual validatePreLaunchCmd valid-pattern tests collapsed into one loop. Reduces it() count in security-connection-validation.test.ts from 93 to 81 with zero change in coverage - every command variant is still exercised. Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-18 14:16:38 -07:00
A	16a2f1807c	fix(cli): use tryCatch instead of tryCatchIf for JSON.parse callsites (#2770 ) tryCatchIf(isFileError) only catches filesystem errors (ENOENT, EACCES), but JSON.parse throws SyntaxError on corrupted input. Since tryCatchIf rethrows non-matching errors, a corrupted config file crashes the CLI instead of returning the intended null/false fallback. Affected: readCache(), local manifest loader, loadApiToken(), loadSavedOpenRouterKey(), hasCloudConfigCredentials() Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-18 12:54:41 -07:00
A	fc98700a24	fix(digitalocean): use s-2vcpu-4gb-intel for openclaw to support nyc3 region (#2769 ) s-2vcpu-4gb is not available in nyc3 (the default E2E region), causing openclaw provisioning to fail with 422. s-2vcpu-4gb-intel offers the same specs (2 vCPUs, 4 GB RAM) and is available in all regions including nyc3. -- qa/e2e-tester Co-authored-by: spawn-qa-bot <qa@openrouter.ai>	2026-03-18 11:26:19 -07:00
A	b46524887d	feat(hetzner): fetch locations from API, re-prompt on unavailable location (#2766 ) Hetzner disabled fsn1 (Falkenstein), causing a fatal HTTP 412 error for all users using the default location. This change: - Fetches available locations dynamically from GET /locations API - Falls back to a hardcoded list if the API call fails - On location-unavailable errors (HTTP 412 resource_unavailable), prompts the user to pick a different location instead of crashing - Changes default location from fsn1 to nbg1 (Nuremberg) - Excludes previously-failed locations from the re-pick list Closes #2764 Co-authored-by: Claude <claude@anthropic.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: Security Reviewer <security@openrouter.ai> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-18 10:39:42 -07:00
A	1ad385117e	test: consolidate redundant platform tests in shell.test.ts (#2767 ) macOS and Linux return identical results for getLocalShell, getWhichCommand, getInstallScriptUrl, and getInstallCmd. Collapsed the duplicate per-platform tests into a data-driven loop over ["darwin", "linux"], reducing repetition while preserving the same coverage. Also added the missing Linux case for getInstallCmd (was only tested for Windows and macOS). Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-18 10:28:09 -07:00
A	4e31e8dd4c	docs(tests): document 5 undocumented test files in README (#2762 ) Some checks failed CLI Release / Build and release CLI (push) Failing after 19s Details Lint / Biome Lint (push) Failing after 3s Details Lint / macOS Compatibility (push) Successful in 14s Details Lint / ShellCheck (push) Successful in 58s Details Added missing entries to packages/cli/src/__tests__/README.md for: - auto-update.test.ts — setupAutoUpdate systemd service unit generation - kill-with-timeout.test.ts — killWithTimeout SIGKILL grace period logic - shell.test.ts — platform-aware shell detection utilities - digitalocean-token.test.ts — DigitalOcean token storage and API helpers - hetzner-pagination.test.ts — Hetzner API multi-page pagination All 1467 tests pass. No code changes. -- qa/code-quality Co-authored-by: spawn-qa-bot <qa@openrouter.ai>	2026-03-18 07:01:19 -07:00
A	47b8bd30cc	test: remove duplicate and theatrical tests (#2763 ) removed the "integration with getScriptFailureGuidance" describe block from credential-hints.test.ts. all three tests were redundant: - "always includes setup instructions regardless of env state": tested for vague "setup instructions" string, already verified by the "when all required env vars are missing" describe block above. - "always returns at least one line": pure existence check, already proven by the "when no authHint is provided" tests which assert exact length of 1. - "returns more lines when authHint is provided": tests line-count implementation detail rather than behavior; behavior is fully covered by the per-scenario describe blocks. 1467 to 1464 tests. zero regressions. biome lint: 0 errors. -- qa/dedup-scanner Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-18 06:37:40 -07:00
A	af300ba248	fix(digitalocean): paginate SSH keys/droplets and harden key registration check (#2758 ) Add doGetAll() pagination helper (matching Hetzner's hetznerGetAll pattern) and use it for all three unpaginated DO API calls: - ensureSshKey(): /account/keys (was silently truncated at 20 keys) - createServer(): /account/keys (same issue for SSH key ID collection) - listServers(): /droplets (was silently truncated at 20 droplets) Replace fragile `regText.includes('"id"')` string check with proper `parseJsonObj(regText)?.ssh_key` validation for SSH key registration. Fixes #2748 Fixes #2749 Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-18 01:18:06 -07:00
A	d4774fdc8e	fix(sprite): append to ~/.bash_profile and gate exec zsh on interactive shells (#2756 ) - Use >> instead of > to append to ~/.bash_profile (preserves existing config) - Gate exec zsh on interactive shells: [[ $- == i ]] && exec /usr/bin/zsh -l - Bump CLI version to 0.21.7 Fixes #2740 Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-17 23:55:33 -07:00
A	75c75d42d4	fix(ui): propagate Ctrl+C/Esc cancellation instead of returning empty string (#2757 ) When p.isCancel() detected user cancellation in prompt() and selectFromList(), the result was silently converted to "" instead of exiting. This caused infinite retry loops in billing prompts, silent fallthrough in oauth key entry, and unintended defaults in name prompts. Now both functions call process.exit(0) on cancel for a clean exit. Fixes #2745 Agent: ux-engineer Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-17 23:54:32 -07:00
A	fef312cd47	fix(update): cache successful update checks for 1 hour (#2755 ) checkForUpdates() previously fetched the latest version from GitHub on every single CLI invocation, blocking for up to 10s on slow/offline connections. Now it writes a timestamp to ~/.config/spawn/.update-checked after a successful check and skips the network call if the cache is less than 1 hour old. Co-authored-by: Claude <claude@anthropic.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-17 23:08:05 -07:00
A	133b94939e	fix(hetzner): ensure cloud-init marker is always written despite early exit (#2747 ) Remove `set -e` from userdata script and add an EXIT trap to guarantee /root/.cloud-init-complete is written even if apt-get or other setup steps fail. Add `\|\| true` to apt-get commands for extra resilience. Previously, the userdata script used `set -e` causing it to abort on any command failure before reaching the marker write at the end. This made waitForCloudInit() always time out with "Cloud-init marker not found, continuing anyway..." adding ~5 minutes to every Hetzner provisioning. Fixes #2739 Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-17 23:02:16 -07:00
A	1b978c03ce	fix(tarball): validate VM architecture when only one arch asset exists (#2753 ) When a GitHub Release contains only one architecture-specific tarball (e.g., x86_64 only), the download command now checks `uname -m` on the remote VM and fails with exit 1 if the arch doesn't match. This prevents installing an x86_64 binary on ARM (or vice versa) and ensures the orchestrator falls back to live installation. Co-authored-by: Claude <claude@anthropic.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-17 22:59:04 -07:00
A	035ee3ca63	fix(ssh): always escalate to SIGKILL in killWithTimeout (#2752 ) proc.killed is true as soon as kill() is called, not when the process exits. This meant SIGKILL escalation was always skipped, leaving stuck processes hanging indefinitely. Remove the faulty guard and always attempt SIGKILL after the grace period — try/catch handles already-dead processes. Co-authored-by: Claude <claude@anthropic.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-18 05:54:38 +00:00
A	a557fb1002	fix(cli): handle --help and --version flags after positional args (#2750 ) Previously, `spawn claude sprite --help` would warn about extra args and proceed to provision a server. Now trailing help/version flags are detected and handled correctly in both the default command path and verb alias path (e.g., `spawn run claude sprite --help`). Co-authored-by: Claude <claude@anthropic.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-17 22:29:48 -07:00
Ahmed Abushagur	39f62b8c75	fix(windows): use dirname() instead of unix-only regex for config paths (#2738 ) The regex `configPath.replace(/\/[^/]+$/, "")` only matches forward slashes, so on Windows (which uses backslashes) it returns the full path unchanged. `mkdirSync` then creates `digitalocean.json` as a directory, causing EISDIR on the next write. Replace with `dirname()` from `node:path` which handles both separators. Affects digitalocean.ts, hetzner.ts, and aws.ts (oauth.ts already used dirname correctly). Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: PR Reviewer <pr-reviewer@spawn> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-17 22:22:30 -07:00
A	800c446ca4	fix(security): resolve symlinks in prompt file validation to prevent bypass (#2744 ) validatePromptFilePath used path.resolve() which only normalizes the string but doesn't follow symlinks. An attacker could create a symlink (e.g., innocent.txt -> ~/.ssh/id_rsa) to bypass sensitive path checks and exfiltrate credentials. Now uses realpathSync() to canonicalize the path before pattern matching. Co-authored-by: Claude <claude@anthropic.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-17 22:21:11 -07:00
A	1e190924bf	fix(aws): wait for public IP before returning from waitForInstance (#2746 ) Lightsail can report state=running before assigning a public IP. Continue polling until both state is running and IP is non-empty, preventing SSH connection failures from an empty IP address. Co-authored-by: Claude <claude@anthropic.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-17 22:16:57 -07:00
A	1ac7b9a0d1	fix(hetzner): paginate SSH key and server list API calls to prevent truncation at 25 items (#2741 ) Hetzner API defaults to 25 items per page. Users with >25 SSH keys would hit SSH lockout on server creation because the newly registered key landed on page 2+ and was omitted from the ssh_keys payload. Co-authored-by: Claude <claude@anthropic.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-17 22:11:45 -07:00
A	f35696434a	fix(security): use writeFileSync for credential files — Bun.write ignores mode option (#2742 ) Bun.write does not support the `mode` option, so credential config files (Hetzner, DigitalOcean, AWS, OpenRouter) were created with 0644 permissions instead of the intended 0600, exposing API tokens to other local users. Switch to node:fs writeFileSync which correctly applies file permissions. Co-authored-by: Claude <claude@anthropic.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-17 22:09:36 -07:00
A	7fe1bdf6b3	fix(junie): remove JUNIE_MODEL env var to fix 'Unknown model: openrouter/auto' crash (#2735 ) Junie only accepts its own shorthand model names (gpt, opus, sonnet, etc.) and not OpenRouter model IDs. Removing modelEnvVar lets junie handle its own model routing via the OpenRouter API key instead. Fixes #2734 Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-17 21:22:32 -07:00
Ahmed Abushagur	c11879d547	fix(windows): download JS bundle instead of bash wrapper on Windows (#2730 ) The bash wrapper scripts (.sh) contain bash syntax that PowerShell cannot parse. On Windows, download the pre-built JS bundle from GitHub releases and run it directly via `bun run {cloud}.js {agent}`, which is exactly what the bash wrapper ultimately does. Affects both interactive (execScript) and headless (cmdRunHeadless) code paths. macOS/Linux behavior unchanged. Closes #2726 Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-17 19:09:44 -07:00
A	b1de116690	refactor: replace manual multi-level type guards with toRecord/isString in index.ts (#2731 ) Two instances of the pattern `err && typeof err === "object" && "code" in err` violated the type-safety rule requiring valibot or shared type-guard utilities instead of manual multi-level type checks. Replaced with `toRecord(err)` and `isString()` from @openrouter/spawn-shared for consistent, rule-compliant error code extraction. Also bumps CLI patch version per cli-version.md. -- qa/code-quality Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-17 18:40:16 -07:00
Ahmed Abushagur	6e92cc832b	feat: add systemd auto-update service for agents on cloud VMs (#2728 ) Installs a systemd timer + oneshot service that updates the agent binary and system packages every 6 hours without disrupting running instances. Agent update safety: - Binary agents (Go, Rust): Linux keeps old inode in memory; safe to replace - npm agents: Node.js caches modules at startup; running processes unaffected - New version takes effect on next restart via the existing restart loop System update safety: - Disables Ubuntu's unattended-upgrades to prevent dpkg lock contention - Uses flock -w 300 on /var/lib/dpkg/lock-frontend before apt operations - DEBIAN_FRONTEND=noninteractive with --force-confdef/--force-confold User-facing: - "Auto-update" option in setup multiselect (default on, user can uncheck) - Skipped for local cloud and non-systemd systems - Non-fatal: setup failure doesn't block agent launch - Logs to /var/log/spawn-auto-update.log Timer: 15min after boot, then every 6h with 30min random jitter. Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-17 17:34:12 -07:00
Ahmed Abushagur	66b16d8651	feat: add Windows PowerShell support — remove bash dependency for local execution (#2727 ) Replace hardcoded "bash" shell references with platform-aware utilities so spawn works natively from PowerShell on Windows without WSL or Git Bash. - New shared/shell.ts: isWindows(), getLocalShell(), getInstallScriptUrl(), getInstallCmd(), getWhichCommand() with platform override for testability - local/local.ts: use getLocalShell() for runLocal() and interactiveSession() - commands/run.ts: spawnScript/runScriptHeadless use getLocalShell() - commands/update.ts: Windows downloads install.ps1, runs via PowerShell - update-check.ts: Windows auto-update uses install.ps1; "where" replaces "which" - shared/orchestrate.ts: PowerShell-compatible .spawnrc setup for local Windows - Remote SSH commands unchanged — remote servers are always Linux Closes #2726 Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-17 16:35:23 -07:00
A	ba94f681b3	feat(cli): add spawn uninstall command (#2724 ) * feat(cli): add `spawn uninstall` command Adds a new `uninstall` subcommand that cleanly reverses the install: - Removes ~/.local/bin/spawn binary and /usr/local/bin/spawn symlink - Cleans spawn PATH entries from shell RC files (.bashrc, .zshrc, etc.) - Removes ~/.cache/spawn/ cache directory - Optionally removes ~/.spawn/ (history) and ~/.config/spawn/ (keys/config) - Shows confirmation prompt before any destructive action Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * refactor: use start/end markers for shell RC blocks - Add shared RC_MARKER_START/RC_MARKER_END constants in paths.ts - Update install.sh to write `# >>> spawn >>>` / `# <<< spawn <<<` block markers - Update uninstall.ts to remove content between markers (with legacy fallback) - Addresses review feedback: shared markers make RC entries easier to audit/remove Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * refactor: share legacy RC marker from paths.ts Move the legacy "# Added by spawn installer" string to RC_MARKER_LEGACY in shared/paths.ts so both install.sh and uninstall.ts reference the same source of truth for all marker strings. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude <claude@anthropic.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-03-17 16:33:09 -07:00
A	1733903a1f	fix(digitalocean): add OAuth recovery in doApi for mid-session 401 errors (#2723 ) When a DigitalOcean token expires mid-session (after ensureDoToken succeeds), API calls like ensureSshKey, createServer, listServers, destroyServer would crash with "Fatal: DigitalOcean API error 401" because doApi had no recovery path for 401 responses. Now doApi detects 401, attempts OAuth browser flow recovery via tryDoOAuth(), and retries the request with the new token. A re-entrancy guard prevents infinite loops (doApi → tryDoOAuth → doApi → ...). If OAuth recovery fails, the original 401 error is thrown as before. Co-authored-by: Claude <claude@anthropic.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-17 16:13:42 -07:00
A	00863b0172	fix(digitalocean): handle 401 gracefully in testDoToken instead of crashing (#2722 ) testDoToken() used asyncTryCatchIf(isNetworkError, ...) which only caught network errors. A 401 HTTP response threw a regular Error that escaped the guard, propagating to main().catch() and printing "Fatal: DigitalOcean API error 401...". Changed to asyncTryCatch() to catch all errors, returning false for invalid tokens so ensureDoToken() naturally falls through to OAuth recovery. Co-authored-by: Claude <claude@anthropic.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-17 15:14:30 -07:00
A	6509973154	test: remove duplicate terminal-width boilerplate in cmd-listing-output tests (#2721 ) Consolidate 10 single-assertion cmdMatrix tests (5 wide-terminal + 5 narrow-terminal) into 2 comprehensive tests using beforeEach/afterEach for terminal-width setup. Also fix a pre-existing environment-dependent failure where HCLOUD_TOKEN being set on the host caused the auth-hint test to see "ready" instead of "needs". Changes: - "grid view (wide terminal)": 5 tests → 1 test (8 fewer cmdMatrix() calls) - "compact view (narrow terminal)": 5 tests → 1 test (same) - Fix "should display auth hints" to clear host env vars before asserting Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-17 14:22:05 -07:00
A	c6087534aa	fix: populate connection fields in --headless --output json result (#2716 ) After runBashHeadless() succeeds, read the spawn record saved during orchestration and populate ip_address, ssh_user, server_id, and server_name in the SpawnResult output. Closes #2715 Co-authored-by: Claude <claude@anthropic.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-17 12:29:44 -07:00
A	0e5bfd830b	fix(e2e): double GCP cloud-init wait timeout to 10 minutes for Node install (#2713 ) * chore: update agent GitHub star counts * chore: update agent GitHub star counts * chore: update agent GitHub star counts * chore: update agent GitHub star counts * chore: update agent GitHub star counts * chore: update agent GitHub star counts * chore: update agent GitHub star counts * chore: update agent GitHub star counts * chore: update agent GitHub star counts * chore: update agent GitHub star counts * chore: update agent GitHub star counts * chore: update agent GitHub star counts * chore: update agent GitHub star counts * chore: update agent GitHub star counts * chore: update agent GitHub star counts * chore: update agent GitHub star counts * fix(gcp): double cloud-init wait timeout to 120 attempts (10 min) GCP startup scripts installing Node.js 22 via `n` from curl take longer than 5 min on cold starts. The previous 60-attempt (5 min) poll timed out with "Startup script may not have completed, continuing..." and proceeded to run `npm install -g @kilocode/cli` before npm was available, causing `npm: command not found` errors. Increase `maxAttempts` from 60 to 120 (10 min) in `waitForCloudInit` to give the Node install enough time to complete on GCP cold starts. Confirmed by E2E run: GCP kilocode failed with npm not found after all 60 poll attempts exhausted; all other GCP agents passed (they don't need Node). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> --------- Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-17 11:51:41 -07:00
Ahmed Abushagur	34785a9a63	feat(hermes): add YOLO mode toggle to setup menu (#2711 ) Add HERMES_YOLO_MODE as a setup option for Hermes Agent, enabled by default. This disables Hermes's security approval prompts so it can self-install skill dependencies (e.g. himalaya for email) at runtime on dedicated cloud VMs. Users can uncheck it in the setup multiselect if they prefer Hermes to prompt before installing tools. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-17 10:09:41 -07:00
A	5004a4db52	test: replace loose cloud-type count assertion with enumerated known-set check (#2709 ) The "should have a reasonable number of distinct cloud types" test used toBeGreaterThanOrEqual(2) and toBeLessThanOrEqual(10) — bounds so wide they would never catch a real type-naming mistake. Replace it with an explicit allowlist check so adding an unknown type fails immediately. Current valid types (api, cli, local) are all in the set; vm, container, sandbox, and cloud are pre-approved to avoid blocking planned additions. Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-17 09:55:15 -07:00

1 2 3 4 5 ...

484 commits