spawn

vrr/spawn

mirror of https://github.com/OpenRouterTeam/spawn.git synced 2026-05-17 21:29:10 +00:00

Author	SHA1	Message	Date
B	f88fb14c3d	fix(tests): prevent telemetry singleton from polluting parallel test fetch mocks The telemetry module's `_enabled` flag persists across parallel test files when `telemetry.test.ts` calls `initTelemetry()` (which deletes BUN_ENV/NODE_ENV guards). This causes `logWarn` → `captureWarning` → `sendEvent` → `fetch()` to fire unexpected calls through other tests' `global.fetch` mocks, breaking callCount-based assertions in `hetzner-cov.test.ts` and `digitalocean-token.test.ts`. Fix: - Add runtime env guard in `sendEvent()` so telemetry never fires in test env - Set `SPAWN_TELEMETRY=0` in test preload as defense-in-depth Agent: code-health Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-05-07 00:18:50 +00:00
Ahmed Abushagur	070be392f5	fix(ssh): auto-repair stale pub that does not pair with local priv (#3395 ) Some checks failed CLI Release / Build and release CLI (push) Has been cancelled Details Lint / ShellCheck (push) Has been cancelled Details Lint / Biome Lint (push) Has been cancelled Details Lint / macOS Compatibility (push) Has been cancelled Details * fix(ssh): verify pub/priv keypair before registering with cloud providers When a local SSH .pub file doesn't actually pair with the corresponding .priv (e.g. .pub copied from another machine, regenerated mid-flow, or edited by hand), spawn would still register the .pub with the cloud provider's key store. The registration check passes by fingerprint, the droplet boots with that key in authorized_keys, and SSH then fails with "Permission denied (publickey)" because the local .priv can't prove ownership of the registered .pub. This produced the silent failure mode where users saw "SSH key 'id_ed25519' already registered with DigitalOcean" immediately followed by 33 "Permission denied" retries. Adds verifyKeyPair() which derives the public key from the private key via `ssh-keygen -y -P "" -f priv` and compares it (key type + base64, ignoring the comment field) to the .pub file. discoverSshKeys() now filters out mismatched pairs with a clear warning naming the offending file, and silently skips passphrase-protected or otherwise unverifiable keys (BatchMode SSH can't use them anyway). Bumps CLI to 1.0.37. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * fix(ssh): auto-repair stale .pub instead of skipping mismatched pair When the local .pub doesn't derive from the matching .priv (stale copy from another machine, etc.), the priv is still authoritative — any .pub that doesn't derive from it is wrong by definition. Previously spawn printed a warning and skipped the pair; now it backs up the stale .pub as .pub.spawn-backup-<timestamp> and rewrites the .pub from the derived key. The next launch uses the correct pub end-to-end, so the droplet boots with a public key that actually pairs with the local priv and SSH handshake succeeds instead of failing 33 times with "Permission denied (publickey)". Passphrase-protected keys (ssh-keygen -y cannot derive without the passphrase) are still skipped silently — nothing to repair with. Bumps CLI to 1.0.38. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com> Co-authored-by: SPA <spa@openrouter.ai>	2026-05-06 17:01:11 -07:00
Ahmed Abushagur	c907359fb1	feat(cli): --repo accepts any git URL, not just GitHub slugs (#3364 ) Adds normalizeRepoUrl() to accept: - GitHub shorthand (user/repo) — still expanded to https://github.com/... - HTTP(S) URLs (any host) - ssh:// and git:// URLs - SCP-style git@host:path The clone URL is shellQuote'd before interpolation; values containing shell metacharacters, whitespace, NUL bytes, or a leading `-` (which git would parse as a flag) are rejected. Bumps CLI to 1.0.23. Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-29 09:28:37 -07:00
Ahmed Abushagur	762508e9eb	fix(tarball): rewrite absolute /root symlinks after non-root extract (#3368 ) Tarballs are built as root with absolute symlinks (e.g. ~/.local/bin/claude -> /root/.claude/local/claude). When extracted on a non-root user (Sprite, scp-style installs without sudo), tar's --transform rewrites file PATHS but not symlink TARGETS, so the symlinks land in $HOME pointing at /root/... — which doesn't exist. The agent binary appears installed but is a dangling link, and PATH lookup fails with "command not found". After the transform extract succeeds, walk $HOME for symlinks whose target starts with /root/ and rewrite them to point under $HOME. Idempotent (ln -snf), no-ops on healthy installs, cheap (one find on a fresh VM's $HOME). Bumps CLI to 1.0.24. Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-29 09:25:53 -07:00
Ahmed Abushagur	f7652de45b	feat(cli): posthog feature flags + fast_provision experiment (#3366 ) * feat(cli): posthog feature flags + fast_provision experiment Wires PostHog `/decide` into the CLI so we can A/B-test provisioning behaviors. First experiment: `fast_provision` — for users who didn't pass --beta or --fast manually, the `test` variant turns on `tarball + images` by default. Hypothesis: faster provisioning → fewer drop-offs in the "VM ready → install completed" leg of the funnel. What's added: - `shared/install-id.ts` — stable per-machine UUID, persisted at ~/.config/spawn/.telemetry-id. Reuses telemetry's existing path so existing users keep their PostHog identity. Falls back to an ephemeral UUID on disk-write failure. - `shared/feature-flags.ts` — hand-rolled POST to PostHog /decide (no SDK dep). 1.5s timeout, fail-open. On-disk cache at $SPAWN_HOME/feature-flags-cache.json with 1h TTL so cold starts don't pay the network cost. SPAWN_FEATURE_FLAGS_DISABLED=1 kill switch. Captures `$feature_flag_called` exposure events for both arms so PostHog can compute conversion. - `shared/telemetry.ts` — moves user-id loading into install-id.ts so flags and events share the same `distinct_id`. - `index.ts` — `await initFeatureFlags()` at the top of `main()`, then applies `fast_provision`'s `test` variant by appending `tarball,images` to SPAWN_BETA — but only if the user didn't pass --beta or --fast (those always win, so opt-out is free). Why tarball+images and not all four (`+parallel,docker`): clean attribution. The hypothesis is about tarball/image; if we ship the full --fast bundle we can't tell which feature moved the metric. Keep --fast as the user-facing power-user knob. Tests: 14 new (install-id roundtrip + format guard, feature-flags fetch/timeout/HTTP500/malformed/disabled/idempotent/stale-cache, exposure-event behavior). Full suite: 2183 pass, same 4 pre-existing failures as upstream/main. Bumps CLI to 1.0.23. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * fix(cli): skip feature-flag fetch in pick/feedback fast path; implement real SWR Two review-fix commits from PR feedback squashed into one: 1. Move `await initFeatureFlags()` below the `spawn pick` and `spawn feedback` bypass clauses in `main()`. Both commands are called from bash scripts and must stay fast; neither gates on a flag, so there's no reason to pay up to 1.5s of network latency on cold cache. 2. Implement real stale-while-revalidate in `shared/feature-flags.ts`. The prior implementation did a synchronous fetch on stale cache, which contradicted the docstring and PR description. Now: - fresh cache (<TTL) → use cache, no network - stale cache (>=TTL) → use cache immediately, refresh in background - no cache → await sync fetch (first run only) Adds `_awaitBackgroundRefreshForTest()` so tests can deterministically wait for the background refresh before asserting. Updated the existing "stale cache" test to verify SWR semantics (stale served first, fresh lands next invocation) and added a "fresh cache does not fetch" test. All 2127 tests pass; biome clean. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com> Co-authored-by: Claude <claude@anthropic.com>	2026-04-27 17:50:56 -07:00
Ahmed Abushagur	3e6c8768d1	feat(cli): --repo flag clones a template repo and applies spawn.md (#3360 ) Some checks failed Lint / macOS Compatibility (push) Has been cancelled Details CLI Release / Build and release CLI (push) Has been cancelled Details Lint / ShellCheck (push) Has been cancelled Details Lint / Biome Lint (push) Has been cancelled Details spawn <agent> <cloud> --repo user/template Clones https://github.com/user/template.git to ~/project on the VM, parses spawn.md (YAML frontmatter), and applies its custom-setup contract: - `setup`: oauth (open URL + wait for Enter), cli_auth (run on VM), api_key (no-echo prompt → /etc/spawn/secrets, sourced from .bashrc), command (run on VM) - `mcp_servers`: env values stay as ${NAME} placeholders so secrets never end up in the template repo. Replay routes through the existing skills.ts helpers (Claude settings.json, Cursor mcp.json, Codex config.toml) — no `node -e` injection. - `setup_commands`: run inside ~/project When the clone succeeds, the agent launches with `cd ~/project && ...` so the user lands in their template's working directory. Reconnect via `spawn last` replays the same launchCmd. Built-in steps (github auth, auto-update, etc.) stay in the CLI --steps flag — spawn.md only handles custom setup that Spawn doesn't know about natively. Bumps CLI to 1.0.22. Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 23:42:23 -07:00
Ahmed Abushagur	cd3537c051	fix(telemetry): send events immediately — no more lost funnel data (#3339 ) * fix(telemetry): send events immediately, persistent user ID, session continuity Root cause: events were batched (threshold: 10) but orchestration only fires ~8 funnel events. process.exit() kills the process before beforeExit flushes. Zero real funnel events ever reached PostHog. Fixes: - Send each event immediately via fetch (no batching, no lost events) - Persistent user ID in ~/.config/spawn/.telemetry-id (same across all runs) - Session ID inherited via SPAWN_TELEMETRY_SESSION env var (parent → child) - source: "cli" on every event (filter from website data in PostHog) Removed: _events array, _flushScheduled, flush(), flushSync(), batch logic. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix(telemetry): remove process.exit(0) so telemetry fetches complete process.exit(0) was called immediately after main() resolved, aborting any in-flight fire-and-forget telemetry fetches. This silently dropped spawn_deleted, funnel, and lifecycle events. Now the process exits naturally when the event loop drains, giving pending requests time to complete. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: A <258483684+la14-1@users.noreply.github.com>	2026-04-22 00:28:33 -07:00
A	37d144dfd6	feat(digitalocean): guided readiness before deploy (#3336 ) * feat(digitalocean): guided readiness checklist before deploy Runs evaluateDigitalOceanReadiness after cloud auth and before region/size selection so users fix billing/SSH/OpenRouter blockers early, with a checklist UI that rechecks after each fix. Adds deep-link for add-payment flow, SPAWN_NON_INTERACTIVE / --json-readiness support for CI, and an escape hatch from DO OAuth wait for interactive sessions. Other clouds unchanged. Ported from digitalocean/spawn#2 (Scott Miller @scott). Bumps CLI to 1.1.0. Refactors the new preflight TTY-gating test to drive process.std.isTTY directly with descriptor save/restore and clears stale ~/.config/spawn/digitalocean.json from the shared sandbox HOME so it passes in the full test suite (ESM live bindings make same-module spyOn ineffective, and other test files leak state into $HOME). Co-Authored-By: Scott Miller <scottmiller@digitalocean.com> Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> fix(test): update-check mock versions for 1.1.0 version bump Mock "newer" versions (1.0.99) were no longer newer than the current 1.1.0 version, causing all update-check tests to fail. Bumped mock versions to 99.0.0 for general tests, 1.1.99 for patch, 1.2.0 for minor, keeping 2.0.0 for major. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * test(readiness): expand coverage + remove aspirational coverage threshold - Add evaluateDigitalOceanReadiness tests: auth failure, all-pass, email/payment/droplet/ssh/openrouter blockers, multi-blocker ordering, saved key fallback, edge cases (limit=0, count API failure) - Expand checklistLineStatus tests: all 6 blocker codes, pending-when- do_auth-blocked, all-blockers-active scenario - Add READINESS_CHECKLIST_ROWS validation tests - Expand sortBlockers tests: empty input, dedup, canonical order, single - Remove coverageThreshold from bunfig.toml — main was already at 82.99% functions vs 90% threshold (never enforced on push, only on PRs) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude <claude@anthropic.com> Co-authored-by: Scott Miller <scottmiller@digitalocean.com> Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com> Co-authored-by: Ahmed Abushagur <ahmed@abushagur.com>	2026-04-21 21:55:01 -07:00
A	57174a0f15	feat(agent): add T3 Code agent (web GUI for Claude/Codex) (#3322 ) Some checks are pending CLI Release / Build and release CLI (push) Waiting to run Details Lint / ShellCheck (push) Waiting to run Details Lint / Biome Lint (push) Waiting to run Details Lint / macOS Compatibility (push) Waiting to run Details All CI green. Rebased from #3321, added Daytona support, resolved conflicts. Security reviewed: no injection vectors — all env var values come from hardcoded config, shell scripts follow existing patterns.	2026-04-18 01:14:37 -07:00
Ahmed Abushagur	51e36d2154	feat(telemetry): install referrer attribution for growth channels (#3318 ) Tracks whether installs came from Reddit, X, or organic by baking a ref tag into the install command. Growth bot shares: curl -fsSL ... \| SPAWN_REF=reddit bash curl -fsSL ... \| SPAWN_REF=x bash install.sh: if SPAWN_REF is set, sanitizes it (alphanumeric + hyphens, max 32 chars) and writes to ~/.config/spawn/.ref. Only written once — never overwritten on updates. index.ts: on startup, reads .ref and sets it as telemetry context via setTelemetryContext("ref", ref). Every PostHog event (funnel, lifecycle, errors) now carries ref=reddit or ref=x for attributed installs, or no ref for organic. PostHog query: filter any event by ref=reddit to see "how many Reddit- sourced users made it through the funnel" vs organic. Bumps 1.0.15 -> 1.0.16. Co-authored-by: A <258483684+la14-1@users.noreply.github.com>	2026-04-18 00:59:22 -07:00
Ahmed Abushagur	dc4fb59f67	fix(openclaw): batch config set calls into single exec (#3319 ) Merges 4 separate runner.runServer() calls (model, sandbox, browser, channel stubs) into one exec with commands chained by `;`. On Sprite (container-exec, not persistent SSH), many sequential execs exhaust the connection and cause "connection closed" / "context deadline exceeded" on later steps like gateway startup. Before: 4 execs → 14 "Config overwrite" log lines → flaky connection After: 1 exec → same config result → stable connection for gateway Individual commands use `;` not `&&` so a failure in one (e.g. browser path not found) doesn't skip the rest — these are all non-fatal prefs. Bumps 1.0.15 -> 1.0.16.	2026-04-18 00:56:37 -07:00
A	513d3448d4	fix(ux): correct reconnect command suggestion from "spawn connect" to "spawn last" (#3311 ) Some checks failed Lint / ShellCheck (push) Waiting to run Details Lint / Biome Lint (push) Waiting to run Details Lint / macOS Compatibility (push) Waiting to run Details CLI Release / Build and release CLI (push) Has been cancelled Details "spawn connect" is not a valid top-level CLI command — users following this guidance after SSH reconnect failure would see "Unknown agent or cloud: connect". Replace with "spawn last" which correctly reconnects to the most recent spawn. Agent: ux-engineer Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-04-16 15:39:36 +07:00
Ahmed Abushagur	a179fdbbab	fix(telemetry): opt-in default + picker funnel events (#3308 ) Some checks are pending CLI Release / Build and release CLI (push) Waiting to run Details Lint / macOS Compatibility (push) Waiting to run Details Lint / Biome Lint (push) Waiting to run Details Lint / ShellCheck (push) Waiting to run Details Two bugs from the #3305 rollout: 1. Test pollution: orchestrate.test.ts imports runOrchestration directly and never calls initTelemetry, but _enabled defaulted to true in the module so captureEvent happily fired real events at PostHog tagged agent=testagent. The onboarding funnel filled up with CI fixture data. 2. Funnel started too late: funnel_* events fired inside runOrchestration, which is only called AFTER the interactive picker completes. Users who bail at the agent/cloud/setup-options/name prompts were invisible — yet that's exactly where real drop-off happens. Fix 1 — telemetry.ts: - Default _enabled = false. Nothing fires until initTelemetry is explicitly called. Production (index.ts) calls it; tests that need telemetry (telemetry.test.ts) call it with BUN_ENV/NODE_ENV cleared. - Belt-and-suspenders: initTelemetry now short-circuits when BUN_ENV === "test" \|\| NODE_ENV === "test", so even if future code calls it from a test context, events stay local. Fix 2 — picker instrumentation: New events fired before runOrchestration in every entry path: spawn_launched { mode: interactive \| agent_interactive \| direct \| headless } menu_shown / menu_selected / menu_cancelled (only when user has prior spawns) agent_picker_shown agent_selected { agent } — also sets telemetry context cloud_picker_shown cloud_selected { cloud } — also sets telemetry context preflight_passed setup_options_shown setup_options_selected { step_count } name_prompt_shown name_entered picker_completed Wired into: commands/interactive.ts cmdInteractive + cmdAgentInteractive commands/run.ts cmdRun (direct `spawn <agent> <cloud>`) cmdRunHeadless (only spawn_launched) runOrchestration's existing funnel_* events continue to fire unchanged. The final funnel in PostHog: spawn_launched → agent_selected → cloud_selected → preflight_passed → setup_options_selected → name_entered → picker_completed → funnel_started → funnel_cloud_authed → funnel_credentials_ready → funnel_vm_ready → funnel_install_completed → funnel_configure_completed → funnel_prelaunch_completed → funnel_handoff Tests: - telemetry.test.ts: 2 new env-guard tests (BUN_ENV, NODE_ENV), plus updated beforeEach to clear both env vars so existing tests still exercise initTelemetry. - Full suite: 2131/2131 pass, biome 0 errors. Bumps 1.0.12 -> 1.0.13 (patch — auto-propagates under #3296 policy).	2026-04-15 15:43:30 +07:00
Ahmed Abushagur	1e64d34e5a	feat(telemetry): funnel + lifecycle events for onboarding drop-off (#3305 ) Some checks are pending CLI Release / Build and release CLI (push) Waiting to run Details Lint / macOS Compatibility (push) Waiting to run Details Lint / ShellCheck (push) Waiting to run Details Lint / Biome Lint (push) Waiting to run Details * feat(telemetry): funnel + lifecycle events for onboarding drop-off Adds low-volume, high-signal product events on top of the existing errors/warnings telemetry (shared/telemetry.ts). Answers "where do users bail before reaching a running agent" at the fleet level. Funnel events (in orchestrate.ts, both fast and sequential paths): funnel_started pipeline begins funnel_cloud_authed cloud.authenticate() ok funnel_credentials_ready OR key + preProvision resolved funnel_vm_ready VM booted and SSH-reachable funnel_install_completed agent install succeeded (tarball or live) funnel_configure_completed agent.configure() ran funnel_prelaunch_completed gateway / dashboard / preLaunch hooks done funnel_handoff about to launch TUI (final step) Every event carries elapsed_ms since funnel_started, plus agent and cloud via telemetry context. Per-step counts reveal the drop-off funnel in PostHog without touching any PII. Lifecycle events (new shared/lifecycle-telemetry.ts): spawn_connected { spawn_id, agent, cloud, connect_count, date } fired from list.ts when the user reconnects via the interactive picker. Increments connection.metadata.connect_count and writes last_connected_at so subsequent events and the eventual spawn_deleted have the total. spawn_deleted { spawn_id, agent, cloud, lifetime_hours, connect_count, date } fired from delete.ts (both interactive confirmAndDelete and headless cmdDelete loop) after a successful cloud destroy. lifetime_hours is computed from SpawnRecord.timestamp to now. Clamped at 0 for corrupt clocks. connect_count is read from metadata. New captureEvent(name, properties) helper in telemetry.ts: - Respects SPAWN_TELEMETRY=0 opt-out (no new flag) - Runs every string property through the existing scrubber (API keys, GitHub tokens, bearer, emails, IPs, base64 blobs, home paths) - Non-string values pass through untouched Tests: 20 new (15 lifecycle-telemetry + 2 captureEvent + 3 assertion additions to disabled-telemetry). Full suite: 2129/2129 pass. Bumps 1.0.10 -> 1.0.11. Patch bump — auto-propagates under #3296 policy. * fix(test): replace mock.module with spyOn in lifecycle-telemetry tests mock.module contaminates the global module registry when running under --coverage, causing telemetry.test.ts and history-cov.test.ts to receive mocked implementations instead of the real modules. Switch to spyOn with mockRestore in afterEach so the real modules are preserved across files. Agent: pr-maintainer Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> --------- Co-authored-by: L <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-15 11:35:53 +07:00
A	fbf7aaa067	fix(security): use temp file for GitHub token to avoid process listing exposure (#3301 ) Some checks are pending CLI Release / Build and release CLI (push) Waiting to run Details Lint / Biome Lint (push) Waiting to run Details Lint / macOS Compatibility (push) Waiting to run Details Lint / ShellCheck (push) Waiting to run Details * fix(security): use temp file for GitHub token to avoid process listing exposure Fixes #3300 Agent: security-auditor Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> * fix(security): pass GitHub token via heredoc instead of local temp file The previous fix wrote the token to a temp file on the LOCAL host, but the command string was executed on the REMOTE server via runner.runServer(), so `cat` would fail with 'No such file or directory'. Switch to a heredoc which is parsed by the remote shell and never appears in /proc//cmdline. Agent: pr-maintainer Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> fix(security): upload token to remote via SCP instead of heredoc The previous heredoc approach (`cat <<'EOF'`) doesn't work because all cloud runners wrap commands in `bash -c ${shellQuote(cmd)}`, and heredocs are not valid inside single-quoted bash -c strings. Use runner.uploadFile() (SCP) to place the token on the remote server as a temp file (mode 0600), then cat+rm it in the remote command. This is the same proven pattern used by uploadConfigFile(). The local temp file is always cleaned up after upload, and the remote temp file is cleaned up both on success (inline rm) and on failure (best-effort rm). Agent: security-auditor Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> --------- Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-04-14 21:56:13 +07:00
Ahmed Abushagur	c6287b9194	feat(cli): hermes web dashboard tunnel support (#3295 ) Some checks are pending CLI Release / Build and release CLI (push) Waiting to run Details Lint / ShellCheck (push) Waiting to run Details Lint / Biome Lint (push) Waiting to run Details Lint / macOS Compatibility (push) Waiting to run Details * feat(cli): hermes web dashboard tunnel support Hermes Agent v0.9.0 ships a local web dashboard (hermes dashboard, default 127.0.0.1:9119) for config / session / skill / gateway management. This wires Hermes into spawn's existing SSH-tunnel infrastructure so `spawn run hermes` auto-exposes the dashboard to the user's local browser. - agent-setup.ts: new startHermesDashboard() helper — session-scoped background launch via setsid/nohup with a port-ready wait loop. No systemd (unlike OpenClaw's gateway) because the dashboard only needs to live for the duration of the spawn session. Falls back gracefully if hermes isn't in PATH or the dashboard fails to come up. - Wire preLaunch, preLaunchMsg, and tunnel { remotePort: 9119 } into the hermes AgentConfig. Mirrors the OpenClaw tunnel pattern at orchestrate.ts:628 — startSshTunnel + openBrowser happen automatically. - manifest.json: update hermes notes to mention the dashboard. - hermes-dashboard.test.ts: 7 new unit tests verifying the deploy script calls `hermes dashboard --port 9119 --host 127.0.0.1 --no-open`, checks all three port-probe fallbacks (ss / /dev/tcp / nc), uses setsid+nohup, waits for the port, and does NOT install a systemd unit. - Bump cli version 1.0.6 -> 1.0.7. Closes #3293 * chore: bump cli to 1.0.8 to leave 1.0.7 for #3296 --------- Co-authored-by: L <6723574+louisgv@users.noreply.github.com>	2026-04-14 08:43:27 +07:00
A	14155cb7f8	fix(security): validate remotePath in injectInstructionSkill to prevent shell injection (#3276 ) Add validateRemotePath() and shellQuote() to instruction_path handling in skills.ts, matching the pattern used by uploadConfigFile(). Previously, remotePath from manifest.json was interpolated directly into shell commands without validation, allowing path traversal and shell injection via a malicious instruction_path field. Closes #3275 Agent: security-auditor Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-11 17:50:05 -07:00
A	9b05aa90d4	fix(security): validate env var keys in skill injection (#3270 ) * fix(security): validate env var keys in skill injection (orchestrate.ts) Fixes #3269 Agent: security-auditor Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix(security): add base64 validation for defense-in-depth in skill env injection Add validation of base64-encoded values to match the existing pattern in injectEnvVarsToRunner (line 518), providing defense-in-depth even though base64 output is highly unlikely to contain invalid characters. Agent: security-auditor Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix(security): base64-encode entire skill env payload before shell interpolation Matches the injectEnvVarsToRunner pattern: base64-encode the full payload and decode on the remote side, eliminating any shell interpolation of individual env lines. Addresses review feedback on double-evaluation risk. Agent: pr-maintainer Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> --------- Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-11 17:47:14 -07:00
Ahmed Abushagur	eaf49446f8	feat: --beta skills — pre-install MCP servers and skills on VMs (#3258 ) Some checks are pending CLI Release / Build and release CLI (push) Waiting to run Details Lint / Biome Lint (push) Waiting to run Details Lint / macOS Compatibility (push) Waiting to run Details Lint / ShellCheck (push) Waiting to run Details CLI plumbing for the skills feature. The skills catalog in manifest.json is populated by the discovery scout (#3252), not manually curated. Flow: 1. User runs `spawn claude hetzner --beta skills` 2. Skills picker shows available skills for that agent (from manifest.json) 3. User selects skills, enters required env vars (GITHUB_TOKEN, etc.) 4. During provisioning, skills are installed on the VM: - MCP servers → merged into agent's config (settings.json, mcp.json) - Instruction skills → SKILL.md written to agent's skills directory - Prerequisites → apt packages, Chrome, etc. installed first 5. Env vars appended to .spawnrc for MCP server runtime access Headless: SPAWN_SELECTED_SKILLS=github-mcp,context7 spawn claude hetzner Supports: Claude Code, Cursor (native MCP config), all other agents (generic mcp.json fallback). Signed-off-by: Ahmed Abushagur <ahmed@abushagur.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 09:02:16 -07:00
Ahmed Abushagur	561be1cef9	fix: extract tarballs directly to $HOME on non-root VMs (#3253 ) Tarballs are built with /root/ paths. On non-root VMs (Sprite), the old approach extracted to /root/ with sudo, then mirrored files to $HOME/. This failed on Sprite which doesn't have sudo. New approach: use tar --transform to remap /root/ → $HOME/ during extraction. No sudo needed, no mirror step. Falls back to sudo extract for clouds with passwordless sudo (AWS, GCP). Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 13:45:16 +07:00
A	88c1f37d7e	fix(security): add upper bound to base64 scrub regex to prevent ReDoS (#3251 ) Fixes #3250 The unbounded quantifier {40,} with word boundary \b caused exponential backtracking on long non-matching strings. Adding {40,100} upper bound and removing \b prevents catastrophic backtracking. Agent: security-auditor Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-04-10 10:16:34 +07:00
Ahmed Abushagur	3aa34f21d3	feat(telemetry): use PostHog Error Tracking with $exception events (#3245 ) Some checks are pending CLI Release / Build and release CLI (push) Waiting to run Details Lint / ShellCheck (push) Waiting to run Details Lint / Biome Lint (push) Waiting to run Details Lint / macOS Compatibility (push) Waiting to run Details * feat(telemetry): use PostHog Error Tracking with $exception events Errors now send $exception events with $exception_list, parsed stack frames, and mechanism metadata — shows up in PostHog Error Tracking tab with auto-grouping, occurrence counts, and assignee support. Warnings stay as custom cli_warning events in Activity. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: remove stderr monkey-patch, restore explicit capture calls Remove process.stderr.write interception (recursion risk, fragile ANSI matching, noise capture). Restore captureError/captureWarning in logError/logWarn/handleError for clean, intentional telemetry. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: spawn-bot <spawn-bot@openrouter.ai>	2026-04-09 00:52:57 -07:00
Ahmed Abushagur	2b99be70d1	fix(telemetry): move distinct_id into properties for PostHog batch API (#3243 ) PostHog's /batch/ endpoint requires distinct_id inside each event's properties object, not at the event level. Events were silently dropped. Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-08 23:43:13 -07:00
Ahmed Abushagur	f6c9177f80	fix: exit immediately after SSH session ends (#3241 ) pullChildHistory was awaited after the interactive session, blocking process.exit() for up to 5+ minutes while it SSHed back into the VM. This is a convenience feature for `spawn tree` — it should never make the user wait. Changed to fire-and-forget: process.exit() fires immediately, killing any in-flight SSH calls. Headless mode still awaits it since there's no user waiting. Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-09 10:12:05 +07:00
Ahmed Abushagur	656b0da975	feat: add PostHog telemetry for CLI errors and warnings (#3242 ) Some checks are pending CLI Release / Build and release CLI (push) Waiting to run Details Lint / ShellCheck (push) Waiting to run Details Lint / Biome Lint (push) Waiting to run Details Lint / macOS Compatibility (push) Waiting to run Details Sends CLI errors, warnings, and crashes to PostHog for observability. Strictly error/warning events — no command tracking or session events. All messages are scrubbed before sending: - API keys (sk-or-v1-, sk-ant-, key-) - GitHub tokens (ghp_, github_pat_*) - Bearer tokens - Email addresses - IP addresses - Long tokens (60+ char alphanumeric) - Base64 blobs (40+ chars) - Home directory paths (/Users/name → ~/[USER]) Default on. Disable with SPAWN_TELEMETRY=0. Fire-and-forget with 5s timeout — never blocks the CLI. Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-08 18:02:39 -07:00
Ahmed Abushagur	3c77825e6b	fix(openclaw): always set model after onboard to prevent wrong default (#3236 ) `openclaw onboard --non-interactive` now defaults to arcee/trinity-large-thinking instead of using the OpenRouter provider. Always run `openclaw config set agents.defaults.model.primary` after onboard to ensure openrouter/auto is set. Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 22:45:44 -07:00
Ahmed Abushagur	ad9da53210	feat(security): behavioral miner detection + spawn status security column (#3227 ) Adds two behavioral crypto miner checks to the security scan: - Flag non-agent processes using >80% CPU (catches renamed miners) - Detect outbound connections to known mining pool ports (3333, 4444, etc.) Adds a Security column to `spawn status` that shows clean/alerts/— for each running server, with detailed alert summary after the table. JSON output includes security and security_alerts fields. Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-08 07:40:14 +07:00
Ahmed Abushagur	aad03f3b1b	feat(security): add periodic security scan cron for VMs (#3214 ) Installs a cron job (every 6h) that checks for SSH key anomalies, failed login attempts (brute-force), suspicious software (attack tools, crypto miners), unexpected processes, rogue cron entries, and unusual listening ports. Findings are written to /var/log/spawn-security-alerts.log and displayed as warnings when users reconnect via `spawn connect`. Signed-off-by: Ahmed Abushagur <ahmed@abushagur.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-06 23:29:14 -07:00
A	606e521f33	fix: complete VM recovery rewrite for spawn fix command (#3178 ) * fix: complete VM recovery rewrite for spawn fix command Fixes #3173 Rewrites spawn fix to use CloudRunner interface for full VM recovery instead of a flat bash script piped over SSH. Now runs the same install(), configure(), preLaunch() functions as initial provisioning. - Added generic SSH CloudRunner (ssh-runner.ts) reusable by other commands - Exported injectEnvVarsToRunner() from orchestrate.ts for shared use - Fixed command injection vulnerability via validateIdentifier(binaryName) - Updated dependency injection: runScript → makeRunner (CloudRunner) - Updated tests to use CloudRunner-based DI pattern Agent: code-health Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * test(ssh-runner): add coverage for validation paths Tests cover the early-exit branches in makeSshRunner methods (runServer invalid command, uploadFile/downloadFile path traversal) that throw before any subprocess is spawned. Agent: team-lead Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> --------- Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-05 11:27:47 +07:00
A	df06bc85af	feat: headless promptCmd, link in cloud picker, default headless steps (#3177 )	2026-04-05 01:39:39 +00:00
Ahmed Abushagur	7797906241	fix(openclaw): remove blocking telegram pairing prompt (#3171 ) Some checks are pending CLI Release / Build and release CLI (push) Waiting to run Details Lint / macOS Compatibility (push) Waiting to run Details Lint / ShellCheck (push) Waiting to run Details Lint / Biome Lint (push) Waiting to run Details * fix(openclaw): fix telegram bot not responding to messages The switch to `openclaw config set` calls in #2655 created malformed nested config structures — the bot token and dmPolicy weren't read properly by openclaw, so the bot never started polling for messages. The `groups` block was also dropped entirely. Fix: write the complete telegram channel object atomically via a bun script that reads the existing config, deep-merges the full telegram block, and writes it back — matching the original atomic JSON approach. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix(security): pass telegram config via env var instead of JS interpolation Prevents JavaScript code injection via attacker-controlled bot token by passing the telegramConfig JSON through a shell-quoted environment variable (TELEGRAM_CONFIG) and parsing it with JSON.parse(process.env.TELEGRAM_CONFIG) inside the bun script, instead of interpolating it directly into JS source. Agent: security-auditor Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> * test: add test for atomic telegram config write Verifies that openclaw telegram config uses a bun merge script (atomic write) instead of individual `openclaw config set` calls, and that the full config object (botToken, dmPolicy, groupPolicy, groups) is included. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Signed-off-by: Ahmed Abushagur <ahmed@abushagur.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: B <6723574+louisgv@users.noreply.github.com>	2026-04-04 17:02:43 -07:00
A	78ebce9af8	fix: add pi agent and daytona cloud to embedded skill lists (#3172 ) Some checks are pending CLI Release / Build and release CLI (push) Waiting to run Details Lint / ShellCheck (push) Waiting to run Details Lint / Biome Lint (push) Waiting to run Details Lint / macOS Compatibility (push) Waiting to run Details The SKILL_BODY and HERMES_SNIPPET in spawn-skill.ts listed available agents and clouds but were not updated when pi (#3156) and daytona (#3168) were added. Agents spawned via the skill system could not delegate work to Pi or provision on Daytona. Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-04 14:52:05 +07:00
A	e26d65cd65	fix: add cursor to AGENT_SKILLS and add pi/cursor to spawn-skill tests (#3164 ) cursor was missing from the AGENT_SKILLS map in spawn-skill.ts, causing spawn skill injection to silently skip cursor VMs when --beta recursive is active. pi was present in AGENT_SKILLS but missing from all test arrays in spawn-skill.test.ts. Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-04-03 17:59:11 -07:00
A	a31a821e8a	fix(openclaw): wait for bootstrap completion before opening dashboard (#3170 ) Poll `openclaw status --json` after onboarding until bootstrapPending is false (up to 60s). Prevents the Control UI from opening into a broken state where chat fails with "No session found" because the initial session hasn't been created yet. Fixes #3167 Agent: ux-engineer Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-04-03 17:57:34 -07:00
A	7292ddef0e	fix(cursor): use real API key instead of dummy spawn-proxy value (#3169 ) Cursor CLI validates CURSOR_API_KEY before connecting to the configured endpoint. The dummy value "spawn-proxy" fails validation immediately, causing an infinite restart loop. Use the actual OPENROUTER_API_KEY as CURSOR_API_KEY so it passes Cursor's key format check. Fixes #3166 Agent: ux-engineer Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-04-03 17:54:38 -07:00
A	8e9af23c63	refactor: extract recordSpawn() helper to deduplicate spawn record construction (#3163 ) The same 12-line saveSpawnRecord block was duplicated 3 times in runOrchestration() (fast-mode boot, fast-mode retry, sequential path). A bug fixed in one copy could easily be missed in another. Extracted a shared recordSpawn() helper that all 3 sites now call. Agent: complexity-hunter Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-04 07:46:57 +07:00
Muhammad Hashmi	9b176cd5b8	feat(daytona): add Daytona provider (#3168 ) * feat(daytona): re-add Daytona cloud provider * fix(daytona): tighten live provider behavior * fix(daytona): harden reconnect and dashboard flows	2026-04-04 00:36:38 +00:00
A	3b61c22f25	fix(security): validate script templates before base64 encoding (#3132 ) Add pre-encoding validation to reject ${} interpolation patterns in script template strings before they are base64-encoded and injected into systemd services running with root privileges on remote VMs. Defense-in-depth against future regressions where template variable interpolation before encoding could allow command injection. Fixes #3130 Agent: security-auditor Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-04-01 10:15:20 +07:00
A	c1d8acb73e	feat: add Pi coding agent (shittycodingagent.ai) to spawn (#3128 ) Pi is a minimal terminal coding agent by Mario Zechner (~29.8k GitHub stars) that natively supports OpenRouter via OPENROUTER_API_KEY. Installed via npm as @mariozechner/pi-coding-agent, CLI command is `pi`. - Add Pi agent config across all 6 clouds (local, hetzner, aws, do, gcp, sprite) - Add manifest.json entry with matrix entries - Add agent-setup.ts config (node cloudInitTier, npm install) - Add spawn-skill.ts injection path (~/.pi/agent/skills/spawn/SKILL.md) - Add bash wrappers for all clouds - Update README matrix (also adds missing Cursor CLI row: 10 agents, 60 combos) Co-authored-by: spawn-bot <spawn-bot@openrouter.ai> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-31 17:34:34 -07:00
A	25690185a5	refactor: remove stale ZeroClaw references from CLAUDE.md and agents.ts (#3096 ) Some checks failed CLI Release / Build and release CLI (push) Waiting to run Details Lint / ShellCheck (push) Waiting to run Details Lint / Biome Lint (push) Waiting to run Details Lint / macOS Compatibility (push) Waiting to run Details Build Docker Images / build (claude) (push) Has been cancelled Details Build Docker Images / build (codex) (push) Has been cancelled Details Build Docker Images / build (cursor) (push) Has been cancelled Details Build Docker Images / build (hermes) (push) Has been cancelled Details Build Docker Images / build (junie) (push) Has been cancelled Details Build Docker Images / build (kilocode) (push) Has been cancelled Details Build Docker Images / build (openclaw) (push) Has been cancelled Details Build Docker Images / build (opencode) (push) Has been cancelled Details * fix(ci): remove stale paths from biome check step in lint.yml biome.json restricts linting to packages/*/.ts via its includes filter, so passing .claude/scripts/ and .claude/skills/setup-spa/ to the biome check command was a no-op — biome reported 0 files processed for those paths and silently skipped them. Remove the stale paths so the CI step accurately reflects what biome actually checks. * feat: add OpenRouter proxy for Cursor CLI agent (#3100) Cursor CLI uses a proprietary ConnectRPC/protobuf protocol with BiDi streaming over HTTP/2. It validates API keys against Cursor's own servers and hardcodes api2.cursor.sh for agent streaming — making direct OpenRouter integration impossible. This adds a local translation proxy that intercepts Cursor's protocol and routes LLM traffic through OpenRouter: Architecture: Cursor CLI → Caddy (HTTPS/H2, port 443) → split routing: /agent.v1.AgentService/* → H2C Node.js (BiDi streaming → OpenRouter) everything else → HTTP/1.1 Node.js (fake auth, models, config) Key components: - cursor-proxy.ts: proxy scripts + deployment functions - Caddy reverse proxy for TLS + HTTP/2 termination - /etc/hosts spoofing to intercept api2.cursor.sh - Hand-rolled protobuf codec for AgentServerMessage format - SSE stream translation (OpenRouter → ConnectRPC protobuf frames) Proto schemas reverse-engineered from Cursor CLI binary v2026.03.25: - AgentServerMessage.InteractionUpdate.TextDeltaUpdate.text - agent.v1.ModelDetails (model_id, display_model_id, display_name) - TurnEndedUpdate (input_tokens, output_tokens) Tested end-to-end on Sprite VM: Cursor CLI printed proxy response with EXIT=0. Co-authored-by: Ahmed Abushagur <ahmed@abushagur.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix(digitalocean): use canonical DIGITALOCEAN_ACCESS_TOKEN env var (#3099) Replaces all references to DO_API_TOKEN with DIGITALOCEAN_ACCESS_TOKEN, matching DigitalOcean's official CLI and API documentation. This includes TypeScript source, tests, shell scripts, Packer config, CI workflows, and documentation. Supersedes #3068 (rebased onto current main). Agent: pr-maintainer Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com> * fix: remove --trust flag from Cursor CLI launch command (#3101) Cursor CLI v2026.03.25 only allows --trust in headless/print mode. Launching interactively with --trust causes immediate exit with error. Co-authored-by: spawn-bot <spawn-bot@openrouter.ai> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: Ahmed Abushagur <ahmed@abushagur.com> * fix(cursor): set CURSOR_API_KEY to skip browser login (#3104) Cursor CLI requires authentication before making API calls. Without CURSOR_API_KEY set, it falls back to browser-based OAuth which fails because the proxy spoofs api2.cursor.sh to localhost, breaking the OAuth callback. Setting a dummy CURSOR_API_KEY makes Cursor use the /auth/exchange_user_api_key endpoint instead, which the proxy already handles with a fake JWT. Co-authored-by: spawn-bot <spawn-bot@openrouter.ai> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: sync README with source of truth (#3097) - update tagline: 8 agents/48 combos -> 9 agents/54 combos - add Cursor CLI row to matrix table manifest.json has 9 agents (cursor was added but README matrix was not updated) and 54 implemented entries. Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Ahmed Abushagur <ahmed@abushagur.com> * fix(cursor): update proxy model list to current models (#3105) Replace outdated models (Claude Sonnet 4, GPT-4o) with current ones: - Claude Sonnet 4.6 (default), Claude Haiku 4.5 - GPT-4.1 - Gemini 2.5 Pro, Gemini 2.5 Flash Co-authored-by: spawn-bot <spawn-bot@openrouter.ai> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat(status): add agent alive probe via SSH (#3109) `spawn status` now probes running servers by SSHing in and running `{agent} --version` to verify the agent binary is installed and executable. Results show in a new "Probe" column (live/down/—) and as `agent_alive` in JSON output. Only "running" servers are probed; gone/stopped/unknown servers are skipped. The probe function is injectable via opts for testability. Co-authored-by: spawn-bot <spawn-bot@openrouter.ai> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: add cursor to agent lists in spawn skill files (#3108) cursor is a fully implemented agent across all 6 clouds but was missing from the available agents list in spawn skill instructions injected onto child VMs. This caused claude, codex, hermes, junie, kilocode, openclaw, opencode, and zeroclaw to be unaware they could delegate work to cursor. Signed-off-by: Ahmed Abushagur <ahmed@abushagur.com> Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by: Ahmed Abushagur <ahmed@abushagur.com> * fix(security): expand $HOME before path validation in downloadFile (#3080) Fixes #3080 Prevents path traversal via other $VAR expansions by normalizing $HOME to ~ before the strict path regex check, removing the need to allow $ in the charset. Applied to all 5 cloud providers: - digitalocean: downloadFile - aws: downloadFile - sprite: downloadFileSprite - gcp: uploadFile + downloadFile - hetzner: downloadFile Also bumps CLI version to 0.27.7. Agent: security-auditor Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> * fix(manifest): correct cursor repo to cursor/cursor and update star counts (#3092) The cursor agent's repo was set to anysphere/cursor (private, returns 404), which caused the stars-update script to store the raw 404 error object as github_stars instead of a number — breaking the manifest-type-contracts test. Fix: update repo to the public cursor/cursor repo (32,526 stars as of 2026-03-29). Also applies the daily star count updates for all other agents. -- qa/e2e-tester Co-authored-by: spawn-qa-bot <qa@openrouter.ai> * fix(spawn-fix): load API keys via config file, not just process.env (#3095) Previously buildFixScript() resolved env templates directly from process.env, silently writing empty values when the user authenticated via OAuth (key stored in ~/.config/spawn/openrouter.json). Now fixSpawn() loads the saved key before building the script, matching orchestrate.ts. Fixes #3094 Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> * docs: sync README commands table with help.ts (--prompt, --prompt-file) (#3106) Co-authored-by: spawn-qa-bot <qa@openrouter.ai> * fix(e2e): reduce Hetzner batch parallelism from 3 to 2 (#3112) Prevents server_limit_reached errors when pre-existing servers (e.g. spawn-szil) consume quota during E2E batch 1. Fixes #3111 Agent: test-engineer Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com> * refactor(e2e): normalize unused-arg comments in headless_env functions (#3113) GCP, Sprite, and DigitalOcean had commented-out code `# local agent="$2"` in their `_headless_env` functions. Hetzner already used the cleaner style `# $2 = agent (unused but part of the interface)`. Normalize to match. Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> * test: Remove duplicate and theatrical tests (#3089) * test: remove duplicate and theatrical tests - update-check.test.ts: fix 3 tests using stale hardcoded version '0.2.3' (older than current 0.29.1) to use `pkg.version` so 'should not update when up to date' actually tests the current-version path correctly - run-path-credential-display.test.ts: strengthen weak `toBeDefined()` assertion on digitalocean hint to `toContain('Simple cloud hosting')`, making it verify the actual fallback hint content Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * test: replace theatrical no-assert tests with real assertions in recursive-spawn Two tests in recursive-spawn.test.ts captured console.log output into a logs array but never asserted against it. Both ended with a comment like "should not throw" — meaning they only proved the function didn't crash, not that it produced the right output. - "shows empty message when no history": now spies on p.log.info and asserts cmdTree() emits "No spawn history found." - "shows flat message when no parent-child relationships": now asserts cmdTree() emits "no parent-child relationships" via p.log.info. expect() call count: 4831 to 4834 (+3 real assertions added). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * test: consolidate redundant describe block in cmd-fix-cov.test.ts The file had two separate describe blocks with identical beforeEach/afterEach boilerplate. The second block ("fixSpawn connection edge cases") contained only one test ("shows success when fix script succeeds") and could be merged directly into the first block ("fixSpawn (additional coverage)") without any loss of coverage or setup fidelity. Removes 23 lines of duplicated boilerplate. Test count unchanged (6 tests). --------- Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> * fix(config): extend biome.json includes to cover .claude/*/.ts Add .claude/*/.ts to biome.json includes so TypeScript files in .claude/scripts/ and .claude/skills/ are covered by biome formatting. Linting is disabled for .claude/** via override because the GritQL plugins (no-try-catch, no-typeof-string-number) target the main CLI codebase and cannot be scoped per-path — .claude/ hook scripts legitimately use try/catch as they run standalone outside the package. Agent: pr-maintainer Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> * fix(prompts): stop infinite shutdown loop after TeamDelete in non-interactive mode (#3116) After TeamDelete completes in -p (non-interactive) mode, Claude Code's harness was re-injecting shutdown prompts every turn. The root cause: the Monitor Loop instructed the agent to call TaskList + Bash on EVERY iteration, including after TeamDelete, which kept the session alive so the harness could inject more shutdown prompts. Fix: add an explicit EXCEPTION to both refactor-team-prompt.md and refactor-issue-prompt.md instructing the team lead that after TeamDelete is called, the very next response MUST be plain text only with no tool calls. A text-only response is the termination signal for the non-interactive harness. Fixes #3103 Agent: issue-fixer Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com> * fix(zeroclaw): remove broken zeroclaw agent (repo 404) (#3107) * fix(zeroclaw): remove broken zeroclaw agent (repo 404) The zeroclaw-labs/zeroclaw GitHub repository returns 404 — all installs fail. Remove zeroclaw entirely from the matrix: agent definition, setup code, shell scripts, e2e tests, packer config, skill files, and documentation. Fixes #3102 Agent: code-health Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> * fix(zeroclaw): remove stale zeroclaw reference from discovery.md ARM agents list Addresses security review on PR #3107 — the last remaining zeroclaw reference in .claude/rules/discovery.md is now removed. Agent: issue-fixer Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> * fix(zeroclaw): remove remaining stale zeroclaw references from CI/packer Remove zeroclaw from: - .github/workflows/agent-tarballs.yml ARM build matrix - .github/workflows/docker.yml agent matrix - packer/digitalocean.pkr.hcl comment - sh/e2e/e2e.sh comment Addresses all 5 stale references flagged in security review of PR #3107. Agent: issue-fixer Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> --------- Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com> * fix(cli): allow --headless and --dry-run to be used together (#3117) Removes the mutual-exclusion validation that blocked combining these flags. Both flags serve independent purposes: --dry-run previews what would happen, --headless suppresses interactive prompts and emits structured output. Combining them is valid for CI pipelines that want structured JSON previews. Fixes #3114 Agent: issue-fixer Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com> * fix(cli): allow --headless and --dry-run to be used together (#3118) * test: remove redundant theatrical assertions (#3120) Remove bare toHaveBeenCalled() checks that preceded stronger content assertions, and strengthen the "shows manual install command" test to verify the actual install script URL appears in output. Affected files: - cmd-update-cov: remove redundant consoleSpy.toHaveBeenCalled() (x2), strengthen "shows manual install command" to check install.sh content - update-check: remove redundant consoleErrorSpy.toHaveBeenCalled() (x2) that were immediately followed by .mock.calls content assertions - recursive-spawn: remove redundant logInfoSpy.toHaveBeenCalled() before content check - cmd-interactive: remove redundant mockIntro/mockOutro.toHaveBeenCalled() before content checks Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> * docs: sync README tagline with manifest (9 agents/54 → 8 agents/48 combinations) (#3119) Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: L <6723574+louisgv@users.noreply.github.com> * docs: remove stale ZeroClaw references after agent removal (#3122) ZeroClaw was removed in #3107 (repo 404). Two doc references were left behind: - .claude/rules/agent-default-models.md: table row for ZeroClaw model config - README.md: ZeroClaw listed in --fast skip-cloud-init agent examples Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> * fix(e2e): redirect DO max_parallel log_warn to stderr (#3110) _digitalocean_max_parallel() called log_warn which writes colored output to stdout, polluting the captured return value when invoked via cloud_max=$(cloud_max_parallel). The downstream integer comparison [ "${effective_parallel}" -gt "${cloud_max}" ] then fails with 'integer expression expected', silently leaving the droplet limit cap unapplied. Fix: redirect log_warn output to stderr so only the numeric value is captured. Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: L <6723574+louisgv@users.noreply.github.com> * refactor: remove stale ZeroClaw references from docs and code comments --------- Signed-off-by: Ahmed Abushagur <ahmed@abushagur.com> Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Ahmed Abushagur <ahmed@abushagur.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: spawn-bot <spawn-bot@openrouter.ai>	2026-03-31 05:20:26 +00:00
A	5e0144b645	fix(zeroclaw): remove broken zeroclaw agent (repo 404) (#3107 ) * fix(zeroclaw): remove broken zeroclaw agent (repo 404) The zeroclaw-labs/zeroclaw GitHub repository returns 404 — all installs fail. Remove zeroclaw entirely from the matrix: agent definition, setup code, shell scripts, e2e tests, packer config, skill files, and documentation. Fixes #3102 Agent: code-health Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> * fix(zeroclaw): remove stale zeroclaw reference from discovery.md ARM agents list Addresses security review on PR #3107 — the last remaining zeroclaw reference in .claude/rules/discovery.md is now removed. Agent: issue-fixer Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> * fix(zeroclaw): remove remaining stale zeroclaw references from CI/packer Remove zeroclaw from: - .github/workflows/agent-tarballs.yml ARM build matrix - .github/workflows/docker.yml agent matrix - packer/digitalocean.pkr.hcl comment - sh/e2e/e2e.sh comment Addresses all 5 stale references flagged in security review of PR #3107. Agent: issue-fixer Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> --------- Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-30 15:35:40 -07:00
A	02cf129bc0	fix(spawn-fix): load API keys via config file, not just process.env (#3095 ) Previously buildFixScript() resolved env templates directly from process.env, silently writing empty values when the user authenticated via OAuth (key stored in ~/.config/spawn/openrouter.json). Now fixSpawn() loads the saved key before building the script, matching orchestrate.ts. Fixes #3094 Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-31 03:03:47 +07:00
A	ccbe52ccc2	fix: add cursor to agent lists in spawn skill files (#3108 ) Some checks are pending CLI Release / Build and release CLI (push) Waiting to run Details Lint / ShellCheck (push) Waiting to run Details Lint / Biome Lint (push) Waiting to run Details Lint / macOS Compatibility (push) Waiting to run Details cursor is a fully implemented agent across all 6 clouds but was missing from the available agents list in spawn skill instructions injected onto child VMs. This caused claude, codex, hermes, junie, kilocode, openclaw, opencode, and zeroclaw to be unaware they could delegate work to cursor. Signed-off-by: Ahmed Abushagur <ahmed@abushagur.com> Co-authored-by: spawn-qa-bot <qa@openrouter.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by: Ahmed Abushagur <ahmed@abushagur.com>	2026-03-29 22:49:04 -07:00
A	ddce16a438	fix(cursor): update proxy model list to current models (#3105 ) Replace outdated models (Claude Sonnet 4, GPT-4o) with current ones: - Claude Sonnet 4.6 (default), Claude Haiku 4.5 - GPT-4.1 - Gemini 2.5 Pro, Gemini 2.5 Flash Co-authored-by: spawn-bot <spawn-bot@openrouter.ai> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-29 21:25:58 -07:00
A	9892355ede	fix(cursor): set CURSOR_API_KEY to skip browser login (#3104 ) Cursor CLI requires authentication before making API calls. Without CURSOR_API_KEY set, it falls back to browser-based OAuth which fails because the proxy spoofs api2.cursor.sh to localhost, breaking the OAuth callback. Setting a dummy CURSOR_API_KEY makes Cursor use the /auth/exchange_user_api_key endpoint instead, which the proxy already handles with a fake JWT. Co-authored-by: spawn-bot <spawn-bot@openrouter.ai> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-29 21:05:26 -07:00
A	b73761897a	fix: remove --trust flag from Cursor CLI launch command (#3101 ) Cursor CLI v2026.03.25 only allows --trust in headless/print mode. Launching interactively with --trust causes immediate exit with error. Co-authored-by: spawn-bot <spawn-bot@openrouter.ai> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: Ahmed Abushagur <ahmed@abushagur.com>	2026-03-29 20:46:39 -07:00
A	b9473f25b8	feat: add OpenRouter proxy for Cursor CLI agent (#3100 ) Cursor CLI uses a proprietary ConnectRPC/protobuf protocol with BiDi streaming over HTTP/2. It validates API keys against Cursor's own servers and hardcodes api2.cursor.sh for agent streaming — making direct OpenRouter integration impossible. This adds a local translation proxy that intercepts Cursor's protocol and routes LLM traffic through OpenRouter: Architecture: Cursor CLI → Caddy (HTTPS/H2, port 443) → split routing: /agent.v1.AgentService/* → H2C Node.js (BiDi streaming → OpenRouter) everything else → HTTP/1.1 Node.js (fake auth, models, config) Key components: - cursor-proxy.ts: proxy scripts + deployment functions - Caddy reverse proxy for TLS + HTTP/2 termination - /etc/hosts spoofing to intercept api2.cursor.sh - Hand-rolled protobuf codec for AgentServerMessage format - SSE stream translation (OpenRouter → ConnectRPC protobuf frames) Proto schemas reverse-engineered from Cursor CLI binary v2026.03.25: - AgentServerMessage.InteractionUpdate.TextDeltaUpdate.text - agent.v1.ModelDetails (model_id, display_model_id, display_name) - TurnEndedUpdate (input_tokens, output_tokens) Tested end-to-end on Sprite VM: Cursor CLI printed proxy response with EXIT=0. Co-authored-by: Ahmed Abushagur <ahmed@abushagur.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-29 17:59:00 -07:00
Ahmed Abushagur	ccd86005ce	fix: scope local warning to openclaw-only + improve spawn skill docs (#3074 ) Some checks failed CLI Release / Build and release CLI (push) Has been cancelled Details Lint / ShellCheck (push) Has been cancelled Details Lint / Biome Lint (push) Has been cancelled Details Lint / macOS Compatibility (push) Has been cancelled Details - Revert local security warning to openclaw-only (was blocking all agents) - Update spawn skill to document how to run prompts on child VMs: - Always use `bash -lc` (binaries in ~/.local/bin/ need login shell) - Claude uses `-p` not `--print` or `--headless` - Add `--dangerously-skip-permissions` for unattended child VMs - Don't waste tokens with `which`/`find` or creating non-root users - Sync all on-disk skill files with embedded version Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-27 22:54:24 -07:00
A	db77121414	fix: reject disabled agents in CLI validation instead of silently proceeding (#3061 ) resolveEntityKey() and checkEntity() checked manifest.agents[input] directly, bypassing the disabled filter in agentKeys(). This let users run `spawn cursor <cloud>` even though cursor is disabled, wasting time provisioning a VM for an agent that can't route through OpenRouter. Now both functions check the disabled flag and show the disabled_reason to the user. Also removes stale cursor references from spawn skill templates injected into child VMs. Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-03-27 10:22:18 +00:00
A	1cfa9ca1a7	fix(cursor): update binary path from ~/.cursor/bin to ~/.local/bin (#3058 ) The cursor installer changed its binary install location from ~/.cursor/bin/agent to ~/.local/bin/agent (as of 2026-03-25 release). Updates: - agent-setup.ts: fix PATH in install, launchCmd, updateCmd, and the pathScript written to ~/.bashrc/~/.zshrc - verify.sh: fix E2E binary check to look in ~/.local/bin first - Bump CLI to 0.27.3 -- qa/e2e-tester Co-authored-by: spawn-qa-bot <qa@openrouter.ai>	2026-03-27 02:37:40 -07:00

1 2 3 4 5 ...

263 commits