spawn

vrr/spawn

mirror of https://github.com/OpenRouterTeam/spawn.git synced 2026-05-10 12:20:07 +00:00

Author	SHA1	Message	Date
A	50dd2f26ed	fix: repair Fly.io saved token loading (_load_token_from_config misuse) (#1513 ) ensure_fly_token() called _load_token_from_config with only 1 argument (config file path) but the function requires 3 (config_file, env_var_name, provider_name). The empty env_var_name fails the security validation regex, so the function always returns 1 silently. Users with saved Fly.io tokens in ~/.config/spawn/fly.json were forced to re-authenticate every session. Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-20 03:54:41 -05:00
A	3280a44c45	feat: add browser-based OAuth login for Fly.io + token sanitizer (#1506 ) Replace the prompt-first auth flow with a browser-based CLI session flow (same as `fly auth login`). The new auth chain is: 1. Environment variable (FLY_API_TOKEN) 2. Saved config file (~/.config/spawn/fly.json) 3. flyctl CLI (`fly auth token`) 4. Browser OAuth via Fly.io CLI Sessions API (NEW) 5. Manual token prompt (last resort fallback) The browser flow creates a CLI session via POST /api/v1/cli_sessions, opens the auth URL in the user's browser, then polls for the access token. This is the same mechanism flyctl uses internally. Also add _sanitize_fly_token() to handle the Fly dashboard copy button which includes the display name before the token (e.g. "Deploy Token FlyV1 fm2_..."). The sanitizer strips everything before "FlyV1" or extracts bare "fm2_" tokens, and trims whitespace/newlines. Applied at every token entry point (env var, config, manual prompt). Co-authored-by: lab <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-19 22:50:19 -08:00
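The sanitizer behaviour described in #1506 can be sketched roughly as follows. This is a minimal illustration, not the repository's `_sanitize_fly_token()`: the function name `sanitize_fly_token_sketch` is hypothetical, and it assumes the token is the last thing in the pasted text.

```shell
#!/usr/bin/env bash
# Hypothetical sketch; NOT the repository's _sanitize_fly_token().
sanitize_fly_token_sketch() {
    local raw="$1"
    # Turn carriage returns, newlines and tabs into spaces so they can be trimmed.
    raw="$(printf '%s' "$raw" | tr '\r\n\t' '   ')"
    case "$raw" in
        *FlyV1*)
            # Drop the display name: keep everything from the first "FlyV1" on.
            raw="FlyV1${raw#*FlyV1}"
            ;;
        *fm2_*)
            # Bare token: keep everything from the first "fm2_" on.
            raw="fm2_${raw#*fm2_}"
            ;;
    esac
    # Trim leading and trailing whitespace.
    raw="${raw#"${raw%%[![:space:]]*}"}"
    raw="${raw%"${raw##*[![:space:]]}"}"
    printf '%s\n' "$raw"
}
```

Called as `sanitize_fly_token_sketch "Deploy Token FlyV1 fm2_..."`, the sketch prints the input from `FlyV1` onward with surrounding whitespace removed.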
L	d5690a8b11	feat: spawn name prompt + kebab resource naming across all clouds (#1507 ) * feat: add spawn name prompt and project confirmation to GCP flow Ask for spawn name upfront (before auth), derive kebab-case default for VM naming, and confirm the current GCP project before using it. New interaction order: 1. Spawn name: "My Dev Box" → kebab "my-dev-box" exported as GCP_INSTANCE_NAME_KEBAB 2. gcloud auth + project confirm: "Current project: X Keep? [Y/n]" If no → project picker shown 3. SSH key 4. Machine type picker (existing) 5. Zone picker (existing) 6. Instance name prompt: "Instance name [my-dev-box]: " User can press Enter to accept or type a custom name New functions: _to_kebab_case() — lowercases, replaces non-alnum with hyphens _gcp_prompt_spawn_name() — prompts for display name, exports kebab default; honours SPAWN_NAME env var set by CLI (--name flag) Modified: _gcp_resolve_project() — adds Y/n confirmation when project already set get_server_name() — shows kebab default in prompt, accepts Enter cloud_authenticate() — calls _gcp_prompt_spawn_name first Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com> * feat: add spawn name prompt to all clouds via shared/common.sh Move _to_kebab_case() and prompt_spawn_name() to shared/common.sh so all clouds get upfront spawn name prompting and kebab-based resource naming. 
shared/common.sh: + _to_kebab_case() — "My Dev Box" → "my-dev-box" + prompt_spawn_name() — asks for display name, exports SPAWN_NAME_DISPLAY and SPAWN_NAME_KEBAB; skips if already set; honours SPAWN_NAME env var from CLI --name flag ~ get_resource_name() — replaces silent SPAWN_NAME fallback with a visible prefilled default: "Enter server name [my-dev-box]: " Per-cloud changes (cloud_authenticate gains prompt_spawn_name first): hetzner, fly, aws, daytona, digitalocean, sprite — one-line change each gcp/lib/common.sh: - Remove _to_kebab_case() (now in shared) - Remove _gcp_prompt_spawn_name() (now in shared as prompt_spawn_name) ~ cloud_authenticate: _gcp_prompt_spawn_name → prompt_spawn_name ~ get_server_name: simplified back to get_validated_server_name (shared get_resource_name now shows the kebab default in the prompt) Result — every cloud shows this flow upfront: Spawn name (e.g. "My Dev Box"): My Claude Box ℹ Resource name: my-claude-box ... Enter server name [my-claude-box]: ⏎ Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com> * fix: use "Use project '...'?" instead of "Keep this project?" in GCP prompt Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>	2026-02-19 22:22:59 -08:00
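A minimal sketch of the kebab-case conversion described in #1507 (lowercase, non-alphanumerics replaced with hyphens). The name `to_kebab_case_sketch` is hypothetical; collapsing runs of separators and trimming edge hyphens are assumptions that the commit message does not spell out.

```shell
#!/usr/bin/env bash
# Hypothetical sketch; NOT the repository's _to_kebab_case().
to_kebab_case_sketch() {
    # Lowercase, then replace each run of non-alphanumerics with one hyphen
    # (assumption) and strip a leading/trailing hyphen (assumption).
    printf '%s\n' "$1" \
        | tr '[:upper:]' '[:lower:]' \
        | LC_ALL=C sed -e 's/[^a-z0-9]\{1,\}/-/g' -e 's/^-//' -e 's/-$//'
}
```

With this sketch, `to_kebab_case_sketch "My Dev Box"` prints `my-dev-box`, matching the example in the commit message.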
A	5612cda40b	feat: remove Aider, Goose, Open Interpreter, Gemini CLI, Amazon Q from matrix (#1472 ) These 5 agents are being dropped from the Spawn matrix. This removes 45 agent scripts across 9 clouds, cleans the manifest, test fixtures, READMEs, CLI source, and shared library comments. Co-authored-by: lab <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-19 12:31:00 -05:00
A	f3ffb6caed	fix: broken error message in multi-creds validation, predictable temp path (#1442 ) 1. _multi_creds_validate referenced undefined help_url variable, causing empty "Get new credentials from: " error messages when OVH credential validation fails. Added help_url as parameter and pass it from caller. 2. _spawn_inject_env_vars (used by 130+ agent scripts via spawn_agent) uploaded credentials to static /tmp/env_config path. The older inject_env_vars_ssh/inject_env_vars_cb functions document this as a symlink attack vector and use randomized paths. Fixed to match. 3. Removed dead inject_env_vars_fly and inject_env_vars_sprite functions (all agent scripts now use spawn_agent -> _spawn_inject_env_vars). Agent: code-health Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-18 07:51:28 -05:00
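The predictable-path problem fixed in #1442 is usually avoided with `mktemp`. The sketch below is a local illustration only: the function name and the `env_config.XXXXXX` template are assumptions, and the repository's `_spawn_inject_env_vars` operates on a remote host rather than the local filesystem.

```shell
#!/usr/bin/env bash
# Hypothetical sketch of a randomized temp path; names are illustrative.
make_private_temp_file() {
    local f
    # mktemp replaces the X's with random characters and creates the file atomically,
    # so an attacker cannot pre-create a symlink at a known path.
    f="$(mktemp "${TMPDIR:-/tmp}/env_config.XXXXXX")" || return 1
    chmod 600 "$f"
    printf '%s\n' "$f"
}
```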
Ahmed Abushagur	db4aaa0c73	fix: prevent SSH hangs, fix command escaping, pin Python 3.12 for aider (#1439 ) * fix: use uv --upgrade to ensure Python 3.13-compatible Pillow across all clouds aider-chat on Python 3.13 fails with `ImportError: cannot import name '_imaging' from 'PIL'` when an old Pillow version (pre-10.4) is resolved — those releases have no Python 3.13 binary wheels, so the C extension is missing at runtime. Replace `--with 'Pillow>=10.2.0'` (which was silently broken — the `>` and single quotes get mangled by `printf '%q'` in run_server before the command reaches the remote machine) with `--upgrade`, which forces all transitive deps including Pillow to their latest compatible versions. Also adds a plain-text echo before the install so users see progress instead of a silent hang during the 2-4 minute install. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * test: update aider/gptme/interpreter assertions from pip to uv The install method for aider, gptme, and open-interpreter was changed from pip to `uv tool install` across all clouds. The mock test assertions still checked for the old `pip.install.` patterns, causing 9 failures (3 agents × 3 clouds). Update patterns to match the actual `uv tool install` commands now used in all cloud scripts. 
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * ci: trigger test run for uv assertion fix * fix: prevent SSH hangs, restore stderr, fix command escaping across clouds - Add < /dev/null to ssh_run_server and generic_ssh_wait to prevent SSH stdin theft causing sequential install/verify/configure steps to hang - Add ServerAliveInterval, ServerAliveCountMax, ConnectTimeout to default SSH_OPTS so long-running installs don't silently drop on flaky networks - Remove 2>/dev/null from Fly.io run_server so remote command errors are no longer silently swallowed (--quiet flag still suppresses flyctl noise) - Fix Fly.io printf '%q' double-quoting: remove extra quotes around $escaped_cmd that prevented the remote shell from consuming escapes, breaking && \|\| \| operators in commands - Remove broken printf '%q' from Daytona run_server and interactive_session where it escaped shell operators into literal characters since daytona exec has no intermediate shell layer - Pin aider to --python 3.12 instead of --with audioop-lts across all clouds Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: add --pty to fly ssh console for interactive sessions fly ssh console -C does not allocate a pseudo-terminal by default, causing interactive TUI agents (aider, claude) to fail with "Input is not a terminal (fd=0)" or completely unresponsive input. Adding --pty forces PTY allocation, matching how other clouds handle interactive sessions (SSH uses -t, Sprite uses -tty). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-18 04:23:15 -05:00
Ahmed Abushagur	633ce8eaac	feat: upgrade default server sizes, fix Fly.io agent installs, improve E2E tests (#1428 ) - Upgrade default VM sizes across clouds for better agent performance: - Hetzner: cpx11 → cx23 (with cx22 fallback support for deprecated types) - DigitalOcean: s-2vcpu-2gb → s-2vcpu-4gb - Daytona: 2048MB → 4096MB memory - Oracle: VM.Standard.E2.1.Micro → VM.Standard.A1.Flex - OVH: d2-2 → d2-4 - Fix Fly.io agent failures: - Add Node.js + build-essential to wait_for_cloud_init (fixes npm-based agents) - Prepend PATH in interactive_session (fixes "source not found" errors) - Fix openclaw installs across clouds: use explicit PATH export instead of source - Fix DigitalOcean token validation (check "uuid" not "id") - Fix AWS cloud-init: chown .bashrc/.zshrc to ubuntu user - Improve Hetzner fallback: add "cheapest available" as last-resort fallback - Upgrade E2E tests: per-combo auto-fix, credential collection, robustness fixes Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-17 22:17:08 -08:00
Ahmed Abushagur	22b6a402f4	feat: E2E test harness, QA pipeline integration, macOS compat linter (#1425 ) * feat: add QA upgrade — macOS compat linter, per-agent mock assertions Layer 1: macOS compat linter (test/macos-compat.sh) - 12 rules (MC001–MC012) catching bash 3.2 incompatibilities - Detects: base64 -w0 file args, non-portable echo flags, source <(), ((var++)), read -d, nounset flag, sed -i, date %N, local -n, declare -A, ${var,,}, and \|& - Added to CI lint.yml in warn-only mode for burn-in - Integrated as Phase 0.5 in qa-dry-run.sh Layer 2: Per-agent mock assertions - test/fixtures/_shared_agent_assertions.sh with install checks for all 15 agents (claude, openclaw, aider, goose, etc.) - Integrated into test/mock.sh via _run_agent_assertions() Also includes branch fixes: - Fix base64 -w0 to use stdin redirect (aws, daytona, fly) - Fix fly/openclaw to use npm install instead of broken curl\|bash Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * feat: add E2E test harness and integrate into QA pipeline Add test/e2e.sh — a full E2E test harness that provisions real servers, installs agents, and verifies setup across all clouds. 
Features: - Smoke test (one canary agent per cloud) and full matrix modes - Credential auto-detection for 8 clouds - Per-cloud preflight validation (sequential) then parallel agent tests - Stale server cleanup, timing history, cross-cloud comparison - Auto-fix and optimization phases via Claude agents - macOS bash 3.2 compatible Integrate E2E as Phase 5 in both qa-cycle.sh and qa-dry-run.sh: - Runs after mock tests pass, gated on cloud credentials - Phase 5b auto-fixes failures using per-agent worktree branches - Parses results and includes in QA summary Also fixes: - shared/common.sh: honour SPAWN_NON_INTERACTIVE=1 in safe_read() - aws/lib/common.sh: fix SSH key import (use cat instead of base64, handle race condition on concurrent imports) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-17 20:41:07 -05:00
A	3e13a213f1	security: fix command injection in fly/lib/common.sh bash -c invocations (#1423 ) Quote $escaped_cmd inside the -C argument to bash -c in run_server() and interactive_session() to prevent word splitting. Without quotes, even though printf '%q' escapes shell metacharacters, the shell still splits the escaped command on whitespace before passing it to bash -c, enabling potential argument injection. Fixes #1422 Agent: security-auditor Co-authored-by: B <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-17 19:35:23 -05:00
Ahmed Abushagur	14d36d1e1d	fix: Fly.io SSH reliability and app name UX (#1388 ) * fix: re-prompt on taken Fly.io app names + timeout run_server Two fixes for Fly.io UX: 1. When app name is globally taken by another user, re-prompt instead of failing. Returns exit code 2 from _fly_create_app so create_server can loop with a new name. 2. run_server now has a 5-minute timeout (portable, no coreutils needed) to prevent indefinite hangs like the 3-hour SSH session stall. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: wait for SSH before installing tools on Fly.io The previous wait_for_cloud_init immediately ran apt-get via fly ssh console on a machine that wasn't SSH-reachable yet, causing indefinite hangs. Now: 1. _fly_wait_for_ssh polls with a 30s-timeout echo until SSH responds 2. Shows progress at each step instead of suppressing all output 3. Each run_server call has an explicit timeout (10min for apt, 2min for bun, 30s for PATH exports) 4. Retries package install once on timeout Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: run fly ssh console in foreground, not background fly ssh console breaks when backgrounded with & — it needs a foreground process to establish the connection. Reverted to foreground execution and use timeout/gtimeout when available (Linux/CI). On macOS where timeout isn't available, the user can Ctrl+C hung commands. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: ensure bun PATH is available in non-interactive fly ssh sessions Ubuntu's default .bashrc returns early for non-interactive shells, so "source ~/.bashrc && bun install -g openclaw" silently fails — the PATH line at the bottom of .bashrc is never reached. Fix by prepending ~/.bun/bin to PATH in run_server() so all remote commands have access to tools installed during wait_for_cloud_init. Also fix spawn_agent to explicitly handle agent_install failure instead of relying on set -e (which exits silently). 
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-17 05:54:34 -05:00
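The "use timeout/gtimeout when available" approach from #1388 could look something like the sketch below. The wrapper name `run_with_optional_timeout` is hypothetical; the command always runs in the foreground of the wrapper, and without either tool it runs with no time limit, as the commit message describes for macOS.

```shell
#!/usr/bin/env bash
# Hypothetical sketch; NOT the repository's run_server().
run_with_optional_timeout() {
    local seconds="$1"
    shift
    if command -v timeout >/dev/null 2>&1; then
        timeout "$seconds" "$@"      # GNU coreutils (Linux/CI)
    elif command -v gtimeout >/dev/null 2>&1; then
        gtimeout "$seconds" "$@"     # coreutils installed with a "g" prefix
    else
        "$@"                         # no timeout tool: run unbounded
    fi
}
```

For example, `run_with_optional_timeout 300 some_command arg` would cap `some_command` at five minutes where a timeout tool exists.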
Ahmed Abushagur	999751537d	fix: validate saved tokens + handle FlyV1 auth scheme (#1386 ) * fix: validate saved API tokens before use Tokens loaded from config files (e.g. ~/.config/spawn/fly.json) were never validated, so expired or revoked tokens would silently pass through and only fail at the point of use (e.g. app creation). Now the provider's test function runs on config-file tokens too, falling through to a fresh prompt if validation fails. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: handle FlyV1 token auth scheme for Fly.io Machines API Fly.io dashboard tokens use the format "FlyV1 fm2_..." where "FlyV1" is the authorization scheme itself, not a Bearer token prefix. The script was always sending "Authorization: Bearer FlyV1 fm2_..." which the API rejects with "token validation error". Now detects FlyV1-prefixed tokens and sends them as "Authorization: FlyV1 fm2_..." using custom auth headers. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: make refactor service actually run reliably Three fixes for the refactor workflow that was producing zero PRs: 1. community-coordinator: Gemini → Sonnet — Gemini doesn't support the Task tool, causing a respawn on every single cycle 2. Monitoring loop: replace "sleep 5" (which drifted to sleep 30) with explicit short-sleep instructions and CRITICAL rule that every turn must include a tool call to stay alive 3. Lifecycle management: explicit shutdown sequence with retry, preventing early exit that orphans teammates Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-17 04:31:46 -05:00
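The header selection described in #1386 can be illustrated with a small helper. This is a sketch under the assumption that a token either starts with the literal scheme `FlyV1 ` or is a plain bearer token; `fly_auth_header_sketch` is a hypothetical name, not a function from the repository.

```shell
#!/usr/bin/env bash
# Hypothetical sketch of choosing the Authorization header for a Fly.io token.
fly_auth_header_sketch() {
    local token="$1"
    case "$token" in
        "FlyV1 "*)
            # "FlyV1" is itself the authorization scheme; send the token as-is.
            printf 'Authorization: %s\n' "$token"
            ;;
        *)
            # Anything else is treated as a Bearer token.
            printf 'Authorization: Bearer %s\n' "$token"
            ;;
    esac
}
```

The result can be passed to `curl -H "$(fly_auth_header_sketch "$FLY_API_TOKEN")"` when calling the API.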
A	8d533d3908	fix: add error handling for critical ID/IP extraction failures (#1323 ) Prevent silent failures when cloud API responses don't contain expected server/instance IDs or IPs. Without these checks, scripts would continue with empty variables, leading to cryptic failures downstream (e.g., "ssh root@" or API calls with empty IDs). Changes: - fly: Check FLY_MACHINE_ID after extraction, fail fast with clear error - ovh: Check OVH_INSTANCE_ID after extraction, fail fast with clear error - hetzner: Check HETZNER_SERVER_ID and HETZNER_SERVER_IP (+ null check for jq) - digitalocean: Check DO_DROPLET_ID after extraction, fail fast with clear error Impact: Improves reliability by catching API response parsing failures immediately rather than propagating empty values to SSH/API calls. Agent: code-health Co-authored-by: spawn-bot <bot@openrouter.ai> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-16 20:22:48 -05:00
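A fail-fast check of the kind #1323 describes might be sketched as below. The helper name `require_nonempty_id` and the error wording are assumptions; treating the string `null` as missing mirrors the jq null check mentioned for Hetzner.

```shell
#!/usr/bin/env bash
# Hypothetical sketch of a fail-fast guard after extracting an ID or IP.
require_nonempty_id() {
    local name="$1" value="$2"
    # jq prints "null" for a missing field, so treat that like an empty value.
    if [ -z "$value" ] || [ "$value" = "null" ]; then
        printf 'Error: %s is empty; the API response lacked the expected field\n' "$name" >&2
        return 1
    fi
}
```

A caller would run `require_nonempty_id FLY_MACHINE_ID "$FLY_MACHINE_ID" || return 1` immediately after parsing the response, before any SSH or API call uses the value.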
Ahmed Abushagur	758b575658	feat: add server lifecycle management (reconnect + delete) (#1363 ) Wire up connection tracking across all 10 clouds so users can reconnect to and delete previously spawned servers via `spawn list` and `spawn delete`. Phase 1 - Connection tracking: - Extend save_vm_connection() with cloud and metadata params - Add save_vm_connection to create_server() in all cloud libs - Extend VMConnection with cloud, deleted, deleted_at, metadata fields Phase 2 - Delete via interactive picker: - Add "Delete this server" option to spawn list picker - Build delete scripts that reuse each cloud's destroy_server() - Confirmation UX with spinner feedback - Soft-delete marking in history (deleted records show [deleted]) Phase 3 - Standalone delete command: - spawn delete (aliases: rm, destroy) with interactive picker - Filter support: spawn delete -a <agent> -c <cloud> Also improves reconnect hints for Fly (fly ssh console) and Daytona (daytona ssh) connections. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-16 17:06:49 -08:00
A	ec81c74594	refactor: introduce cloud adapter + spawn_agent runner system (#1340 ) Eliminate ~70% boilerplate across 149 agent scripts by introducing a standard cloud_* adapter interface and spawn_agent orchestration runner. Each cloud's lib/common.sh now exports 7 adapter functions (cloud_authenticate, cloud_provision, cloud_wait_ready, cloud_run, cloud_upload, cloud_interactive, cloud_label) that wrap cloud-specific operations behind a uniform interface. Agent scripts define hooks (agent_install, agent_env_vars, agent_launch_cmd, etc.) and call `spawn_agent "Agent Name"` — the runner handles the full deployment flow: auth → provision → wait → install → API key → env → config → launch. - shared/common.sh: add spawn_agent(), _fn_exists(), _spawn_inject_env_vars() - 10 cloud lib/common.sh files: add cloud_* adapter functions - 149 agent scripts: rewrite to hook pattern (~40-80 lines → ~20-35 lines) - test/run.sh: update 2 sprite test patterns for new adapter paths - Net reduction: ~4,300 lines (2,257 added, 6,563 removed) Co-authored-by: lab <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-16 16:25:44 -08:00
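The adapter-plus-hooks pattern from #1340 can be sketched as a runner that calls each step only if it is defined. This is a simplified illustration: `spawn_agent_sketch` and `fn_exists` are hypothetical stand-ins for the repository's `spawn_agent()` and `_fn_exists()`, and only a subset of the flow (auth, provision, wait, install, launch) is shown.

```shell
#!/usr/bin/env bash
# Hypothetical sketch; NOT the repository's spawn_agent().
fn_exists() {
    # declare -F succeeds only when a shell function with that name exists.
    declare -F "$1" >/dev/null
}

spawn_agent_sketch() {
    local step
    # Cloud adapter functions first, then the agent's own hooks.
    for step in cloud_authenticate cloud_provision cloud_wait_ready \
                agent_install agent_launch_cmd; do
        if fn_exists "$step"; then
            "$step" || { printf 'step failed: %s\n' "$step" >&2; return 1; }
        fi
    done
}
```

An agent script in this style defines only the hooks it needs and then calls the runner; undefined hooks are skipped and the first failing step aborts the flow.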
A	3030b1d036	fix: revert .profile writes, use explicit PATH in launch commands (#1260 ) Stop writing env vars to ~/.profile and ~/.bash_profile — only write to .bashrc and .zshrc. The .profile approach caused issues because login shells source it inconsistently across distros, and creating .bash_profile makes bash -l skip .profile entirely. Replace `bash -lc claude` launch commands with explicit PATH export + source pattern across all cloud providers. This ensures claude is found regardless of shell initialization quirks. Co-authored-by: lab <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-16 00:43:49 -08:00
A	46e6f46008	fix: stop creating ~/.bash_profile — was destroying system PATH (#1258 ) On Ubuntu/Debian, ~/.bash_profile doesn't exist by default. When bash starts as a login shell (bash -l), it sources the FIRST file it finds from: ~/.bash_profile, ~/.bash_login, ~/.profile. Since only ~/.profile exists, that's what gets sourced — and ~/.profile sets up the standard PATH (/usr/bin, /bin, etc.) and sources ~/.bashrc. Our inject_env_vars_* functions and _finalize_claude_install were writing to ~/.bash_profile and ~/.zprofile (either via touch+append or via for-loop over all rc files). Creating ~/.bash_profile caused bash -l to source it INSTEAD of ~/.profile, completely losing the standard PATH setup. After deployment, even basic commands like `ls` would fail. Fix: Only write to ~/.profile, ~/.bashrc, ~/.zshrc across all clouds (shared, fly, sprite). These are the standard files that work correctly on all Linux distros without breaking the shell initialization chain. Co-authored-by: lab <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-16 00:27:28 -08:00
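The lookup order explained in #1258 can be reproduced with a small helper that reports which file a bash login shell would read for a given home directory. The function name is hypothetical; the order (`~/.bash_profile`, `~/.bash_login`, `~/.profile`, first readable one wins) is the one the commit message gives.

```shell
#!/usr/bin/env bash
# Hypothetical sketch: which startup file would `bash -l` pick in this home dir?
first_login_startup_file() {
    local home_dir="$1" f
    for f in .bash_profile .bash_login .profile; do
        if [ -r "$home_dir/$f" ]; then
            printf '%s\n' "$f"
            return 0
        fi
    done
    return 1   # none of the three files exists
}
```

On a home directory containing only `.profile` the sketch prints `.profile`; once a `.bash_profile` is created there, it prints `.bash_profile`, which is the shadowing effect the commit fixes.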
A	99b21e2797	fix: write env config to all shell startup files including .bash_profile (#1251 ) Root cause: bash -l sources the FIRST of ~/.bash_profile, ~/.bash_login, ~/.profile. If ~/.bash_profile exists (e.g. from cloud-init), ~/.profile is never read and our claude PATH exports are invisible. Additionally, .bashrc has a non-interactive guard that skips exports when sourced from non-interactive shells like `ssh host "cmd"` or `bash -lc`. Fix: write env config and PATH entries to ALL shell startup files: ~/.profile, ~/.bash_profile, ~/.bashrc, ~/.zshrc, ~/.zprofile. This ensures both login and interactive shells on any platform find claude. Co-authored-by: lab <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-16 00:04:36 -08:00
A	dac4c62d6c	fix: try bun before npm for Claude Code install, fix PATH in launch (#1249 ) Two fixes: 1. Swap fallback order from curl → npm → bun to curl → bun → npm. Bun is faster and typically pre-installed. Use `bun i -g`. 2. Fix "claude: command not found" at launch. The default .bashrc has a non-interactive guard (`case $- in *i*) ;; *) return;; esac`) that skips PATH exports when sourced from SSH command strings. Fix: write env config to ~/.profile (always sourced by login shells) in addition to .bashrc/.zshrc, and launch with `bash -lc claude` which starts a login shell that sources ~/.profile. Co-authored-by: lab <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-15 23:44:02 -08:00
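The non-interactive guard mentioned in #1249 tests whether the shell's option flags (`$-`) contain `i`. A minimal sketch of that test, taking the flags as an argument so it can be tried with any value; the function name is hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical sketch of the check behind the .bashrc guard.
shell_flags_are_interactive() {
    case "$1" in
        *i*) return 0 ;;   # interactive: the rest of .bashrc would run
        *)   return 1 ;;   # non-interactive: .bashrc returns early
    esac
}
```

Calling `shell_flags_are_interactive "$-"` inside a command run as `ssh host "cmd"` returns non-zero, which is why exports placed below the guard never take effect there.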
L	d8ac64863d	fix: inject env vars into both .bashrc and .zshrc, fix PATH across all clouds (#1213 ) API keys and env vars were only written to .zshrc, so SSH sessions using bash couldn't find credentials. Also fixes incorrect ~/.claude/local/bin PATH (claude installs to ~/.local/bin) and syncs interactive_session PATH with cloud-init PATH across all 9 clouds. Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-15 17:30:40 -08:00
A	9336998168	fix(ux): add post-session summary to 10 exec-based cloud providers (#1056 ) Users on exec-based clouds (Fly, Render, Koyeb, Northflank, Railway, Modal, Daytona, E2B, CodeSandbox, GitHub Codespaces) got no warning when their session ended that their service was still running and incurring charges. This adds: - _show_exec_post_session_summary() in shared/common.sh for non-SSH providers that use CLI exec commands instead of direct SSH - SPAWN_DASHBOARD_URL for all 10 exec-based clouds so users get actionable dashboard links - Post-session summary calls in each cloud's interactive_session() - 33 new tests covering the exec post-session summary feature Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-14 00:38:10 -05:00
A	d2fbd325b0	refactor: decompose fly get_server_name and oracle _setup_vcn_networking (#1000 ) - fly/lib/common.sh: Replace 23-line get_server_name() that duplicated env-var-check, prompt, and validation logic with a one-line call to the shared get_validated_server_name helper, matching all other cloud providers. - oracle/lib/common.sh: Break _setup_vcn_networking (48 lines, 3 distinct responsibilities) into focused helpers: - _create_internet_gateway: creates the IGW resource - _add_default_route: configures the route table - _add_ssh_security_rules: opens SSH port in the security list The orchestrator _setup_vcn_networking now delegates to these three helpers. Agent: complexity-hunter Co-authored-by: A <6723574+louisgv@users.noreply.github.com>	2026-02-13 12:57:11 -08:00
A	a0f6b335a4	fix: harden upload_file path validation with strict allowlist regex across 10 clouds (#993 ) Replace fragile blocklist validation and printf '%q' escaping in upload_file() with strict allowlist regex [a-zA-Z0-9/_.~-]+ across all non-SSH cloud providers. For codesandbox, additionally migrate from shell command interpolation to SDK filesystem API via environment variables, eliminating the injection surface entirely. Affected clouds: codesandbox, daytona, e2b, fly, koyeb, modal, northflank, railway, render, sprite Fixes #989 Agent: security-auditor Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 12:20:40 -08:00
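The allowlist validation from #993 can be sketched with bash's regex match. The character class is the one quoted in the commit message; anchoring it with `^` and `$` and the function name `is_safe_remote_path` are assumptions made for this illustration.

```shell
#!/usr/bin/env bash
# Hypothetical sketch of allowlist path validation; NOT the repository's upload_file().
is_safe_remote_path() {
    local path="$1"
    # Keep the pattern in a variable so the shell does not reinterpret it.
    local re='^[a-zA-Z0-9/_.~-]+$'
    [[ "$path" =~ $re ]]
}
```

Paths such as `/root/.config/app.json` or `~/.bashrc` pass, while anything containing spaces, quotes, `$`, `;` or backticks is rejected before it can reach a command string.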
A	5d69dc4192	fix: add actionable guidance to error messages across 10 cloud providers (#962 ) Improve error messages in cloud provider lib/common.sh files to include specific troubleshooting steps, dashboard URLs, and environment variable hints instead of bare "Failed" messages. Providers improved: Netcup, IONOS, CloudSigma, Northflank, UpCloud, Fly.io, RamNode, OVH, Civo, Scaleway. Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 10:10:45 -08:00
A	a0d1d1b40b	fix: replace jargon "Remediation" with plain "How to fix" in error messages (#925 ) Replace technical "Remediation steps:" with "How to fix:" and "Remediation: Check <url>" with "Check your dashboard: <url>" across 14 cloud providers for clearer error guidance. Add actionable error messages to Atlantic.Net create_server and SSH key registration failures. Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-13 05:52:31 -08:00
A	b2dd67a0af	refactor: extract helpers to reduce complexity in fly and netcup providers (#912 ) fly/lib/common.sh: - Extract _get_fly_cmd() to eliminate duplicated fly/flyctl CLI resolution across run_server, interactive_session, _try_flyctl_auth, ensure_fly_cli - Extract _fly_parse_error() to deduplicate JSON error parsing (was inline in _validate_fly_token, _fly_create_app, _fly_create_machine) - Extract _fly_build_machine_body() from _fly_create_machine (50→32 lines) - Use shared _extract_json_field in _fly_create_machine and _fly_wait_for_machine_start instead of inline python3 calls netcup/lib/common.sh: - Extract _netcup_is_success() for repeated status=='success' checks (was inline python3 in create_server, destroy_server, _netcup_wait_for_ip) - Extract _netcup_build_login_body() from netcup_get_session (51→30 lines) - Use _extract_json_field throughout instead of inline python3 one-liners - Net reduction: 351→335 lines (-16) Agent: complexity-hunter Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 05:07:53 -08:00
A	6c7ced54dd	fix: replace log_warn with log_step/log_info for non-warning messages (#604 ) Agent: ux-engineer Many shell scripts misused log_warn (yellow) for normal progress/status messages, making routine operations appear alarming. This fixes 59 files: - Progress messages -> log_step (cyan): "Injecting environment variables...", "Attaching volume...", "Powering on instance...", "Retrieving server IP...", "Terminating sandbox/server...", "Creating datacenter...", "Importing SSH key...", "Deleting service/app...", "Modal not authenticated. Running setup..." - Informational notices -> log_info (green): WhatsApp QR code authentication notices (30 nanoclaw scripts), codespace delete hints (14 scripts), "Appending environment variables to ~/.zshrc..." (6 local scripts), credential prompt hints, package update skipped, app reuse notices Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-12 03:24:30 -08:00
A	c8d7ea23e6	refactor: simplify BinaryLane wait loop and fix log_warn in 7 cloud polling loops (#538 ) Replace 25-line custom _binarylane_wait_for_active with 4-line generic_wait_for_instance call, matching the pattern used by 7 other clouds (DigitalOcean, Vultr, Linode, etc). Change log_warn to log_step for status/progress messages in polling loops across 7 cloud providers (aws-lightsail, exoscale, fly, kamatera, latitude, ovh, scaleway). These are normal status updates, not warnings. Agent: complexity-hunter Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Haiku 4.5 <noreply@anthropic.com>	2026-02-11 14:59:51 -08:00
A	0835b35a36	fix: use log_step (cyan) for progress messages instead of log_warn (yellow) (#534 ) ~1500 progress messages across 481 files were using log_warn (yellow) for normal status updates like "Installing...", "Setting up...", "Creating server...", etc. This made users think something was wrong when everything was proceeding normally. Changes: - Replace log_warn with log_step for all progress/status messages - Keep log_warn only for actual warnings (errors, remediation hints) - Remove emoji from 3 sprite completion messages Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com>	2026-02-11 14:37:43 -08:00
A	3d274bf3d2	fix: escape shell commands and sanitize JSON to prevent injection (#463 ) - Add printf %q command escaping to run_server/interactive_session in Koyeb, Render, Railway, and GitHub Codespaces (matching pattern used by E2B, Daytona, Northflank, Fly, and other providers) - Use json_escape in exchange_oauth_code to prevent JSON injection via crafted OAuth codes in shared/common.sh - Use json_escape in Fly.io _fly_create_app to prevent JSON injection via FLY_ORG env var, plus add validation for org slug format - Pass Fly.io _fly_create_machine values via env vars instead of Python string interpolation to prevent code injection Agent: security-auditor Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-11 07:20:41 -08:00
A	f1e8d946df	fix: secure upload_file functions against command injection in 5 clouds (#453 ) Replace unsafe printf '%q'-escaped unquoted variables with validated single-quoted embedding in upload_file() for fly, northflank, daytona, e2b, and koyeb. The previous pattern used unquoted $escaped_content and $escaped_path in command strings passed to bash -c or run_server, which could allow command injection via crafted filenames. The fix: - Validates remote_path rejects unsafe chars (', $, `, newlines) - Uses base64 content directly (alphanumeric + /+= is shell-safe) - Single-quotes both content and path in the command string - Uses printf '%s' instead of echo for safer output Matches the pattern already used by render, modal, and railway. Agent: security-auditor Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-11 06:28:45 -08:00
A	ccd7ff013a	refactor: reduce complexity by extracting shared interactive_pick() and using ensure_api_token_with_provider() (#411 ) - Extract interactive_pick() to shared/common.sh: generic numbered-menu picker that replaces 4 duplicate _pick_location/_pick_server_type/_pick_plan functions across hetzner and hostinger (156 lines -> 71 lines) - Replace ensure_fly_token() (53 lines) with ensure_api_token_with_provider() plus a flyctl CLI auth pre-check (17 lines) - Replace ensure_render_api_key() (38 lines + _save_render_api_key 8 lines) with ensure_api_token_with_provider() (6 lines) Net reduction: 156 lines removed across 5 files. No functionality changes. Agent: complexity-hunter Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Haiku 4.5 <noreply@anthropic.com>	2026-02-11 01:42:22 -08:00
A	4f23276338	refactor: reduce complexity in Fly, Koyeb, and Railway providers (#293) - Split _fly_create_and_start_machine (70 lines) into _fly_create_machine and _fly_wait_for_machine_start for single-responsibility - Replace ensure_koyeb_token (38 lines) with ensure_api_token_with_provider - Replace ensure_railway_token (37 lines) with ensure_api_token_with_provider - Remove _save_koyeb_token and _save_railway_token (handled by shared helper) Net reduction: ~80 lines of duplicated code Agent: complexity-hunter Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-10 15:30:32 -08:00
Sprite	cf46b42e3f	fix: Remove double-quoting in json_escape printf callers json_escape() returns a fully-quoted JSON string (e.g. "value") via Python's json.dumps(). Callers using printf templates were wrapping the result in additional quotes ("%s"), producing invalid JSON like ""value"". Remove the redundant quotes from all printf format strings so json_escape's quotes are used directly. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-10 20:04:30 +00:00
A	77db796aff	refactor: Decompose create_server in Scaleway and Fly.io providers (#171) Break down the two longest create_server functions (104 and 102 lines) into focused sub-functions for readability and reusability: Scaleway (104 -> 53 lines): - Extract _scaleway_extract_ip() for IP parsing from server response - Extract _scaleway_power_on_and_wait() for power-on + polling loop Fly.io (102 -> 14 lines): - Extract _fly_create_app() for app creation with "already exists" handling - Extract _fly_create_and_start_machine() for machine lifecycle Also fix ((attempt++)) to attempt=$((attempt + 1)) in Fly.io to avoid potential set -e failures when attempt is 0. Agent: complexity-hunter Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-10 07:18:00 -08:00
A	a24dc101e3	fix: Eliminate heredoc injection, eval, and API key exposure (#108) - Replace unquoted heredocs with printf + json_escape for all JSON config files containing credentials (8 cloud providers + shared lib) - Replace eval with printf -v for safe indirect variable assignment - Move RunPod API key from URL query param to api-key header Fixes #104, Fixes #105, Fixes #106 Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-09 11:19:34 -08:00
A	27973bfb28	refactor: Reduce complexity across CLI and cloud provider libs (#103) * refactor: Extract duplicated prompt flag parsing into extractFlagValue helper The --prompt and --prompt-file argument extraction in main() shared identical patterns for flag detection, value validation, and args splicing. Extracted into a reusable extractFlagValue() function that handles all three concerns. Agent: complexity-hunter * refactor: Consolidate multiple python3 JSON reads into single calls OVH, Kamatera, and UpCloud each spawned separate python3 processes to read different fields from the same JSON config file. Consolidate into a single python3 call per file, printing all fields at once and reading them with bash read. Also fixes OVH using string interpolation for the file path instead of the safer sys.argv[1] pattern. Agent: complexity-hunter * refactor: Extract flyctl auth and token validation from ensure_fly_token Split the 75-line ensure_fly_token into focused helpers: - _try_flyctl_auth: encapsulates flyctl CLI token retrieval - _validate_fly_token: encapsulates API validation with error reporting The main function is now a clear sequential flow of token source attempts. Agent: complexity-hunter * refactor: Deduplicate retry backoff logic in kamatera_api The two error branches (network error and HTTP 429/503) had identical interval update and attempt increment code. Restructure with early return for success, then unified backoff at the end of the loop. Agent: complexity-hunter * refactor: Remove unnecessary async IIFE wrapper in validateAndGetAgent The function wrapped its body in `return (async () => { ... })()` when it can simply be declared as `async function` directly. Agent: complexity-hunter --------- Co-authored-by: A <6723574+louisgv@users.noreply.github.com>	2026-02-09 10:26:03 -08:00
A	b0f924b511	fix: Prevent Python/shell injection via env vars and triple-quote strings (#102) - Fix triple-quote injection in SSH keys (Scaleway, UpCloud), userdata (BinaryLane), init scripts (Civo, Kamatera), and GraphQL queries (RunPod) by passing data via stdin/json_escape instead of inline string interpolation - Add input validation for all cloud provider env vars (region, type, plan, etc.) using validate_region_name/validate_resource_name to block shell metacharacters before they reach Python string interpolation - Validate Modal image name as Python identifier to prevent code injection - Validate numeric env vars (RAM, GPU count, disk size) across all providers Affects: 19 cloud provider lib/common.sh files Agent: security-auditor Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-09 10:22:39 -08:00
A	2915d7bca6	fix: Improve CLI error handling, fix bash compat, and update cloud READMEs (#90) - Show clear error when --prompt/-p or --prompt-file is used without a value (previously silently ignored) - Fix --prompt-file splice index bug when used after --prompt - Replace echo -e with printf in fly/lib/common.sh for macOS bash 3.x compatibility - Fix incorrect env var name in README (DIGITALOCEAN_TOKEN -> DO_API_TOKEN) - Add missing agent entries (gptme, OpenCode, Plandex) to 11 cloud READMEs - Add all 13 agents to Civo README (previously only had 3) Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-09 09:33:57 -08:00
A	bbbe815035	refactor: Security fixes, complexity reduction, and UX improvements (#58) Security: - Fix command injection in modal/lib/common.sh (run_server, upload_file, interactive_session) - Fix command injection in fly/lib/common.sh (run_server, upload_file, interactive_session) - All container providers now use printf '%q' for proper shell escaping Complexity: - Extract _api_should_retry_on_error() helper in shared/common.sh (-19 lines) - Refactor scaleway_api and upcloud_api to use shared retry helper (-24 lines) - Extract _save_fly_token() helper in fly/lib/common.sh (-11 lines) - Extract validateAndGetAgent() in commands.ts, reducing cmdRun/cmdAgentInfo duplication - Refactor cmdList column width calculation to use calculateColumnWidth() UX: - Add actionable next steps to error messages in shared/common.sh - Improve CLI bash fallback error messages with guidance (spawn.sh) - Add OAuth progress indicator during browser authentication wait - Show invalid model ID value and link to openrouter.ai/models - Add troubleshooting steps for agent installation failures Tests: - Update test assertions in test/run.sh to match refactored patterns - All tests passing: 74 TypeScript + 75 bash = 149 total, 0 failures Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-08 17:09:27 -08:00
Sprite	8f37ce3649	refactor: Automated improvements from cycle 1 Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-08 06:02:07 +00:00
Sprite	6fdfe1b014	refactor: Extract ENV_TEMP pattern to provider-specific inject functions Completed ENV_TEMP pattern extraction across remaining providers: 1. Modal: gptme.sh (1 script) - uses inject_env_vars_local 2. GCP: all 10 agent scripts - uses inject_env_vars_ssh 3. Fly.io: all 11 agent scripts - uses new inject_env_vars_fly - Added inject_env_vars_fly() to fly/lib/common.sh - Handles both .bashrc and .zshrc (Fly-specific requirement) 4. Sprite: amazonq, cline, gemini (3 scripts) - uses inject_env_vars_sprite Total scripts converted in this commit: 25 Total scripts converted in Round 25 Task #1: 78 scripts Each conversion replaces 11-15 lines of temp file management with a single function call that handles creation, permissions, content generation, upload, sourcing, and cleanup. The only remaining ENV_TEMP patterns are DOTENV_TEMP in nanoclaw scripts, which are agent-specific .env files and should remain as-is. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-08 04:15:02 +00:00
L	b6ee6b6ab1	Add guardrails: CLAUDE.md rules, hooks, pre-commit validation (#33) * feat: add gptme agent to spawn matrix Add gptme (https://github.com/gptme/gptme) - a personal AI agent in the terminal with tools for code editing, terminal commands, web browsing, and more. Natively supports OpenRouter via OPENROUTER_API_KEY. - Add gptme agent entry to manifest.json with OpenRouter env vars - Implement sprite/gptme.sh deployment script - Implement hetzner/gptme.sh deployment script - Add "missing" matrix entries for remaining 8 clouds - Update README.md with usage instructions for Sprite and Hetzner Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: add Fly.io cloud provider with claude and aider agents Add Fly.io as a new cloud provider using the Machines REST API for provisioning and flyctl CLI for SSH access. Docker-based machines with pay-per-second pricing. - Create fly/lib/common.sh with Fly.io Machines API integration - Implement fly/claude.sh for Claude Code deployment - Implement fly/aider.sh for Aider deployment - Update README.md with Fly.io usage instructions and env vars Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: add gemini, amazonq, cline, gptme to Fly.io Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: add openclaw, nanoclaw, goose, codex, interpreter to Fly.io Implements 5 new agent scripts for the Fly.io cloud provider: - fly/openclaw.sh: OpenClaw with gateway + TUI, model selection, config - fly/nanoclaw.sh: NanoClaw WhatsApp agent with .env configuration - fly/goose.sh: Block's Goose agent with OpenRouter provider - fly/codex.sh: OpenAI Codex CLI with OpenRouter base URL override - fly/interpreter.sh: Open Interpreter with OpenRouter base URL override All scripts follow the Fly.io pattern (flyctl-based, no IP args for run_server/interactive_session) and use upload_file for env injection. 
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: add gptme agent to 8 remaining clouds Implement gptme agent scripts for digitalocean, vultr, linode, lambda, aws-lightsail, gcp, e2b, and modal. Each script follows the exact pattern of that cloud's existing aider.sh, adapted for gptme's install and launch commands. Updates manifest.json matrix entries from "missing" to "implemented". Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Add guardrails from insights: CLAUDE.md rules, hooks, pre-commit Based on usage insights analysis: CLAUDE.md: - Shell script rules: curl|bash compat, macOS bash 3.x compat - Autonomous loop rules: test after each iteration, never revert fixes - Git workflow rules: always use feature branches .claude/settings.json: - PostToolUse hook validates .sh files on every Write/Edit: syntax check, no relative source, no echo -e, no set -u .githooks/pre-commit: - Blocks commits with: syntax errors, relative sources, echo -e, set -euo, references to deleted functions - Install: git config core.hooksPath .githooks README.md: - Added developer setup section with hook installation Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Sprite <noreply@sprite.dev> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-07 20:02:19 -08:00

42 commits