spawn

vrr/spawn

mirror of https://github.com/OpenRouterTeam/spawn.git synced 2026-05-08 18:39:50 +00:00

Author	SHA1	Message	Date
A	e163b38f09	test: add 44 tests for credential display functions in dry-run path (#961 ) Adds unit tests for buildCredentialStatusLines, formatAuthVarLine, and the credential section allSet detection in showDryRunPreview. These functions had zero direct test coverage despite being in the critical dry-run preview path. Tests cover: - formatAuthVarLine: env var set/missing display, URL hints, indentation - buildCredentialStatusLines: OPENROUTER_API_KEY always present, single and multi-var auth, URL hint placement, partial credentials, no-auth clouds, all-set scenarios - Dry-run allSet detection: all creds set, partial, multi-var, none auth - credentialHints allSet branch: the "appear to be set" path when all env vars are present but the error may be invalid/expired credentials - credentialHints partial credentials: mixed set/missing env var states Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 09:51:49 -08:00
A	221ed67f2e	test: add 27 tests for clearHistory and cmdListClear (#955 ) Add comprehensive test coverage for the previously untested clearHistory (history.ts) and cmdListClear (commands.ts) functions invoked via `spawn list --clear`. Tests cover: - Basic clearing: return value, file deletion, directory preservation - Edge cases: corrupted JSON, non-array values, empty files, null - Interaction: save-after-clear, filterHistory-after-clear, idempotency - cmdListClear: log.info vs log.success output, singular/plural grammar, file deletion, corrupted file handling, large history counts Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Haiku 4.5 <noreply@anthropic.com>	2026-02-13 08:48:39 -08:00
A	68122a6f70	fix: remove duplicate ensure_jq and fix help text alias placement (#954 ) - Remove duplicate ensure_jq() function in shared/common.sh (lines 2341-2372) that was accidentally left after extracting it to the shared lib in #946 - Move "Aliases: ls, history" onto the "spawn list" help line so it no longer appears to describe "spawn list --clear" Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 08:38:17 -08:00
A	0897f64f61	refactor: decompose Atlantic.Net and HOSTKEY create_server into focused helpers (#952 ) - Atlantic.Net create_server (59 lines -> 30 lines): - Extract _atlanticnet_extract_error for API error message parsing - Extract _atlanticnet_check_create_error for error checking + diagnostics - Extract _atlanticnet_parse_instance_response for response parsing - Replace inline python3 with shared _extract_json_field helper - Reuse _atlanticnet_extract_error in atlanticnet_register_ssh_key - HOSTKEY create_server (52 lines -> 24 lines): - Extract _hostkey_build_order_body for JSON body construction - Extract _hostkey_check_create_error for error checking + diagnostics - Extract _hostkey_parse_instance_response for response parsing - Update atlanticnet-provider.test.ts to check extracted helpers Agent: complexity-hunter Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 08:35:24 -08:00
A	ea39c8bf28	fix: prevent command injection in update-check reExecWithArgs (#951 ) Replace execSync with execFileSync in reExecWithArgs() to prevent shell metacharacter injection via binary path. execFileSync bypasses the shell entirely, executing the binary directly with an argv array. The performAutoUpdate() call retains execSync since it legitimately needs a shell for piping (curl \| bash). Fixes #950 Agent: security-auditor Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 08:34:04 -08:00
A	904bebfb70	test: add 68 tests for run retry logic and display formatting helpers (#945 ) Cover previously untested internal functions: - formatCacheAge (index.ts): cache age to human-readable string conversion - handleUserInterrupt (commands.ts): Ctrl+C detection in error messages - runWithRetries (commands.ts): SSH failure retry logic with MAX_RETRIES - printInfoHeader (commands.ts): agent/cloud info page header formatting - printGroupedList (commands.ts): grouped display with type labels - renderListTable (commands.ts): spawn history table output formatting Includes boundary transition tests, edge cases, and integration scenarios. Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 07:24:49 -08:00
A	2e7b083f8f	fix: show cloud URL for missing credentials in dry-run and add spawn list --clear (#944 ) Two UX improvements: 1. Dry-run credential status now shows the cloud provider's URL next to missing cloud-specific auth vars (e.g., HCLOUD_TOKEN), helping users find where to create their credentials. Previously only OPENROUTER_API_KEY showed a URL hint. 2. Added `spawn list --clear` command to let users clear their spawn history. Previously there was no way to reset the 100-entry history file without manually deleting ~/.spawn/history.json. Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 07:16:14 -08:00
A	f806085bb6	fix: improve UX of error messages and status output (#938 ) - Remove redundant "Warning:" prefix from API key format message (log_warn already conveys warning status) - Fix incorrect `export VAR=token spawn ...` syntax in auth failure hint (export makes it persistent, inline env var syntax is correct) - Replace attempt/retry jargon with elapsed time in SSH wait and instance polling messages (users care about time, not internal retry counts) - Show instance IP in friendlier "ready (IP: x.x.x.x)" format - Move HTTP status codes from error title to body in download failures (cleaner error headline, details still available) - Simplify dry-run credential warning (remove confusing double-negative "without --dry-run") - Remove redundant "Warning:" prefix from extra arguments message Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com>	2026-02-13 06:45:04 -08:00
A	f696f57129	test: add 44 tests for extract_api_error_message and generic_wait_for_instance (#798 ) These two critical shared/common.sh functions had zero test coverage despite being used across 4+ and 9 cloud providers respectively (10+ call sites each). extract_api_error_message tests cover: - All JSON error field patterns (message, error, error.message, error_message, reason) - Field priority ordering - Fallback behavior for invalid JSON, empty input, unrecognized fields - Real-world API response formats (Hetzner, DigitalOcean, Vultr, Contabo) - Edge cases (special characters, unicode, arrays, null) generic_wait_for_instance tests cover: - Successful polling (first attempt and multi-attempt) - IP extraction from flat and deeply nested JSON - Timeout behavior when status never reaches target - Continued polling when API returns errors or invalid JSON - Polling when status matches but IP is empty - Logging output (progress, success, timeout) Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Haiku 4.5 <noreply@anthropic.com>	2026-02-13 06:25:25 -08:00
A	c0cb32f9ce	test: add 97 tests for list command output helpers (#846 ) * test: add 97 tests for list command output helpers Cover buildRetryCommand (prompt truncation at 80 chars, quote escaping, prompt-file fallback), resolveDisplayName (null manifest fallback), buildRecordLabel/buildRecordHint (30-char hint truncation, picker formatting), parseAuthEnvVars (multi-var parsing, validation), hasCloudCredentials (multi-var auth, empty/unset vars), getImplementedClouds/getImplementedAgents (manifest filtering), isRetryableExitCode (SSH 255 detection), formatTimestamp (edge cases), and getStatusDescription (404 special case). Agent: test-engineer Co-Authored-By: Claude Haiku 4.5 <noreply@anthropic.com> * fix: import actual functions instead of duplicating them in tests - Export formatTimestamp, buildRecordLabel, buildRecordHint from commands.ts - Replace 11 duplicated function implementations with imports from commands.ts - Add @clack/prompts mock (required when importing commands.ts) - All 97 tests still pass against the real production code Agent: pr-maintainer Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> * fix: resolve rebase conflicts and update tests for formatRelativeTime Merged formatRelativeTime from main, exported formatTimestamp and buildRecordHint, updated tests to use relative time assertions. Agent: pr-maintainer Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Haiku 4.5 <noreply@anthropic.com>	2026-02-13 06:22:18 -08:00
A	c833f4ed3e	fix: improve UX with macOS compat fix, clearer messages, and less alarming prompts (#934 ) - Fix macOS compatibility bug in Atlantic.Net API signature: `base64 -w 0` fails on macOS (no `-w` flag); add fallback like other providers - Replace misleading "Use 'csb' CLI dashboard" in CodeSandbox interactive session with link to the actual web terminal at codesandbox.io/dashboard - Soften preflight credential check prompt from "will likely fail" to "will prompt you to authenticate" (scripts have built-in auth flows) - Bump CLI version to 0.2.72 Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-13 06:10:47 -08:00
A	75d7be29a4	fix: improve UX for stale manifest cache, list rerun hints, and version info (#805 ) - Show warning when manifest is loaded from stale cache (offline fallback) so users know the data may be outdated - Fix list footer rerun command: reuse buildRetryCommand instead of truncating prompts with "..." which produced broken copy-paste commands - Show manifest cache age in "spawn version" output for troubleshooting - Bump CLI version to 0.2.67 Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 06:09:56 -08:00
A	4c1a344a7a	test: add 59 tests for JSON extraction helpers in shared/common.sh (#804 ) Cover _extract_json_field and extract_api_error_message functions that were recently extracted (PRs #673, #767) but had zero test coverage. These are critical infrastructure used by Hetzner, DigitalOcean, Vultr, and Contabo for API error parsing and by generic_wait_for_instance for status polling. Tests cover: - _extract_json_field: basic extraction, nested fields, default values, complex Python expressions, real-world cloud provider patterns, edge cases - extract_api_error_message: all standard error field patterns (message, error, error.message, error.error_message, reason), field priority order, fallback behavior, real-world cloud provider error formats, edge cases Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 06:07:50 -08:00
A	099d8020cc	test: add 343 cloud lib source chain verification tests (#935 ) Verify that every cloud provider's lib/common.sh correctly sources shared/common.sh and exposes required shared functions. Tests run each cloud's lib in a real bash subprocess to catch source chain breaks, syntax errors, and missing function definitions. Coverage: - Source chain integrity for all 36 cloud lib files - Required shared function availability (logging, OAuth, API, SSH) - json_escape behavior (quotes, newlines, backslashes, tabs) - validate_api_token and validate_server_name security - calculate_retry_backoff bounds - extract_api_error_message parsing - Cross-cloud consistency (SSH_OPTS, API helpers) - bash -n syntax check on all lib files Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 06:02:06 -08:00
A	5948de15b8	fix: show 'ready to go' in quick start when all credentials are set (#866 ) When all required credentials (OPENROUTER_API_KEY + cloud auth vars) are already configured, the Quick start section in `spawn <agent>` and `spawn <cloud>` now shows a concise "credentials detected -- ready to go" message with just the launch command, instead of showing export instructions the user doesn't need. Previously, the `hasCreds` variable was computed but unused in both `printCloudQuickStart` and `cmdAgentInfo`. This change puts it to use to give users a clear signal when they're ready to launch. Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 05:58:22 -08:00
A	6589fd1f2f	refactor: extract helpers from performAutoUpdate in update-check.ts (#881 ) Break down the 70-line performAutoUpdate function (depth-4 nesting, mixed concerns) into focused helpers: - shellQuote: reusable shell-quoting utility - printUpdateBanner: boxed update notification formatting - reExecWithArgs: binary re-exec with exit code forwarding - performAutoUpdate: clean 22-line orchestrator Agent: complexity-hunter Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 05:53:33 -08:00
A	ebc5a6cc2f	test: add 84 tests for interactive input validation helpers in shared/common.sh (#880 ) Cover get_resource_name, get_validated_server_name, get_model_id_interactive, interactive_pick, _display_and_select, and show_server_name_requirements -- all previously untested functions used by every agent/cloud script. Tests exercise env-var bypass paths (critical for CI/non-interactive use), validation rejection of injection attempts, boundary conditions, and menu rendering output. Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 05:47:30 -08:00
A	6c762494e2	test: add Atlantic.Net and CodeSandbox provider pattern tests (268 tests) (#928 ) Validates provider-specific patterns for the two most recently added clouds: - Atlantic.Net: HMAC-SHA256 signing, query-param API, SSH delegation, dual-credential auth - CodeSandbox: Node.js SDK exec, sandbox ID validation, env-var-based injection security - Cross-provider contrast tests verifying SSH vs SDK architecture divergence - Manifest consistency checks for both providers Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Haiku 4.5 <noreply@anthropic.com>	2026-02-13 05:47:01 -08:00
A	00f8913f20	fix: show credential readiness in `spawn clouds` and relative timestamps in `spawn list` (#910 ) Two UX improvements: 1. `spawn clouds` now shows a green "ready" indicator next to clouds where credentials are already configured in the environment, making it immediately clear which providers the user can use without additional setup. 2. `spawn list` now shows relative timestamps ("5 min ago", "yesterday", "3d ago") instead of absolute dates, giving immediate temporal context. Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-13 05:20:04 -08:00
A	795a502efb	test: add comprehensive Atlantic.Net provider tests (165 tests) (#899 ) Adds test coverage for the Atlantic.Net cloud provider (added in PR #883), which had zero test coverage. Tests validate: - lib/common.sh structure, API surface, and shell conventions - HMAC-SHA256 signature auth flow correctness - Security patterns (credential storage, URL encoding, config permissions) - Credential management flow (env -> config -> prompt chain) - SSH delegation pattern to shared helpers - Server lifecycle functions (create, destroy, response parsing) - Default parameter helpers and manifest consistency - All 3 implemented agent scripts (claude, aider, openclaw) - Agent-specific setup patterns and error handling - API wrapper parameter handling - README documentation Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Haiku 4.5 <noreply@anthropic.com>	2026-02-13 05:11:31 -08:00
A	74b9535457	test: add 85 tests for run-path credential display and validation functions (#918 ) Tests prioritizeCloudsByCredentials (zero prior coverage), credential status display logic, entity validation, key resolution, retry command building, retryable exit code detection, and failure guidance for the critical spawn <agent> <cloud> run path. Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Haiku 4.5 <noreply@anthropic.com>	2026-02-13 05:11:24 -08:00
A	2b9a812433	test: add CodeSandbox cloud provider pattern tests (202 tests) (#922 ) Comprehensive test coverage for the CodeSandbox provider (merged in #857) which previously had zero dedicated tests. Validates: - Manifest integration (type, auth, exec_method, matrix entries) - lib/common.sh API surface (13 required functions, no SSH leakage) - SDK security: all 5 SDK functions pass user data via env vars - Sandbox ID validation (regex, error handling, called by consumers) - upload_file() security (path injection protection, base64 encoding) - Authentication flow (ensure_api_token_with_provider delegation) - create_server/destroy_server/list_servers SDK patterns - Agent scripts follow standard provisioning flow (3 scripts) - macOS bash 3.x compatibility (no echo -e, source <(), set -u) - Node.js SDK code quality (try/catch, process.exit, process.env) - No dangerous patterns (no eval, no unquoted expansions, no injection) Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com>	2026-02-13 05:11:17 -08:00
A	7731306f37	test: add local cloud provider pattern tests (239 tests) (#911 ) Adds comprehensive test coverage for the local cloud provider, which runs agents directly on the user's machine without cloud provisioning. Previously had zero dedicated tests despite 14 implemented agent scripts. Tests cover: - local/lib/common.sh API surface (no-op destroy, bash -c exec, cp uploads) - All 14 local agent scripts follow local-specific patterns - No SSH/SCP patterns leak into local scripts - OpenRouter API key handling with OAuth fallback - SPAWN_PROMPT handling for interactive/non-interactive modes - Installation verification (command -v checks) - Safety checks (no sudo, no rm -rf system dirs) - Manifest consistency for local cloud entries Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 04:02:45 -08:00
A	7a441813fd	fix: detect slash notation and suggest correct syntax (#859 ) When users type `spawn claude/hetzner` or `spawn hetzner/claude`, the CLI now splits on the slash and forwards to the correct handler with a helpful tip, instead of showing a confusing "invalid characters" error from identifier validation. Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 02:15:59 -08:00
A	fa5b4979e8	fix: upgrade SSH to StrictHostKeyChecking=accept-new (TOFU) and randomize temp paths (#849 ) - Change SSH default from StrictHostKeyChecking=no to accept-new, which accepts host keys on first connection but rejects if they change later (Trust On First Use). This protects against MITM attacks on subsequent connections. Requires OpenSSH 7.6+ (released Oct 2017). - Replace predictable $$-based temp file path in upload_config_file with $RANDOM to prevent symlink attacks on the remote server. Addresses findings from issue #763. Agent: security-auditor Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 02:11:47 -08:00
A	bfb125c028	test: add cloud lib API surface tests (#852 ) Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Haiku 4.5 <noreply@anthropic.com>	2026-02-13 02:09:56 -08:00
A	ebdab346df	fix: warn about missing credentials before running spawn scripts (#851 ) Previously, users would run `spawn claude hetzner` without HCLOUD_TOKEN set, the CLI would download and start executing the script, and it would fail mid-execution after potentially provisioning resources. Now the CLI checks for missing credentials before running and warns the user upfront. In interactive mode, shows a confirmation prompt so the user can abort or continue. In non-interactive mode, shows a warning without blocking. - Add preflightCredentialCheck() that inspects cloud auth env vars - Call it in cmdRun before script execution - 9 tests covering all credential states (all set, partial, missing, multi-var, CLI-based auth, none auth) - Version bump to 0.2.69 Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 01:52:41 -08:00
A	7b5f84141f	fix: show specific missing credentials in script failure messages (#813 ) When a spawn script fails, the error message now checks which required environment variables are actually set vs missing, instead of generically saying "Missing or invalid credentials". This helps users immediately see which credential they need to add. - All set: "Credentials appear to be set (invalid or expired?)" - Some missing: lists only the specific vars that are not set - None set: lists all required vars Version bump to 0.2.67. Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 01:45:01 -08:00
A	d9a18b49d3	fix: show credential-aware quick start in spawn <agent> and spawn <cloud> info (#817 ) Prioritize clouds with detected credentials in spawn <agent> info pages. Skip showing export instructions for env vars already set. Show credential status in spawn <cloud> info header and available clouds list. Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 01:33:19 -08:00
A	2f671d8edf	test: add 66 tests for OAuth security functions in shared/common.sh (#814 ) Cover previously untested security-critical OAuth functions: - _generate_oauth_html: HTML generation for success/error pages - _validate_oauth_server_args: port validation + CSRF state file - _generate_oauth_server_script: Node.js server script generation - cleanup_oauth_session: temp resource cleanup - exchange_oauth_code: JSON injection prevention via json_escape - execute_agent_non_interactive: prompt escaping with printf %q - wait_for_oauth_code: timeout behavior - _check_oauth_prerequisites: connectivity + runtime detection - find_node_runtime: bun/node discovery Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 01:24:33 -08:00
A	1725fa79d4	test: add cloud lib security convention regression tests (69 tests) (#816 ) Validates that all cloud provider lib/common.sh files follow security conventions from the security audit. Tests cover SSH key encoding (json_escape or python json.dumps), config file permissions, Python code injection prevention, API body JSON safety, heredoc injection prevention, shared/common.sh sourcing, and credential handling patterns. Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 01:23:20 -08:00
A	8446e785cf	test: add 88 tests for OAuth flow functions in shared/common.sh (#843 ) The OAuth flow is the primary authentication mechanism for spawn users, yet its component functions had zero test coverage. This adds tests for: - validate_oauth_port: port range validation (boundary values, injection) - _generate_csrf_state: CSRF token generation (entropy, uniqueness) - _generate_oauth_html: success/error HTML page generation - _generate_oauth_server_script: Node.js callback server (CSRF, ports) - _validate_oauth_server_args: prerequisite validation (port, state, runtime) - _init_oauth_session: temp directory and CSRF state file creation - cleanup_oauth_session: PID and directory cleanup - exchange_oauth_code: OAuth code-to-key exchange with json_escape security - check_openrouter_connectivity: network reachability fallback chain - Integration: session lifecycle and CSRF security properties Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 01:22:11 -08:00
A	6182348641	fix: show credential status in dry-run and specify missing env vars on failure (#841 ) Two UX improvements: 1. `spawn <agent> <cloud> --dry-run` now shows a Credentials section that checks which env vars (OPENROUTER_API_KEY, cloud auth vars) are set vs missing, so users can verify readiness before a real run. 2. Script failure guidance (exit code 1 and default) now checks which specific env vars are unset instead of showing a generic "need X + Y" message, making it immediately clear what's missing. Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 01:20:21 -08:00
A	b1a576a52a	test: add 51 tests for _classify_api_result and _report_api_failure (#834 ) These helpers were extracted from _cloud_api_retry_loop in PR #821 to reduce cyclomatic complexity but had zero test coverage. They are invoked on every cloud API call across all providers: - _classify_api_result: Classifies curl/HTTP results into retry reasons (network error, rate limit 429, service unavailable 503) or empty (success/non-retryable error). Tests cover all branches including curl exit codes 1/6/7/28, HTTP 429/503, success codes 200/201/204, non-retryable errors 400-502, and edge cases. - _report_api_failure: Generates user-facing error messages after retries are exhausted. Differentiates network vs HTTP errors, outputs API response body only for HTTP errors. Tests cover retry count display, response body handling, and special chars. Also includes integration tests verifying the classify-then-report pipeline and realistic cloud provider scenarios (Hetzner, DigitalOcean, DNS failures, auth errors, validation errors). Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Haiku 4.5 <noreply@anthropic.com>	2026-02-13 01:19:31 -08:00
A	087a14c276	test: add agent env injection contract tests (128 tests) (#838 ) Validates the critical contract that every implemented agent script correctly injects the environment variables from manifest.json. Catches silent breakage where an agent starts but cannot reach the LLM API due to missing OPENROUTER_API_KEY or provider-specific vars. Tests cover: - OPENROUTER_API_KEY presence in all scripts - Provider-specific env vars (ANTHROPIC_BASE_URL, OPENAI_BASE_URL, etc.) - OpenRouter API key acquisition patterns (env check, OAuth, manual) - Agent install and launch command references - Cloud lib env injection infrastructure - Base URL values pointing to openrouter.ai - No hardcoded API keys (security) - Full coverage statistics across all agents and clouds Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 01:19:14 -08:00
A	813089def7	test: add 67 tests for shared/github-auth.sh (zero prior coverage) (#832 ) Add comprehensive test coverage for the standalone GitHub auth helper (shared/github-auth.sh) merged in PR #824 with no tests. Coverage includes: - Source pattern and function availability (9 tests) - Fallback log functions when common.sh unavailable (3 tests) - ensure_gh_cli: detection, installation paths, error handling (7 tests) - _install_gh_binary: OS/arch detection, error paths, cleanup (11 tests) - ensure_gh_auth: token auth, interactive login, post-login checks (8 tests) - ensure_github_auth: combined wrapper success/failure (4 tests) - Direct execution mode and set -eo pipefail (2 tests) - Script conventions: bash 3.x compat, no echo -e, safe var access (10 tests) - Installation path coverage: macOS/Linux/APT/DNF/Homebrew (4 tests) - Error handling edge cases: curl failure, tar failure, auth failures (6 tests) - GITHUB_TOKEN security: piped via printf, not CLI arg (2 tests) - Shebang check (1 test) Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 01:17:57 -08:00
A	9f76af00d2	fix: show credential status in quick-start sections (#823 ) The quick-start sections in `spawn <cloud>` and `spawn <agent>` now show whether required env vars are already set (green with "set" indicator) or still need to be configured (cyan "export" instruction). This helps users immediately see what credentials are missing before launching. Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-12 23:59:57 -08:00
A	4d3c54a11e	refactor: extract helpers from execScript and _cloud_api_retry_loop (#821 ) Reduce cyclomatic complexity in the two highest-scoring functions: - cli/src/commands.ts: Extract `handleUserInterrupt` and `runWithRetries` from `execScript` (complexity score 6 -> 2 for execScript, retry logic now independently testable) - shared/common.sh: Extract `_classify_api_result` and `_report_api_failure` from `_cloud_api_retry_loop` (complexity score 9 -> 4, removes duplicated error-classification logic from loop body) Agent: complexity-hunter Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-12 23:57:20 -08:00
A	e73d6b9793	fix: support --flag=value syntax in CLI argument parsing (#826 ) Previously, `spawn --prompt="Fix bugs" claude sprite` or `spawn list --agent=claude` would fail with "Unknown flag" because the CLI only recognized `--flag value` (space-separated) syntax. Now `--flag=value` is expanded to `--flag value` early in the arg parsing pipeline, supporting the common GNU-style convention. Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com>	2026-02-12 23:55:46 -08:00
A	716da5d43b	fix: auto re-exec command after CLI auto-update (fixes #780 ) (#830 ) When a CLI auto-update triggers mid-command (e.g. `spawn claude sprite`), the updated binary now automatically re-runs with the original arguments instead of asking the user to manually re-run. Sets SPAWN_NO_UPDATE_CHECK=1 on re-exec to prevent infinite update loops. Falls back to the old "run again" message when no arguments were provided (bare `spawn`). Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-12 23:54:49 -08:00
A	317d931e87	test: add 32 tests for extract_api_error_message in shared/common.sh (#820 ) This function parses JSON error responses from cloud provider APIs (used by Hetzner, DigitalOcean, Vultr, and Contabo) and had zero test coverage. Tests cover: field priority order, fallback behavior, realistic cloud provider responses, and edge cases (non-object JSON, null/empty fields). Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-12 23:52:27 -08:00
A	fbea9303f0	test: add 48 tests for SSH key lifecycle functions (#828 ) Cover ensure_ssh_key_with_provider (zero prior coverage), plus edge cases for generate_ssh_key_if_missing, get_ssh_fingerprint, extract_ssh_key_ids, and check_ssh_key_by_fingerprint. Tests validate the callback-based SSH key registration flow used by all cloud providers. Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-12 23:52:22 -08:00
A	5169350feb	fix: use buildRetryCommand in spawn list footer to avoid truncated prompts (#819 ) The "Rerun last" hint in `spawn list` was truncating prompts at 30 characters and appending "...", producing broken copy-paste commands. Now delegates to the existing buildRetryCommand helper which properly handles long prompts by suggesting --prompt-file instead of truncating. Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-12 23:52:08 -08:00
A	3f28d5f29f	test: add 52 tests for SSH helpers and instance polling in shared/common.sh (#822 ) Cover critical infrastructure functions that had zero dedicated test coverage: - ssh_run_server, ssh_upload_file, ssh_interactive_session (SSH command construction) - ssh_verify_connectivity (ConnectTimeout, max_attempts, test command) - generic_ssh_wait (exponential backoff, success/failure, elapsed time logging) - wait_for_cloud_init (argument delegation, cloud-init file check) - generic_wait_for_instance (API polling, status matching, IP export, timeout) - extract_api_error_message (all 5 error field patterns + fallbacks) - SSH_USER default behavior (root fallback across all helpers) Uses mock SSH/SCP/sleep commands via PATH override to test argument construction and behavior without requiring network connectivity. Agent: test-engineer -- refactor/test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-12 23:51:46 -08:00
L	6633873ccc	refactor: replace Python with jq in Hetzner lib, fix /lab → /labs URLs (#827 ) Hetzner lib: replace all Python JSON parsing with jq. Uses the /datacenters API as the authoritative source for server type availability (server_types.available), cross-referenced with /server_types for specs and pricing. jq is auto-installed if missing. URLs: update openrouter.ai/lab/spawn → openrouter.ai/labs/spawn across all READMEs and CLI source. Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-12 23:14:11 -08:00
A	0fe83fe311	fix: improve CLI error messages for retry commands and unknown names (#777 ) - buildRetryCommand: suggest --prompt-file for long prompts instead of truncating into a non-functional command (threshold raised to 80 chars) - showUnknownCommandError: change "Unknown command" to "Unknown agent or cloud" since users are passing agent/cloud names, not commands - Bump CLI version to 0.2.66 Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-12 17:19:46 -08:00
A	ff0ccfdbd0	refactor: reduce complexity in ramnode picker and cmdInteractive (#756 ) - Replace RamNode's custom _pick_flavor (37 lines) with shared interactive_pick helper (1 line), eliminating duplicated picker logic - Extract credential sorting from cmdInteractive into reusable prioritizeCloudsByCredentials helper for testability and clarity Agent: complexity-hunter Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-12 16:47:43 -08:00
A	f4b3d99cff	test: add 73 tests for logging, temp-file, cloud-init, and SSH key helpers (#765 ) Add comprehensive test coverage for previously untested utility functions in shared/common.sh that are used pervasively across all cloud providers: - log_step: cyan progress messages (added PR #757) - _log_diagnostic: structured error output (header + causes + numbered fixes) - check_python_available: Python 3 dependency detection with install hints - find_node_runtime: bun/node runtime discovery - track_temp_file + cleanup_temp_files: secure credential temp file cleanup - register_cleanup_trap: EXIT/INT/TERM signal handlers - get_cloud_init_userdata: cloud-init YAML generation for provisioning - calculate_retry_backoff: jittered exponential backoff - generate_ssh_key_if_missing: ed25519 key generation with directory creation - get_ssh_fingerprint: MD5 fingerprint extraction - opencode_install_cmd: opencode install script content - POLL_INTERVAL / SSH_OPTS: configurable constants and defaults - All 4 log functions: stderr-only output verification Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-12 16:46:13 -08:00
A	a290815108	test: add 111 tests for trigger-server security and validation logic (#774 ) Add comprehensive test coverage for the trigger-server HTTP service (.claude/skills/setup-agent-team/trigger-server.ts), which had zero test coverage despite recent security-critical changes (PRs #745, #747). Tests cover: - Timing-safe Bearer token auth (17 tests including injection attempts) - VALID_REASONS allowlist enforcement (13 tests including injection) - Issue parameter validation regex (17 tests including shell injection) - Issue dedup logic (8 tests) - Capacity checking (6 tests) - reapAndEnforce process cleanup (9 tests including boundary cases) - Health response structure (4 tests) - Streaming response metadata (4 tests) - Environment variable parsing (5 tests) - Route matching logic (10 tests) - Full validation flow with priority ordering (8 tests) Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Haiku 4.5 <noreply@anthropic.com>	2026-02-12 16:46:10 -08:00
A	cdf6f1dba5	fix: use log_step (cyan) for in-progress messages instead of log_info (green) (#768 ) In-progress actions (installing, starting, connecting...) should use log_step (cyan) to visually distinguish them from completion messages which use log_info (green). This makes it easier for users to see at a glance what is happening vs what has finished. Changes: - cli/install.sh: add log_step function, use it for install progress - shared/common.sh: OAuth flow and non-interactive exec messages - Cloud libs: interactive_session, auth, and cleanup messages - Agent scripts: gateway startup and session opening messages Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-12 16:45:58 -08:00

1 2 3 4 5 ...

282 commits