spawn

vrr/spawn

mirror of https://github.com/OpenRouterTeam/spawn.git synced 2026-05-08 10:09:30 +00:00

Author	SHA1	Message	Date
A	90417c2e1b	test: fix agent-config-setup.test.ts - shell mocking for HOME variable substitution (#1195 ) All 40 tests in agent-config-setup.test.ts now pass by properly handling $HOME variable substitution in mock_run callbacks. Added createMockSetup() helper function to DRY up repeated mock configuration across openclaw and continue tests (16 tests total). Changes: - Fix mock_run() to replace $HOME before evaluating commands - Add createMockSetup(tempDir, configDir) helper to reduce code duplication - Update all setup_openclaw_config and setup_continue_config tests to use helper - Ensures /tmp/spawn_config_* temp files are redirected to temp test directory Agent: test-engineer Co-authored-by: spawn-refactor-bot <refactor@openrouter.ai> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-15 05:16:11 -05:00
A	3db288c3dd	feat: trim to 9 curated launch clouds, upvote-driven discovery (#1184 ) Reduce from 41 cloud providers to 10 (9 + local) curated for launch: - local (free), oracle (free tier), hetzner (~€3.29/mo), ovh (~€3.50/mo), fly (free tier), aws-lightsail ($3.50/mo), daytona (pay-per-second), digitalocean ($4/mo), gcp ($7.11/mo), sprite (Fly.io VMs) Changes: - Remove 30 cloud directories, test fixtures, and provider-specific tests - Slim manifest.json from 600 to 150 matrix entries, sorted by price - Update CLAUDE.md with higher bar for adding clouds (prestige + pricing) - Transform discovery service from code-implementing team to upvote-driven demand tracker that creates proposal issues and only implements when a proposal reaches 50+ upvotes - Create GitHub issue #1183 as cloud wishlist with all dropped clouds - Add discovery-team/cloud-proposal/agent-proposal labels - Protect discovery-team issues from refactor team (no comments/changes) - Fix all CLI tests (8034 pass, 0 fail) and shell tests (80 pass, 0 fail) Co-authored-by: lab <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-15 00:19:39 -08:00
A	49c8c4f60b	feat: add VM reconnect functionality to spawn list (#1175 ) * feat: add VM reconnect functionality to spawn list (#1144) Implements ability to reconnect to previously spawned VMs instead of always creating new instances. Changes include: - Add VMConnection interface to track IP, user, and server metadata - Add save_vm_connection() bash function for scripts to persist connection info - Modify spawn list to show connection status and offer reconnect option - Support both SSH (cloud providers) and sprite console reconnection - Update digitalocean/claude.sh and sprite/claude.sh as reference implementations Fixes #1144 Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> * improve: add helpful error message when VM reconnect fails Show user-friendly message suggesting to spawn a new VM if reconnection fails, rather than just showing raw SSH error. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> --------- Co-authored-by: spawn-refactor-bot <refactor@openrouter.ai> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-15 00:16:53 -05:00
A	1826fceee3	test: add missing coverage for cmdLast (#1176 ) Added comprehensive test suite for cmdLast function (PR #1171 feature). Covers: - Empty history (no records) - History with records (rerunning latest) - Record hints and prompt display - Helper functions (buildRecordLabel, buildRecordHint) - Edge cases (old timestamps, metadata fields, selection logic) Tests increased from 13,685 to 13,712 (+27 tests). Agent: test-engineer Co-authored-by: spawn-refactor-bot <refactor@openrouter.ai> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-15 00:08:33 -05:00
A	89f1712761	test: fix failing test assertions to match implementations (#1173 ) Updates test assertion strings in 10 test files to match current implementation error messages. Implements changes from PR #1159 which were blocked due to merge conflicts. Fixes #1161 Agent: test-engineer Co-authored-by: spawn-refactor-bot <refactor@openrouter.ai> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 23:30:27 -05:00
A	70c7f9f8c5	ux: add spawn last command to instantly rerun most recent spawn (#1171 ) Adds a new `spawn last` command (with `rerun` alias) that instantly reruns the most recent spawn from history without requiring the interactive picker. This improves the workflow for users who frequently want to restart their last session. Features: - `spawn last` or `spawn rerun` to instantly rerun last spawn - Shows descriptive label and timestamp before rerunning - Handles empty history gracefully with helpful message - Preserves prompt from original spawn if it had one - Updated help text and examples Agent: ux-engineer Co-authored-by: spawn-refactor-bot <refactor@openrouter.ai> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 23:27:59 -05:00
A	58232baf4d	fix: improve error handling and reliability in OAuth flow and script download (#1170 ) This commit fixes 3 high-impact reliability issues that could cause runtime failures: 1. OAuth server PID race condition (shared/common.sh) - BEFORE: Used pgrep to find server PID, which could match wrong processes - AFTER: Store PID in a file and read it reliably - IMPACT: Prevents OAuth cleanup failures and orphaned server processes 2. Unhandled curl failures in OAuth code exchange (shared/common.sh) - BEFORE: curl failures returned empty response without error detection - AFTER: Check curl exit code and report network/API errors clearly - IMPACT: Users get actionable feedback instead of cryptic "empty key" errors 3. Missing error handling in script download (cli/src/commands.ts) - BEFORE: Caught download error but continued execution with undefined scriptContent - AFTER: Exit early when download fails to prevent crash - IMPACT: Prevents "Cannot read property of undefined" runtime errors All changes preserve existing behavior while adding defensive error handling. Agent: code-health Co-authored-by: spawn-refactor-bot <refactor@openrouter.ai> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 23:26:53 -05:00
A	2fbe225855	refactor: extract helper functions to reduce complexity in discovery and commands (#1172 ) Reduced complexity in 2 functions by extracting focused helpers: 1. preflightCredentialCheck (42 → 30 lines): - collectMissingCredentials(): validate env vars - getCredentialGuidance(): context-specific messaging - confirmContinueWithMissingCreds(): user confirmation logic 2. build_single_prompt (54 → 14 lines): - _find_first_gap(): extract matrix gap lookup - _print_gap_implementation_steps(): format implementation guidance - _print_matrix_full_guidance(): format discovery guidance Improves testability and readability while preserving behavior. Agent: complexity-hunter Co-authored-by: spawn-refactor-bot <refactor@openrouter.ai> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 23:23:26 -05:00
A	bf738bee69	ux: improve CLI help examples and remove duplicate auth text (#1163 ) - Diversify help command examples to showcase more agents and clouds (openclaw, goose, interpreter, vultr, digitalocean, linode) - Remove duplicate "Auth: token" text in cloud info display - Update test to match new help examples Agent: ux-engineer Co-authored-by: spawn-refactor-bot <refactor@openrouter.ai> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 22:11:42 -05:00
A	2605499f1a	refactor: extract helper functions to reduce checkEntity complexity (#1153 ) Split checkEntity into three focused helpers that each handle a specific correction strategy (wrong kind, same-kind typo, opposite-kind typo). This reduces cyclomatic complexity from 6 to 2 in the main function, making it easier to test and understand. Agent: complexity-hunter Co-authored-by: spawn-refactor-bot <refactor@openrouter.ai> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 20:48:48 -05:00
A	a653549e83	refactor: reduce complexity in cmdHelp and cmdAgentInfo functions (#1157 ) Extract cmdHelp's 76-line help message into 6 modular helper functions (getHelpUsageSection, getHelpExamplesSection, getHelpAuthSection, getHelpInstallSection, getHelpTroubleshootingSection, getHelpEnvVarsSection, getHelpFooterSection) to improve maintainability and allow reuse. Extract cmdAgentInfo's cloud listing logic into printAgentCloudsList helper to reduce the function's cognitive load and separate display concerns. Both refactorings maintain identical user-facing behavior while reducing code duplication and improving testability. Agent: complexity-hunter Co-authored-by: spawn-refactor-bot <refactor@openrouter.ai> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 20:45:44 -05:00
A	4f7f840f53	fix: align install script tests with actual implementation (#1154 ) - Fix install-script-validation tests that checked for non-existent source-mode fallback features (PRs #707, #710 were not implemented) - Rename test suite to "build fallback and binary download" to match actual behavior (pre-built binary download, not source mode) - Remove assertions for non-existent features (${HOME}/.spawn, exec bun wrapper, forced reinstall) - Add test for actual fallback behavior (downloading cli.js from releases) - Fix download-and-failure test to match actual error message casing ("Firewall or proxy" not "firewall or proxy") These tests were blocking CI and preventing clarity on actual vs desired implementation. Now tests accurately reflect the current install.sh behavior. Agent: test-engineer Co-authored-by: spawn-refactor-bot <refactor@openrouter.ai> Co-authored-by: Claude Haiku 4.5 <noreply@anthropic.com>	2026-02-14 20:45:35 -05:00
A	4e19859d77	test: fix outdated error message assertions (#1141 ) Update test expectations to match current UX error messages: - "Cannot run interactive picker" instead of "No interactive terminal" - "Next steps" instead of "What to do" - "experiencing issues" instead of "recovering" - "Firewall or proxy" (capitalized) instead of "firewall or proxy" All affected tests now pass with the current CLI error messages. Agent: ux-engineer Co-authored-by: spawn-refactor-bot <refactor@openrouter.ai> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 16:43:35 -08:00
A	30b31904df	refactor: reduce complexity in error handling and list display (cli/commands.ts) (#1142 ) - Move exit code conditional logic into EXIT_CODE_GUIDANCE callbacks - Extract buildEnvironmentLines() and buildPromptLines() helpers - Extract buildListFooterLines() to separate formatting from display - Reduces cyclomatic complexity and improves code reusability Agent: complexity-hunter Co-authored-by: spawn-refactor-bot <refactor@openrouter.ai> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 18:49:57 -05:00
A	69df76f3a8	fix: correct test expectations to match actual CLI error messages (#1143 ) Updated failing test cases to match the actual error messages generated by the CLI: - "Cannot run interactive picker: not a terminal" (not "No interactive terminal") - "Try manual installation:" (not "Try the installation manually") - "Retry with a fresh server" (not "Re-run spawn to try") - "installation failed" (not "installation failed to complete successfully") - "Next steps" (not "What to do") - "temporarily unavailable" (not "recovering") Shell tests (80/80) pass. CLI tests improved from 128 failures to 47 failures. Co-authored-by: spawn-refactor-bot <refactor@openrouter.ai> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 18:48:03 -05:00
A	92ac7b8f67	ux: clarify non-interactive terminal error message (#1137 ) Improved the error message when spawn is run without arguments in a non-interactive environment (piped/redirected stdin/stdout). Before: 'No interactive terminal detected.' After: 'Cannot run interactive picker: not a terminal' '(stdin/stdout is piped or redirected)' This makes it clearer why the interactive picker cannot run and what the actual issue is (not just 'detected' but explicitly explaining the stdin/stdout state). Agent: ux-engineer Co-authored-by: spawn-refactor-bot <refactor@openrouter.ai> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 18:11:47 -05:00
A	9576cd5005	refactor: reduce function complexity in shared/common.sh and cli/commands.ts (#1138 ) Extracted helper functions to improve code maintainability: 1. shared/common.sh: - Extracted _prompt_and_validate_api_key() from get_openrouter_api_key_manual() - Simplified API key validation loop and confirmation logic 2. cli/commands.ts: - Extracted selectAgent() from cmdInteractive() for agent selection - Extracted getAndValidateCloudChoices() for cloud validation and prioritization - Extracted selectCloud() for cloud selection UI - Extracted report404Failure() and reportHTTPFailure() from reportDownloadFailure() - Extracted classifyNetworkError(), showTimeoutCauses(), showConnectionCauses(), etc. - Simplified error handling with switch statement in reportDownloadError() These changes reduce cyclomatic complexity and improve testability while preserving all existing functionality. Co-authored-by: spawn-refactor-bot <refactor@openrouter.ai> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 18:11:36 -05:00
A	8e55123e43	test: improve test coverage for provider delegation patterns (#1135 ) * test: fix codesandbox provider pattern tests for helper function indirection Update tests to account for functions that delegate to SDK helpers (_csb_sdk_eval and _csb_run_cmd) rather than directly inlining SDK code. Also add aliyun CLI auth pattern to credential handling test. - Fix codesandbox tests to check for helper calls when patterns aren't direct - Update test_codesandbox_token test to accept "How to fix" variant - Allow interactive_session validation to check via run_server delegation - Fixed: 42 codesandbox failures reduced to 0, 1 alibabacloud failure fixed Agent: test-engineer Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> * test: fix alibabacloud provider pattern tests for delegation Update tests to account for alibabacloud delegating to shared SSH functions instead of implementing SSH/SCP directly. Also adjust validation expectations to match actual implementation which uses _aliyun_validate_create_params. - Accept _aliyun_validate_create_params as validation pattern - Update SSH test expectations for ssh_run_server and ssh_interactive_session - Fixed: 6 alibabacloud failures reduced to 0 Agent: test-engineer Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> * test: fix new-cloud-provider-patterns codesandbox validation tests Update tests to account for codesandbox delegating to _csb_run_cmd helper and interactive_session delegating to run_server. - Accept _csb_run_cmd as SDK execution pattern - Allow interactive_session validation via run_server delegation - Fixed: 2 codesandbox validation failures reduced to 0 Agent: test-engineer Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> --------- Co-authored-by: spawn-refactor-bot <refactor@openrouter.ai> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 17:48:18 -05:00
A	22cdd75f80	ux: improve error message clarity and formatting (#1133 ) Enhance user-facing error messages with better structure and visual hierarchy: CLI Error Messages: - Add bold headers for "Next steps:" and "Possible causes:" sections - Make action items more scannable and directive - Simplify language (e.g., "temporarily" vs "temporarily unavailable") - Reduce redundancy in network error messages Shell Error Messages: - Add color-coded section headers (yellow for "Common causes" and "Next steps") - Apply syntax highlighting to commands with CYAN color - Improve readability of multi-step installation instructions - Use bullet points (•) instead of dashes for better visual scanning - Add inline comments to commands (e.g., "# Check disk space") Impact: Users experiencing errors will: - Find actionable steps faster with clear visual hierarchy - Copy-paste commands more easily with syntax highlighting - Understand root causes quicker with color-coded sections - Have a better experience during failure scenarios All changes maintain backward compatibility and work across bash 3.x (macOS) and modern bash. Agent: ux-engineer Co-authored-by: spawn-refactor-bot <refactor@openrouter.ai> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 17:44:47 -05:00
A	aaa886a7a9	test: fix failing test assertions to match actual implementation (#1130 ) Updated test assertions to reflect refactored helper functions and changed error messages. Key changes: - Fixed atlanticnet security tests to verify ensure_multi_credentials delegation instead of checking implementation details in provider code - Updated shared-common-decomposed-helpers tests to check actual error output messages instead of outdated wording - Fixed shared-github-auth test mocking to properly override command builtin for platform detection - Updated CloudSigma manifest auth field to explicitly mention HTTP Basic Auth Tests now pass with 517/517 success across affected test files. Agent: test-engineer Co-authored-by: spawn-refactor-bot <refactor@openrouter.ai> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 14:14:33 -08:00
A	b0843f6144	test: fix error message assertions in 7 test files (#1124 ) Fixed 24 failing test assertions by aligning test expectations with actual error message output from the codebase: - Updated error message strings to match actual implementation (e.g., "What to do" instead of "How to fix") - Fixed case sensitivity issues ("Report it" vs "report it", "Server is still booting" vs "may still be booting") - Adjusted assertions to match specific error paths (Network timeout vs Connection refused) - All 284 tests in these 7 files now pass Files fixed: - cli-entry-edge-cases.test.ts: 56 tests - cmdrun-happy-path.test.ts: 27 tests - commands-swap-resolve.test.ts: 23 tests - commands-update-download.test.ts: 17 tests - download-and-failure.test.ts: 42 tests - shared-common-ssh-helpers.test.ts: 52 tests - shared-common-untested-helpers.test.ts: 67 tests Co-authored-by: spawn-refactor-bot <refactor@openrouter.ai> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 15:49:12 -05:00
A	f666b40977	ux: prioritize ready clouds in "not yet implemented" errors (#1117 ) When a user tries to spawn an agent on an unimplemented cloud, the error message now shows alternative clouds sorted by credential availability. Clouds where credentials are already set are shown first and marked with "(ready)" to make it obvious which options require no setup. Before: Claude Code is available on 8 clouds. Try one of these instead: spawn claude hetzner spawn claude digitalocean spawn claude sprite After: Claude Code is available on 8 clouds. Try one of these instead: spawn claude sprite (ready) spawn claude hetzner spawn claude digitalocean ready = credentials already set This reduces friction by guiding users toward the path of least resistance when their initial choice isn't available. Agent: ux-engineer Co-authored-by: Spawn Refactor Service <refactor@spawn.service> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 13:16:46 -05:00
A	de42958eca	ux: fix install and upgrade success messages (#1113 ) Fixes two UX issues identified in #1106: 1. Install script: Raw escape codes weren't rendering in log_info - Before: "Run \033[1mspawn\033[0m\033[0;32m to get started\033[0m" - After: Uses printf with proper color variable interpolation 2. Update command: Confusing message after `spawn update` - Before: "Run your spawn command again to use the new version" - After: "Run spawn again to use the new version" - The word "your" implied the user had run some other command, but they explicitly ran `spawn update` Agent: ux-engineer Co-authored-by: Spawn Refactor Service <refactor@spawn.service> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 12:45:36 -05:00
A	cce815836f	test: update assertions to match improved error messages (#1109 ) The error messages were previously improved to be more user-friendly and actionable (see PR #1103), but some tests were still checking for the old error text. This commit updates test assertions to match the new, clearer error messages. Changes: - Update security.test.ts assertions to check for new error message patterns - Fix case-sensitivity issue in cli-version-and-dispatch.test.ts - Update index-main-routing.test.ts to match new validation messages The improved error messages now: - Tell users WHAT went wrong - Tell users HOW to fix it - Provide concrete examples and next steps Agent: ux-engineer Co-authored-by: Spawn Refactor Service <refactor@spawn.service> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 11:49:20 -05:00
A	f8b2178658	ux: improve error messages for better clarity and actionability (#1103 ) Enhance error messages throughout the codebase to provide clearer explanations and more actionable guidance for users. Changes: Shell Scripts (shared/common.sh): - Improve non-interactive mode error with better examples - Expand model ID validation to show valid characters and examples - Add detailed server name requirements with examples - Fix diagnostic function to handle cases without fixes section TypeScript CLI (cli/src/security.ts): - Enhance identifier validation with bullet points and examples - Add context about entity type (agent vs cloud) in errors - Improve path traversal error with specific character explanations - Better prompt validation messages with plain language guidance - Improve overly-long identifier/prompt errors with helpful context TypeScript CLI (cli/src/commands.ts): - Rewrite download failure messages to be more user-friendly - Change "Common causes" to "What's wrong" for clarity - Change "How to fix" to "What to do" for better action orientation - Add more specific troubleshooting steps for network issues - Improve wording to be less technical and more helpful Impact: - Users get clearer, more actionable error messages - Error messages now include examples of correct usage - Reduced cognitive load by using plain language instead of jargon - Better guidance for fixing issues without needing to consult docs Agent: ux-engineer Co-authored-by: Spawn Refactor Service <refactor@spawn.service> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 10:48:22 -05:00
A	dacb65785a	test: fix assertions to match actual function behavior (#1104 ) - generic_wait_for_instance: Fix IP address assertion by printing exported variable - local agent scripts: Update test to check for log_install_failed function calls - security encoding: Update error message assertion to match current validatePrompt output These fixes align test assertions with the actual implementation behavior, reducing test failures from 233 to 222. Agent: test-engineer Co-authored-by: Spawn Refactor Service <refactor@spawn.service> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 10:42:56 -05:00
A	b2c8c87435	test: fix test assertions to match updated error messages (#1101 ) Adjusts test expectations to handle recent UX improvements that changed error message formatting. Also adds support for variable-based test infrastructure detection in test-infra-sync.test.ts and includes missing cloud URL patterns for webdock, serverspace, and gcore. Agent: test-engineer Co-authored-by: Spawn Refactor Service <refactor@spawn.service> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 09:28:52 -05:00
A	baa60f3bd4	ux: improve security and validation error messages (#1097 ) Enhance error messages across validation and download failures to be more actionable and user-friendly: Security validation improvements (cli/src/security.ts): - validateIdentifier: Add examples of valid names, clearer length error - validateScriptContent: Improve empty script and shebang error messages - validatePrompt: Better guidance on prompt requirements and length limits - validatePromptFilePath: Clearer security warnings with concrete examples - validatePromptFileStats: More helpful messages for file size/empty errors Download failure improvements (cli/src/commands.ts): - reportDownloadFailure: Add "Common causes" section, better 404 guidance - reportDownloadError: Context-aware messages for timeout vs connection errors - validateNonEmptyString: Minor wording improvement All error messages now follow a consistent pattern: 1. What went wrong (clear, specific) 2. Why it might have happened (common causes) 3. How to fix it (numbered, actionable steps) Agent: ux-engineer Co-authored-by: Spawn Refactor Service <refactor@spawn.service> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 06:08:48 -08:00
A	8981376274	test: fix failing tests in agent-config-setup.test.ts (#1095 ) Fix 15 failing tests by implementing proper mock_run callbacks that handle path substitution for /tmp/spawn_config_* files and home directory paths. Updated all failing test cases to use sed-based path replacement before eval to correctly move configuration files to their final destinations. All 40 tests in agent-config-setup.test.ts now pass. Agent: test-engineer Co-authored-by: Spawn Refactor Service <refactor@spawn.service>	2026-02-14 05:47:45 -05:00
A	205f835411	ux: improve security and validation error messages (#1090 ) * ux: improve security and validation error messages Make error messages more user-friendly and actionable: Security validation errors: - Changed "contains invalid characters" to "Invalid agent: ..." with clearer formatting - Added context-specific guidance (spawn agents vs spawn clouds) - Replaced technical jargon with plain language - Changed "path traversal characters" to list specific disallowed characters Prompt validation errors: - Replaced "Prompt blocked: contains potentially dangerous pattern" with "Your prompt contains shell syntax that can't be safely processed" - Added specific suggestions for each pattern (e.g., 'Instead of "Fix $(ls)", try "Fix the output of ls command"') - Included helpful tip about using plain English instead of shell syntax Script download errors: - Replaced technical "must start with a valid shebang" message with bullet-point explanation of what went wrong - Added step-by-step "How to fix" section - More user-friendly language throughout Prompt file errors: - Changed "Refusing to read" to "Cannot use... as a prompt file" - Added clear "How to fix" with example commands - Better explanation of why certain paths are blocked All error messages now: - Start with what went wrong in plain language - Explain why it happened - Provide specific next steps to fix it - Use consistent formatting with bullet points and sections Agent: ux-engineer Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> * fix: Replace !! with ;; in gcore case branches in record.sh Addresses security review feedback. The !! syntax is invalid bash and broke the test recording infrastructure. -- refactor/pr-maintainer --------- Co-authored-by: Spawn Refactor Service <refactor@spawn.service> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 05:13:35 -05:00
A	a67f8dd837	test: Fix manifest consistency and improve test assertions (#1087 ) - Fix manifest.json matrix entries: change local/opencode and hostkey/open-interpreter from 'implemented' to 'missing' (scripts don't exist) - Rename agent entries in matrix to match actual agent keys (codex-cli→codex, gemini-cli→gemini, kilo→kilocode, open-interpreter→interpreter) - Update test assertions to match actual output formats (e.g., 'Extra argument ignored' instead of 'extra argument') - Fix shared-common-error-polling tests to check stderr output correctly - Simplify agent-config-setup tests to work within shell context limitations - Remove outdated install.sh test that expected non-existent 'WRAPPER' string - Ensure CLI dependencies are installed before test runs Co-authored-by: Spawn Refactor Service <refactor@spawn.service> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 01:10:03 -08:00
A	42b4dfc42e	refactor: reduce complexity in command functions (#1085 ) Extract large switch statement in getScriptFailureGuidance() into lookup tables and helpers for better maintainability. Break down renderCompactList() into separate helper functions for header, separator, and row rendering. This reduces cognitive complexity and makes the functions easier to test and modify. Agent: complexity-hunter Co-authored-by: Spawn Refactor Service <refactor@spawn.service> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 04:02:53 -05:00
A	58823bcc4f	fix(ux): show credential readiness in agents list, matrix, and preflight check (#1061 ) - `spawn agents` now shows "N ready" indicator when clouds have credentials - `spawn matrix` compact view adds a "Ready" column showing credential count - Preflight credential check gives context-specific guidance: mentions OAuth browser flow when only OPENROUTER_API_KEY is missing, improving clarity Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-14 01:55:52 -05:00
A	04e92fe727	test: add 178 tests for credential display, auth parsing, and internal helpers (#1063 ) Adds comprehensive test coverage for previously untested or weakly tested areas in commands.ts and index.ts: - formatCredStatusLine: newly exported function with zero prior tests - formatAuthVarLine (replicated): private helper for quick-start display - groupByType (replicated): private helper for clouds/agents grouping - formatCacheAge (replicated): version display cache age formatting - parseAuthEnvVars: 14 edge cases (CloudSigma format, short strings, hyphens) - hasCloudCredentials: 8 edge cases (empty values, multiple vars, none auth) - getImplementedClouds/getImplementedAgents: nonexistent/empty manifest cases - getMissingClouds: all/none/empty scenarios - calculateColumnWidth: boundary and empty array cases - prioritizeCloudsByCredentials/buildAgentPickerHints: empty/orphan agents - resolveDisplayName: null manifest, unknown keys - buildRecordLabel/buildRecordHint: null manifest, long prompts - formatRelativeTime/formatTimestamp: all time buckets + invalid input - getErrorMessage: null, undefined, boolean, object-without-message - levenshtein: empty strings, symmetry, insertions/deletions - findClosestMatch/findClosestKeyByNameOrKey: empty candidates, case sensitivity - resolveAgentKey/resolveCloudKey: display name resolution, empty string - checkEntity: swapped kind detection (agent as cloud, cloud as agent) - getStatusDescription: 404 vs other HTTP codes - isRetryableExitCode: all exit codes including empty/signal messages - buildRetryCommand: prompt length boundary (80 vs 81 chars), quote escaping - credentialHints: all-set, missing, custom verb - getSignalGuidance: all signals + dashboard URL - getScriptFailureGuidance: all exit codes + null + dashboard Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Haiku 4.5 <noreply@anthropic.com>	2026-02-14 01:49:41 -05:00
A	1af9878d0a	test: add 56 tests for shared/key-request.sh credential loading (#1057 ) Add comprehensive test coverage for shared/key-request.sh, which had zero existing test coverage. Tests cover: - get_cloud_env_vars: env var extraction from manifest (7 tests) - _parse_cloud_auths: manifest parsing for auth specs (6 tests) - _try_load_env_var: single var loading from JSON config (10 tests) - _load_cloud_credentials: multi-var credential loading (5 tests) - load_cloud_keys_from_config: full credential loader (8 tests) - invalidate_cloud_key: config deletion with path traversal guard (11 tests) - request_missing_cloud_keys: key server request behavior (4 tests) - Integration: end-to-end key loading scenarios (5 tests) Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Haiku 4.5 <noreply@anthropic.com>	2026-02-14 00:54:00 -05:00
A	44918b64c4	refactor: extract helpers to reduce function complexity (#1055 ) - Extract `readPromptFile` from `resolvePrompt` in index.ts (60 -> 40 lines), isolating prompt-file validation and reading into a standalone helper - Extract `formatCredStatusLine` from `buildCredentialStatusLines` in commands.ts, replacing repetitive set/not-set formatting with a reusable helper - Extract `_aliyun_validate_create_params` and `_aliyun_run_instances` from `create_server` in alibabacloud/lib/common.sh (69 -> 34 lines), separating validation, API call, and orchestration concerns Agent: complexity-hunter Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Haiku 4.5 <noreply@anthropic.com>	2026-02-14 00:41:39 -05:00
A	9336998168	fix(ux): add post-session summary to 10 exec-based cloud providers (#1056 ) Users on exec-based clouds (Fly, Render, Koyeb, Northflank, Railway, Modal, Daytona, E2B, CodeSandbox, GitHub Codespaces) got no warning when their session ended that their service was still running and incurring charges. This adds: - _show_exec_post_session_summary() in shared/common.sh for non-SSH providers that use CLI exec commands instead of direct SSH - SPAWN_DASHBOARD_URL for all 10 exec-based clouds so users get actionable dashboard links - Post-session summary calls in each cloud's interactive_session() - 33 new tests covering the exec post-session summary feature Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-14 00:38:10 -05:00
A	90c610a21d	refactor: simplify signal/failure guidance and credential formatting in CLI (#1053 ) Convert getSignalGuidance from switch statement to data-driven lookup table (SIGNAL_GUIDANCE), separating signal metadata from rendering logic. Extract optionalDashboardLine helper to deduplicate the conditional dashboard URL spreading in getScriptFailureGuidance. Extract formatCredentialIndicator from cmdClouds to clarify the nested ternary credential status formatting. All 92 script-failure-guidance tests and 216 related tests pass with zero regressions. Agent: complexity-hunter Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-14 00:33:50 -05:00
A	aca3ca0316	test: add 237 edge-case tests for core CLI modules (#1054 ) Covers edge cases in manifest.ts, security.ts, commands.ts, index.ts, and history.ts not addressed by existing tests or open PRs #1050/#1051. Key areas tested: - stripDangerousKeys: deep nesting, arrays, all-dangerous-keys objects - validatePromptFilePath: path traversal combos, all sensitive path patterns - validatePromptFileStats: boundary at exactly 1MB, empty files, non-files - validateIdentifier: boundary at 64/65 chars, various invalid characters - validateScriptContent: fork bomb, destructive ops, HTML error pages - validatePrompt: backtick detection, boundary at 10KB, injection patterns - checkEntity: swapped agent/cloud detection, typo correction - cmdHelp: content verification (all subcommands, flags, env vars, sections) - expandEqualsFlags: equals-in-value, empty value, mixed arg types - history: combined filters, case-insensitive filters, corrupted files - parseAuthEnvVars: multi-auth, short vars, non-env-var auth strings - buildRetryCommand: quote escaping, long prompt threshold - isRetryableExitCode: SSH exit 255 vs other codes - getSignalGuidance/getScriptFailureGuidance: all signal/exit code branches - hasCloudCredentials: multi-auth partial setup - formatRelativeTime: all time ranges including future timestamps Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 00:26:38 -05:00
A	b1c2ad0a0d	test: add 159 edge-case tests for commands.ts exported helpers (#1051 ) Comprehensive tests covering boundary conditions and interactions between exported helpers in commands.ts: - parseAuthEnvVars: unusual auth formats, multi-var, edge patterns - hasCloudCredentials: env var presence/absence, multi-var auth - resolveAgentKey/resolveCloudKey: display name resolution, case sensitivity, hyphenated keys, cross-kind rejection - getImplementedClouds/getImplementedAgents: large manifest fixtures - getMissingClouds: partial/full/zero implementation coverage - prioritizeCloudsByCredentials: mixed credential states, multi-var - buildAgentPickerHints: cloud counts, singular/plural, ready status - formatRelativeTime: future dates, extreme dates, invalid input - formatTimestamp: epoch, invalid, empty string - buildRetryCommand: quote escaping, long prompt threshold, newlines - isRetryableExitCode: all code paths including edge codes - getErrorMessage: Error, plain object, primitives, null, undefined - getSignalGuidance: all signal branches with/without dashboard URL - getScriptFailureGuidance: all exit codes with auth/dashboard hints - credentialHints: multi-var auth, partial/full/no credentials - resolveDisplayName: null manifest, unknown keys - buildRecordLabel/buildRecordHint: null manifest, prompt truncation - levenshtein: symmetry, empty strings, transposition - findClosestMatch: large candidate lists, empty list, case handling - calculateColumnWidth: empty array, single char, varying lengths Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Haiku 4.5 <noreply@anthropic.com>	2026-02-13 23:37:22 -05:00
A	44b9a5bdff	fix(security): harden weak crypto fallbacks, key validation, and temp paths (#1039 ) * fix(security): harden weak crypto fallbacks, key validation, and temp paths - CSRF state generation: fail instead of using predictable date+$RANDOM fallback when openssl and /dev/urandom are unavailable (OAuth CSRF bypass) - Kamatera password: fail instead of using predictable date-based password when no secure random source available - key-server validKeyVal: enforce 8-512 char limits and ASCII-only check to block malformed/oversized values (Fixes #969) - upload_config_file: use mktemp-derived randomness for remote temp paths instead of predictable $RANDOM (symlink attack on remote server) Agent: security-auditor Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix(test): update assertions for upload_config_file mktemp-derived paths The upload_config_file function now uses mktemp-derived basenames (spawn_config_tmp.XXX) instead of the original filename for remote temp paths. Update test/run.sh assertions to: - Match "spawn_config" in the -file upload path - Verify mv commands move files to correct final destinations (settings.json, .claude.json) Addresses reviewer feedback on PR #1039. Agent: pr-maintainer Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 21:43:37 -05:00
A	881067bf8b	test: add 50 tests for unified printQuickStart and buildDashboardHint helpers (#1042 ) (#1044 ) Cover the unified printQuickStart function and extracted buildDashboardHint helper from PR #1042. Tests verify: - buildDashboardHint edge cases: empty string URL fallback, very long URLs, consistency across signal types, per-exit-code dashboard URL inclusion - printQuickStart unified behavior via cmdCloudInfo: single-auth, no-agent, ready-to-go shortcut, non-parseable auth, none-auth - printQuickStart unified behavior via cmdAgentInfo: credential-prioritized cloud ordering, no-implementations, single-cloud ready-to-go - cmdCloudInfo agent list count display and metadata - cmdAgentInfo cloud list count display and metadata Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 21:43:23 -05:00
A	dde41b1357	refactor(cli): unify quick-start printing and extract dashboard hint helper (#1042 ) Merge printAgentQuickStart and printCloudQuickStart into a single printQuickStart function, eliminating duplicated credential-checking and auth-var-line printing logic. Extract buildDashboardHint from the identical pattern repeated in getSignalGuidance and getScriptFailureGuidance. Agent: complexity-hunter Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 20:54:13 -05:00
A	122b59e4da	test: add 99 tests for post-session summary and SPAWN_DASHBOARD_URL convention (#1040 ) Cover the _show_post_session_summary function and updated ssh_interactive_session integration from PR #1037. Tests verify: - Summary warns user their server is still running with IP - Dashboard URL shown when SPAWN_DASHBOARD_URL is set - Generic message when no dashboard URL is available - Reconnect command uses correct SSH_USER and IP - SSH exit code preserved through the summary display - All 25 SSH-based cloud providers set SPAWN_DASHBOARD_URL - SPAWN_DASHBOARD_URL uses HTTPS and is defined before usage - Detects custom interactive_session implementations missing summary (alibabacloud flagged as known gap) Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 20:46:40 -05:00
A	beceb69962	test: add 151 tests for key-server security-critical logic (#1036 ) Add comprehensive test coverage for the key-server (.claude/skills/setup-agent-team/key-server.ts), which previously had zero tests despite containing security-critical logic: - validKeyVal: API key validation (control chars, shell metacharacters, length limits) - 37 tests - SAFE_PROVIDER_RE: path traversal prevention in provider names - 21 tests - UUID_RE: batch ID format validation - 12 tests - signHmac/verifyHmac: HMAC signing and verification for signed URLs - 17 tests - isAuthed: timing-safe Bearer token auth - 9 tests - rateCheck: rate limiting logic - 8 tests - esc: HTML escaping for XSS prevention - 13 tests - cleanup: data store batch expiry logic - 9 tests - Key submission validation flow - 6 tests - Route matching, security headers, backward compat - 19 tests Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 20:11:35 -05:00
A	8c052aceb8	test: add 239 tests for CloudSigma provider patterns and conventions (#1030 ) Validates CloudSigma's unique architecture: region-based API URLs, HTTP Basic Auth (email + password), drive cloning workflow, python3 JSON construction, SSRF-preventing region validation, and SSH with 'cloudsigma' user. Covers lib/common.sh API surface, all 8 agent scripts, manifest consistency, and test infrastructure (mock.sh + record.sh). Agent: test-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 19:39:58 -05:00
A	059690f8d7	fix(ux): include cloud provider dashboard URLs in script failure and interrupt messages (#1029 ) When spawn scripts fail or are interrupted, error messages now include the cloud provider's actual dashboard URL instead of generic "check your cloud provider dashboard" text. This helps users quickly navigate to their provider to check server status, clean up orphaned resources, or debug provisioning failures. Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 16:01:57 -08:00
A	7d6bc0292b	fix(ux): add preflight credential check to interactive mode (#1027 ) The interactive flow (bare `spawn`) was missing the preflight credential warning that the direct `spawn <agent> <cloud>` path already had. Users who picked an agent and cloud interactively would not be warned about missing credentials, leading to confusing failures from the cloud provider script. Now both paths warn about missing credentials before launching. Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 18:52:03 -05:00
A	b6a07e3c60	fix: prevent sensitive file exfiltration via --prompt-file flag (#1024 ) Add path validation to --prompt-file to block reading sensitive files (SSH keys, cloud credentials, .env files, etc.) whose contents would be sent to remote agents. Also adds file size validation (1MB limit) and stat-based file type checking. Fixes #991 Agent: security-auditor Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 18:30:05 -05:00
A	b76f04cd78	fix(ux): show cloud count and credential readiness in interactive agent picker (#1025 ) When users run `spawn` interactively, the agent picker now shows how many clouds each agent supports and how many have credentials ready. This helps users quickly identify which agents they can deploy immediately. Before: "Claude Code AI coding assistant" After: "Claude Code 2 clouds, 1 ready" Agent: ux-engineer Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-13 18:29:25 -05:00

1 2 3 4 5 ...

351 commits