spawn

vrr/spawn

mirror of https://github.com/OpenRouterTeam/spawn.git synced 2026-04-28 03:49:31 +00:00

Author	SHA1	Message	Date
A	c487ea215f	refactor: move test fixtures to root /fixtures directory (#1849 ) * refactor: move test fixtures to root /fixtures directory Moves test/fixtures/ → fixtures/ at the repo root for easier discoverability. Updates all references in CLAUDE.md and QA prompt files. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * refactor: move install.ps1 to sh/cli/ and fixtures to root - Moves cli/install.ps1 → sh/cli/install.ps1 (consistent with install.sh) - Moves test/fixtures/ → fixtures/ at the repo root - Updates all references in README, CLAUDE.md, QA prompts, and the ps1 itself Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: lab <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-23 21:38:18 -08:00
A	b84adfb74e	refactor: move all shell scripts to /sh directory (#1843 ) Reorganizes the project so all shell scripts live under a dedicated /sh directory, enabling the OpenRouter rewrite URL to point at /sh/ instead of the repository root. Moves: - cli/install.sh → sh/cli/install.sh - shared/.sh → sh/shared/.sh - {cloud}/{agent}.sh → sh/{cloud}/{agent}.sh (48 scripts) - {cloud}/README.md → sh/{cloud}/README.md - e2e/.sh → sh/e2e/.sh - test/macos-compat.sh → sh/test/macos-compat.sh - test/fixtures/*/.sh → sh/test/fixtures/*/.sh Updates all references: - RAW_BASE path construction in commands.ts, update-check.ts - GitHub auth URL in agent-setup.ts - Self-referencing URLs in install.sh, github-auth.sh - CI workflow paths in lint.yml, cli-release.yml - Test file paths in install-script-validation, manifest-integrity - Documentation in README.md, cli/README.md, CLAUDE.md - QA scripts in .claude/skills/ Co-authored-by: lab <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-23 21:14:54 -08:00
A	430390bcf5	feat: enforce worktree-first workflow via PreToolUse hook (#1808 ) Replace the "block on main" hook with a stricter worktree check: edits are only allowed when the target file lives inside a git worktree, not the main checkout. Also blocks edits on the main branch even within a worktree. CLAUDE.md updated with the new worktree-first workflow instructions. Co-authored-by: lab <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-23 08:03:48 -08:00
A	aba55b3895	docs: enforce PR workflow for ALL changes, no exceptions (#1793 ) * docs: enforce PR workflow for ALL changes, no exceptions Rewrites the "Draft PR First" section to be unambiguous: - Every file edit (including CLAUDE.md itself) requires a PR - Explicit list of change types that are NOT exempt - Step-by-step workflow: branch → change → commit → draft PR → merge - Finished PRs must be converted from draft and merged immediately Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * remove unnecessary "don't commit to main" warning Branch protection already prevents direct commits — no need to restate it as a rule. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: add branch-check-first workflow and stash recovery Agents must check their branch before editing files. If on main, branch first. If they already have uncommitted changes on main, stash → branch → unstash. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: add PreToolUse hook to block edits on main branch - Adds a PreToolUse hook that exits 2 (blocks) any Write/Edit when the current branch is main, with a clear error message telling the agent to create a branch first - Updates CLAUDE.md to reference the hook and use cherry-pick (not stash) for recovering commits made on main by mistake Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: lab <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-22 23:22:20 -08:00
A	b62dc1af33	feat: ban `as` type assertions, add runtime schema validation with valibot (#1775 ) * fix: resolve all biome lint warnings across the codebase - Replace all noExplicitAny with proper types (unknown, Record<string, unknown>) - Fix useBlockStatements in picker.ts (braceless if) - Fix useNumberNamespace in picker.ts (parseInt → Number.parseInt) - Codebase now passes biome lint with 0 errors and 0 warnings Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: ban `as` type assertions, add runtime schema validation with valibot Replace all ~170 unsafe `as` type assertions across the entire codebase (production + tests) with runtime-validated alternatives: - Add GritQL biome plugin (`no-type-assertion.grit`) that bans all `as` casts except `as const` - Add valibot for schema-validated JSON parsing (`parseJsonWith`) - Add shared utilities: `parse.ts` (schema parsing), `type-guards.ts` - Replace `as` casts in all 5 cloud modules (aws, daytona, hetzner, digitalocean, fly) with valibot schemas + type guards - Replace `as` casts in shared modules (manifest, update-check, oauth, commands, history, ui) - Replace `as any` in all 26 test files with proper `new Response()` mocks and typed variables - Add 13 tests for parseJsonWith/parseJsonRaw - Add "Embrace Bold Changes" culture rule to CLAUDE.md - Bump version 0.6.19 → 0.7.0 1859 tests pass, 0 lint errors across 95 files, bundle +6KB from valibot. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * refactor: move GritQL plugin into cli/lint/ directory Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: lab <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-22 18:50:53 -08:00
A	ec210e37af	fix: Result monad for retry logic — prevent duplicate server creation (#1771 ) * fix: Result monad for retry logic — prevent duplicate server creation SSH exit 255 after an interactive session caused runWithRetries to retry the entire bash script, creating duplicate servers. The old withRetry also blindly retried all errors including timeouts where the remote command may have already completed. Introduces a Result<T> monad (Ok/Err) so callers explicitly signal whether a failure is retryable (return Err) or fatal (throw). Adds wrapSshCall() that classifies SSH errors: transient connection failures are retryable, timeouts are not. Removes retry loop from the top-level script runner entirely since it spans server creation + interactive session. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * docs: mandate draft-PR-first workflow for all changes Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: add biome lint to CI and pre-commit hook, fix lint violations - Add Biome lint job to .github/workflows/lint.yml - Add TypeScript lint check to .githooks/pre-commit - Fix useBlockStatements violations in ui.ts and tests - Add biome lint to CLAUDE.md "After Each Change" checklist Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * refactor: rename Result.value to Result.data Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: clean up stale pre-commit hook - Remove dead check for deleted functions (write_oauth_response_file, create_oauth_response_html) — they no longer exist in the codebase - Fix early exit skipping Biome lint when no .sh files are staged - Replace echo -e with printf (the hook was using the pattern it bans) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: resolve biome lint errors blocking CI - Fix useImportType: import { type Result } → import type { Result } - Fix noUnusedImports: remove unused KNOWN_FLAGS import - Fix noUnusedTemplateLiteral: template literal → string literal Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: lab <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-22 20:39:42 -05:00
A	5524559a9d	fix: remove stale test/shell refs from prompts, add issue-filing rule (#1731 ) - CLAUDE.md: remove Mock Test Infrastructure section, update testing docs to unit-test-only, fix file structure, add "Filing Issues for Discovered Problems" rule - discovery-team-prompt.md: test/record.sh + test/mock.sh → bun test - security-review-all-prompt.md: remove "verify source fallback pattern" Co-authored-by: lab <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-22 11:37:42 -08:00
A	60986e5a05	refactor: remove shared/common.sh and 27 subprocess-heavy test files (#1728 ) shared/common.sh (3852 lines) was dead code — the entire architecture was rewritten to TypeScript in cli/src/. No agent scripts source it anymore. The only consumer was github-auth.sh which just needed 4 log functions (now inlined). Remove 27 test files that spawned ~800+ real bash/bun subprocesses per run (the root cause of slow bun test). Every shared-common-*.test.ts file forked a real bash shell per test case to source shared/common.sh. CLI subprocess tests spawned `bun run index.ts` per assertion. These were integration tests, not unit tests. Also removes: - mock-tests CI job from test.yml (ran test/mock.sh which opens browser) - Stale plan files referencing deleted infrastructure - All CLAUDE.md/README.md references to the old lib/common.sh pattern Co-authored-by: lab <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-22 11:32:27 -08:00
A	d79c941593	feat: draft PR workflow + stale PR cleanup for agent teams (#1706 ) Agents now open draft PRs immediately after first commit and convert to non-draft when work is complete, enabling visibility into in-progress work. Security reviewer skips draft PRs. Stale drafts (7+ days) are auto-closed. Refactor pr-maintainer picks up stale non-draft PRs (3+ days). Co-authored-by: lab <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-22 09:28:12 -08:00
A	c55483d16d	docs: add ESM-only rule to CLAUDE.md — never use require/CJS (#1639 ) Co-authored-by: lab <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-21 15:47:34 -08:00
A	d7ff0739a2	fix: fly auth token deprecated + org picker + macaroon tokens (#1603 ) * fix: fly auth token deprecated + org picker + macaroon discharge tokens Three fixes for the fly/ TypeScript provider: 1. `fly auth token` is deprecated — newer flyctl outputs a message, not a token. Now tries `fly tokens create org --expiry 24h` first, with `fly auth token` as fallback. Uses org tokens (not deploy) since spawn needs to create new apps. 2. Token sanitization stripped macaroon discharge tokens at commas (`fm2_[^ ,]` → `fm2_\S+`). The full composite token `fm2_xxx,fm2_yyy,fo1_zzz` is now preserved. 3. Org picker upgraded from numbered 1/2 input to arrow-key interactive selector with cursor navigation, scroll windowing, and fallback to numbered list when TTY is unavailable. Also fixes: testFlyToken fallback sent `Bearer FlyV1 ...` (double prefix) for macaroon tokens — now dispatches FlyV1 vs Bearer correctly. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> docs: never run test/mock.sh locally — opens browser, CI only Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: lab <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-21 11:06:19 -08:00
A	a0da57d559	docs: add CLAUDE.md rule — always use bun+ts for inline scripting, never python (#1595 ) Co-authored-by: Claude <claude@anthropic.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-21 08:46:46 -08:00
A	8a4f5873f9	feat: remove Oracle Cloud, add featured_cloud per agent (#1430 ) Oracle Cloud is removed as a supported provider. Each agent now has a `featured_cloud` field in manifest.json that controls cloud sort order in the CLI picker — featured clouds appear after credential-detected clouds but before CLI-installed ones, with a "recommended" hint. Co-authored-by: lab <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-17 22:52:41 -08:00
A	c4eccbd72f	feat: prioritize clouds with CLI installed + hcloud CLI integration (#1375 ) * fix: auto-run gcloud auth login on expired GCP tokens Instead of telling users to run `gcloud auth login` manually, just run it automatically when auth check fails or instance creation hits a reauthentication error, then retry. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: prioritize clouds with CLI installed + hcloud CLI integration When selecting a cloud provider, clouds are now sorted in 3 tiers: 1. Credentials detected (env vars set) — top priority 2. CLI installed (e.g., gcloud, hcloud, aws) — middle priority 3. Neither — default order Also adds hcloud CLI-first support for Hetzner operations (server create/delete/list, SSH key management, auth) with automatic fallback to the existing REST API when hcloud is not available. Closes #1370 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * refactor: rename aws-lightsail to aws across the project Simplifies the cloud key from "aws-lightsail" to "aws" — AWS should have a single entry regardless of the underlying service used. Renames the directory, updates manifest.json matrix keys, CLI map, test fixtures, README, and all agent scripts. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: lab <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-16 20:12:35 -08:00
A	01ed74ba95	fix: Hetzner Claude Code installation + add --debug mode (#1198 ) Fixed Hetzner installation issue where curl to claude.ai/install.sh was returning 403 errors. Added fallback to use bun (already installed by cloud-init) to install Claude Code. Also added --debug flag to enable verbose bash output (set -x) for easier troubleshooting. Changes: - hetzner/claude.sh: Use bun fallback installation method - CLI: Added --debug flag support (v0.2.86) - shared/common.sh: Enable set -x when SPAWN_DEBUG=1 Co-authored-by: lab <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-15 16:37:04 -08:00
A	3db288c3dd	feat: trim to 9 curated launch clouds, upvote-driven discovery (#1184 ) Reduce from 41 cloud providers to 10 (9 + local) curated for launch: - local (free), oracle (free tier), hetzner (~€3.29/mo), ovh (~€3.50/mo), fly (free tier), aws-lightsail ($3.50/mo), daytona (pay-per-second), digitalocean ($4/mo), gcp ($7.11/mo), sprite (Fly.io VMs) Changes: - Remove 30 cloud directories, test fixtures, and provider-specific tests - Slim manifest.json from 600 to 150 matrix entries, sorted by price - Update CLAUDE.md with higher bar for adding clouds (prestige + pricing) - Transform discovery service from code-implementing team to upvote-driven demand tracker that creates proposal issues and only implements when a proposal reaches 50+ upvotes - Create GitHub issue #1183 as cloud wishlist with all dropped clouds - Add discovery-team/cloud-proposal/agent-proposal labels - Protect discovery-team issues from refactor team (no comments/changes) - Fix all CLI tests (8034 pass, 0 fail) and shell tests (80 pass, 0 fail) Co-authored-by: lab <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-15 00:19:39 -08:00
Ahmed Abushagur	353f20d53a	docs: expand test infrastructure instructions for discovery bot (#987 ) The bot was under-updating test/mock.sh when adding new clouds because the prompt only mentioned URL stripping. Now lists all 4 required mock.sh functions and all 5 required record.sh functions explicitly. Also adds a "Mock Test Infrastructure" reference table to CLAUDE.md so both human contributors and bots know exactly what to update. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-13 11:41:25 -08:00
L	13575a7a20	feat: Add PR comment triage to security reviewer and refactor pr-maintainer (#940 ) Both the security team's pr-reviewer and the refactor team's pr-maintainer now read PR comments before passing verdict. If comments indicate a PR is superseded, duplicate, or abandoned, they close it instead of reviewing. Security reviewer: new step 2 (comment-based triage) before staleness check, steps renumbered 3→4 through 11→12, verdict options include closed-duplicate. Refactor pr-maintainer: reads comments first, closes when clearly indicated, rule updated from "NEVER close" to "only close when comments justify it". Also updates CLAUDE.md dual-mode table: Agents → Teammates. Co-authored-by: Security Reviewer <security-reviewer@spawn.dev> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-13 06:25:45 -08:00
Sprite	aff3000d01	rename: setup-trigger-service -> setup-agent-team Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-12 21:38:58 +00:00
Ahmed Abushagur	8b9f9a0e5a	QA-Bot setup (#335 ) * feat: testing * feat: auto-fix dead apis * fix: mock works * feat: new fixtures * fix: more clouds tested * fix: dry run fix * fix: civo valid size * fix: civo result wait * feat: fixtures * feat: per cloud agent	2026-02-10 19:51:07 -08:00
B	3d0ac5e562	docs: Replace 'coding agent' with 'agents using remote API inference' Clarifies that spawn agents use remote LLM APIs, not local inference, which is why cheap CPU instances suffice and GPU clouds are unnecessary. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-11 00:26:00 +00:00
B	ae7a999217	docs: Focus cloud discovery on cheap CPU compute, not GPU clouds Spawn runs coding agents that call LLM APIs — they need affordable CPU instances with SSH access, not expensive GPU VMs. Updated the discovery prompts and CLAUDE.md to explicitly avoid GPU clouds and prioritize budget VPS providers and container platforms. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-11 00:24:15 +00:00
B	1029320cff	refactor: Rename improve to discovery and remove improve CLI command Rename the GitHub workflow, scripts, and service from "improve" to "discovery" to better reflect what the automation does. Remove the `spawn improve` CLI command entirely — the discovery/refactor loops are internal automation, not user-facing CLI features. File renames: - .github/workflows/improve.yml → discovery.yml - .claude/skills/.../improve.sh → discovery.sh - .claude/skills/.../start-improve.sh → start-discovery.sh - Service: improve-trigger → discovery-trigger Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-10 16:13:56 +00:00
Sprite	875e9adb6b	chore: Bump version to 0.2.5 and add version bump policy - Bumped CLI version from 0.2.4 to 0.2.5 - Added rule to CLAUDE.md: ANY change to cli/ requires a version bump - Uses semantic versioning (patch for fixes, minor for features, major for breaking) - Auto-update ensures users get latest version immediately Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-10 07:24:28 +00:00
A	30f19b7df6	refactor: Drop spawn.sh bash fallback, auto-install bun instead (#163 ) The 663-line bash CLI (spawn.sh) has drifted from the TypeScript CLI, missing --prompt, security validation, download fallback, and other features. Rather than maintaining two implementations, the installer now auto-installs bun (~5 seconds) when it's not present, ensuring every user gets the full-featured TypeScript CLI. - Remove cli/spawn.sh (663 lines) - Simplify install.sh: remove npm method, add bun auto-install - Extract build_and_install() helper to deduplicate build logic - Update cli/README.md and CLAUDE.md to reflect bun-only strategy Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-09 22:30:48 -08:00
A	de46a17d00	docs: Add refactoring service architecture guide to CLAUDE.md (#158 ) Documents the dual-mode cycle system (issue vs refactor), concurrency model, worktree isolation, and guidance for modifying the service. Also adds trigger service files to the file structure convention. Co-authored-by: A <6723574+louisgv@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-09 22:16:27 -08:00
B	34f992630f	docs: Update CLAUDE.md to say AI agents, not coding agents Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-09 18:04:09 +00:00
L	803f9de9bf	Enforce PR merge-or-close-with-comment policy (#50 ) PRs created by autonomous loops must always be either merged or closed with a comment explaining why. Updates improve.sh, refactor.sh, and CLAUDE.md to enforce this consistently. Co-authored-by: Sprite <noreply@sprite.dev> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-07 23:12:51 -08:00
Sprite	85c3d92946	docs: Remove TESTING_NON_INTERACTIVE.md and add documentation policy - Remove testing documentation file per documentation policy - Add rule to CLAUDE.md: docs must go in .docs/ directory (git-ignored) - Only README.md, CLAUDE.md, and cloud READMEs allowed in repo Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-08 05:30:21 +00:00
Sprite	288d191320	refactor: Migrate tests from vitest to bun:test and add testing rules - Convert all test files to use bun:test instead of vitest - Update CLAUDE.md to prohibit vitest, mandate bun:test - Replace vi.fn() with mock() from bun:test - Replace vi.spyOn with spyOn from bun:test - Note: commands.test.ts needs module mocking refactor (TODO) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-08 04:29:37 +00:00
Sprite	70bb72b1dd	Refocus discovery: clouds over agents, require community buzz improve.sh: - Cloud Scouts get 2 slots, Agent Scout gets 1 (and only if justified) - Agent discovery requires evidence: 1000+ GH stars, 50+ HN points, 100+ Reddit upvotes — at least 2 of these - New Issue Responder role checks gh issues for user requests - Agents must be searched on HN Algolia API, Reddit, GitHub before adding - Priority order: fill gaps > new clouds > new agents > issue triage CLAUDE.md: - Reordered: clouds are priority #2, agents are #3 with strict gate - Added community buzz requirements with specific thresholds - Added GitHub issue response as #4 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-08 04:17:45 +00:00
L	b6ee6b6ab1	Add guardrails: CLAUDE.md rules, hooks, pre-commit validation (#33 ) * feat: add gptme agent to spawn matrix Add gptme (https://github.com/gptme/gptme) - a personal AI agent in the terminal with tools for code editing, terminal commands, web browsing, and more. Natively supports OpenRouter via OPENROUTER_API_KEY. - Add gptme agent entry to manifest.json with OpenRouter env vars - Implement sprite/gptme.sh deployment script - Implement hetzner/gptme.sh deployment script - Add "missing" matrix entries for remaining 8 clouds - Update README.md with usage instructions for Sprite and Hetzner Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: add Fly.io cloud provider with claude and aider agents Add Fly.io as a new cloud provider using the Machines REST API for provisioning and flyctl CLI for SSH access. Docker-based machines with pay-per-second pricing. - Create fly/lib/common.sh with Fly.io Machines API integration - Implement fly/claude.sh for Claude Code deployment - Implement fly/aider.sh for Aider deployment - Update README.md with Fly.io usage instructions and env vars Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: add gemini, amazonq, cline, gptme to Fly.io Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: add openclaw, nanoclaw, goose, codex, interpreter to Fly.io Implements 5 new agent scripts for the Fly.io cloud provider: - fly/openclaw.sh: OpenClaw with gateway + TUI, model selection, config - fly/nanoclaw.sh: NanoClaw WhatsApp agent with .env configuration - fly/goose.sh: Block's Goose agent with OpenRouter provider - fly/codex.sh: OpenAI Codex CLI with OpenRouter base URL override - fly/interpreter.sh: Open Interpreter with OpenRouter base URL override All scripts follow the Fly.io pattern (flyctl-based, no IP args for run_server/interactive_session) and use upload_file for env injection. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: add gptme agent to 8 remaining clouds Implement gptme agent scripts for digitalocean, vultr, linode, lambda, aws-lightsail, gcp, e2b, and modal. Each script follows the exact pattern of that cloud's existing aider.sh, adapted for gptme's install and launch commands. Updates manifest.json matrix entries from "missing" to "implemented". Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Add guardrails from insights: CLAUDE.md rules, hooks, pre-commit Based on usage insights analysis: CLAUDE.md: - Shell script rules: curl\|bash compat, macOS bash 3.x compat - Autonomous loop rules: test after each iteration, never revert fixes - Git workflow rules: always use feature branches .claude/settings.json: - PostToolUse hook validates .sh files on every Write/Edit: syntax check, no relative source, no echo -e, no set -u .githooks/pre-commit: - Blocks commits with: syntax errors, relative sources, echo -e, set -euo, references to deleted functions - Install: git config core.hooksPath .githooks README.md: - Added developer setup section with hook installation Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Sprite <noreply@sprite.dev> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-07 20:02:19 -08:00
L	4087deb14e	Drop nounset (set -u) flag — incompatible with env var checks (#27 ) The autonomous refactoring added `set -euo pipefail` but the scripts check optional env vars with `[[ -n "$VAR" ]]` which is a fatal error under nounset when the var isn't set (e.g. SPRITE_NAME, OPENROUTER_API_KEY). Fix: downgrade to `set -eo pipefail` across all 42 affected files. Co-authored-by: Sprite <noreply@sprite.dev> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-07 16:22:04 -08:00
L	3fb2e77b03	Autonomous refactoring: 5 rounds, ~1,400 lines eliminated, production-ready Five rounds of autonomous AI agent team refactoring with security fixes, code consolidation, and expanded test coverage.	2026-02-08 00:06:46 +00:00
L	6ac59e6bb3	Fix OAuth server for macOS bash 3.x (#24 ) Three issues broke the OAuth callback server on macOS: 1. echo -e doesn't work in bash 3.x — \r\n appears as literal text in the HTTP response, browser gets malformed headers. Fix: pre-write response with printf to a file before the subshell. 2. local variables inside ( ... ) & subshell — undefined behavior in bash 3.x since subshells aren't function scope. Fix: use plain variables in subshells. 3. ((elapsed++)) when elapsed=0 evaluates to falsy — set -e kills the script on the first iteration of the timeout loop. Fix: use elapsed=$((elapsed + 1)) instead. Also simplified nc_listen detection to only check for BusyBox (the -p flag check could misfire on macOS nc). Applied to all 10 lib/common.sh files. Co-authored-by: Sprite <noreply@sprite.dev> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-07 14:21:47 -08:00
L	7940d43169	Implement Claude Code Agent Teams for continuous improvement (#20 ) Rewrites improve.sh to use the experimental Agent Teams feature: - Lead coordinates in delegate mode (never touches code) - Teammates work in parallel: Gap Fillers, Agent Scouts, Cloud Scouts - Shared task list for self-claiming work - Plan approval required for cloud provider work (lib/common.sh) CLAUDE.md updated with team role definitions and coordination rules. .claude/settings.json enables CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS. Usage: ./improve.sh # one team cycle ./improve.sh --loop # continuous team cycles ./improve.sh --single # old single-agent mode (fallback) Co-authored-by: Sprite <noreply@sprite.dev> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-07 13:54:27 -08:00
L	6002e6c7f7	Add continuous improvement workflow for spawn matrix (#5 ) - manifest.json: tracks agents x clouds matrix with metadata - CLAUDE.md: instructions for Claude Code to fill gaps and discover new agents/clouds - improve.sh: loop script that launches Claude Code to expand the matrix Current matrix: 3 agents (claude, openclaw, nanoclaw) x 2 clouds (sprite, hetzner) with 2 gaps remaining (hetzner/openclaw, hetzner/nanoclaw). Co-authored-by: Sprite <noreply@sprite.dev> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-02-07 08:47:21 -08:00

37 commits