ruvector/scripts
rUv bd1e253755 feat(brain): ADR-130 service split — SSE proxy, worker, internal queue
* fix(brain): SSE connection limiter, pipeline rate limit, Firestore pagination fallback (ADR-130)

Three fixes for recurring pi.ruv.io outages:

1. SSE connection limiter (max 50) — prevents MCP reconnect storms from
   exhausting Cloud Run concurrency slots. Tracks active count with
   AtomicUsize, rejects excess with 429.

2. Pipeline optimize rate limiter — max 1 concurrent request with 30s
   cooldown. Prevents scheduler thundering herd from CPU-saturating
   the instance.

3. Firestore pagination offset fallback — when page tokens go stale
   after OOM restart (400 Bad Request), switches to offset-based
   pagination to load all documents instead of stopping at first batch.

Also adds /v1/ready lightweight probe (zero-cost, no state access)
for Cloud Run health checks.

ADR-130 documents the full decoupling architecture (SSE service split).

Co-Authored-By: claude-flow <ruv@ruv.net>

* feat(brain): ADR-130 service split — SSE proxy, worker binary, internal queue

Implements full MCP SSE decoupling to eliminate recurring outages:

1. ruvbrain-sse: Thin SSE proxy (308 lines) that manages MCP connections
   independently from the API. Max 200 concurrent SSE, forwards JSON-RPC
   to the API, polls /internal/queue/drain for responses. No business logic.

2. ruvbrain-worker: Batch worker binary (202 lines) for Cloud Run Jobs.
   Runs scheduler actions (train, drift, transfer, graph, cleanup, attractor)
   with direct Firestore access. Runs once and exits.

3. Internal queue endpoints on the API:
   - POST /internal/queue/push (forward JSON-RPC to session)
   - GET /internal/queue/drain (poll for responses)
   - POST /internal/session/create (register session)
   - DELETE /internal/session/:id (cleanup)

4. Deploy infrastructure:
   - Dockerfile.sse, Dockerfile.worker
   - cloudbuild-sse.yaml, cloudbuild-worker.yaml
   - scripts/deploy_brain_services.sh [api|sse|worker|all]

Architecture: SSE (500 concurrency, 512MB) → API (80 concurrency, 4GB) ← Worker (Cloud Run Job, 4GB)

Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-30 11:54:01 -04:00
..
benchmark chore: reorganize scripts into subfolders 2026-01-21 23:48:37 -05:00
build feat: Update NAPI-RS bindings with new capabilities (v0.1.15) 2025-11-26 18:47:48 +00:00
ci chore: reorganize scripts into subfolders 2026-01-21 23:48:37 -05:00
deploy chore: reorganize scripts into subfolders 2026-01-21 23:48:37 -05:00
patches/hnsw_rs Add WebAssembly binary and TypeScript definitions for rvlite 2025-12-25 19:50:53 +00:00
publish chore: reorganize scripts into subfolders 2026-01-21 23:48:37 -05:00
test chore: reorganize scripts into subfolders 2026-01-21 23:48:37 -05:00
training feat(training): ADR-129 RuvLTRA training pipeline — calibration, SFT, benchmarks, HF publishing 2026-03-30 07:58:07 -04:00
validate chore: reorganize scripts into subfolders 2026-01-21 23:48:37 -05:00
analyze-evolution.js feat: DrAgnes + Common Crawl WET + Gemini grounding agents (#282) 2026-03-23 10:12:50 -04:00
analyze-ham10000.js feat: DrAgnes + Common Crawl WET + Gemini grounding agents (#282) 2026-03-23 10:12:50 -04:00
build-solver.sh feat: Implement complete sublinear-time sparse solver crate 2026-02-20 06:49:14 +00:00
check_brain_status.sh feat: add brain training and status scripts for pi.ruv.io 2026-03-16 23:14:43 -04:00
create-brainpedia.py fix: ruvector-postgres v0.3.1 — audit bug fixes, 46 SQL functions, Docker publish (#227) 2026-03-03 12:53:10 -05:00
deploy-crawl-phase1.sh feat: DrAgnes + Common Crawl WET + Gemini grounding agents (#282) 2026-03-23 10:12:50 -04:00
deploy-dragnes.sh feat: DrAgnes + Common Crawl WET + Gemini grounding agents (#282) 2026-03-23 10:12:50 -04:00
deploy-gemini-agents.sh feat: DrAgnes + Common Crawl WET + Gemini grounding agents (#282) 2026-03-23 10:12:50 -04:00
deploy-wet-job.sh feat: DrAgnes + Common Crawl WET + Gemini grounding agents (#282) 2026-03-23 10:12:50 -04:00
deploy_brain_services.sh feat(brain): ADR-130 service split — SSE proxy, worker, internal queue 2026-03-30 11:54:01 -04:00
deploy_trainer.sh feat: update ADR-093 + add deploy_trainer.sh for Cloud Run scheduling 2026-03-16 23:14:43 -04:00
discover_and_train.sh feat: discover ↔ train feedback loop with live API discovery 2026-03-16 23:16:24 -04:00
gemini-agents.js feat: DrAgnes + Common Crawl WET + Gemini grounding agents (#282) 2026-03-23 10:12:50 -04:00
generate-rvf-manifest.py fix: ruvector-postgres v0.3.1 — audit bug fixes, 46 SQL functions, Docker publish (#227) 2026-03-03 12:53:10 -05:00
historical-crawl-import.sh feat: DrAgnes + Common Crawl WET + Gemini grounding agents (#282) 2026-03-23 10:12:50 -04:00
publish-rvf.sh feat(rvf): RuVector Format — Universal Cognitive Container SDK (#166) 2026-02-14 13:14:49 -05:00
README.md chore: reorganize scripts into subfolders 2026-01-21 23:48:37 -05:00
run_mincut_bench.sh feat: Add min-cut gating experiment scaffolding (WIP) 2026-02-20 06:52:43 +00:00
seed-brain-all.py fix: ruvector-postgres v0.3.1 — audit bug fixes, 46 SQL functions, Docker publish (#227) 2026-03-03 12:53:10 -05:00
seed-brain.rs fix: ruvector-postgres v0.3.1 — audit bug fixes, 46 SQL functions, Docker publish (#227) 2026-03-03 12:53:10 -05:00
seed-specialized.py fix: ruvector-postgres v0.3.1 — audit bug fixes, 46 SQL functions, Docker publish (#227) 2026-03-03 12:53:10 -05:00
setup-gcs-examples.sh fix: ruvector-postgres v0.3.1 — audit bug fixes, 46 SQL functions, Docker publish (#227) 2026-03-03 12:53:10 -05:00
sql-audit-v3.sql fix: ruvector-postgres v0.3.2 — 100% audit pass (HNSW + hybrid fixes) (#230) 2026-03-03 13:21:48 -05:00
swarm_train_15.sh feat: 15-agent concurrent discovery swarm with 12 new data sources 2026-03-16 23:16:24 -04:00
sync-lockfile.sh style: apply rustfmt across entire codebase 2026-01-28 17:00:26 +00:00
train-lora.py fix: ruvector-postgres v0.3.1 — audit bug fixes, 46 SQL functions, Docker publish (#227) 2026-03-03 12:53:10 -05:00
train_brain.sh feat: brain trainer core module + auth fix — 56 discoveries ingested 2026-03-16 23:14:43 -04:00
training_orchestrator.sh update: training orchestrator with improved PII stripping and color output 2026-03-16 23:21:01 -04:00
upvote_memories.py fix(brain): defer sparsifier build on startup for large graphs 2026-03-24 12:29:52 +00:00
vote-boost.py fix: ruvector-postgres v0.3.1 — audit bug fixes, 46 SQL functions, Docker publish (#227) 2026-03-03 12:53:10 -05:00
wet-filter-inject.js feat: DrAgnes + Common Crawl WET + Gemini grounding agents (#282) 2026-03-23 10:12:50 -04:00
wet-full-import.sh feat: DrAgnes + Common Crawl WET + Gemini grounding agents (#282) 2026-03-23 10:12:50 -04:00
wet-job.yaml feat: DrAgnes + Common Crawl WET + Gemini grounding agents (#282) 2026-03-23 10:12:50 -04:00
wet-orchestrate.sh feat: DrAgnes + Common Crawl WET + Gemini grounding agents (#282) 2026-03-23 10:12:50 -04:00
wet-processor.sh feat: DrAgnes + Common Crawl WET + Gemini grounding agents (#282) 2026-03-23 10:12:50 -04:00

RuVector Automation Scripts

This directory contains automation scripts organized by purpose.

📁 Directory Structure

scripts/
├── README.md           # This file
├── benchmark/          # Performance benchmarking
├── build/              # Build utilities
├── ci/                 # CI/CD automation
├── deploy/             # Deployment scripts
├── patches/            # Patch files
├── publish/            # Package publishing
├── test/               # Testing scripts
└── validate/           # Validation & verification

🚀 Deployment

Scripts for deploying to production.

Script Description
deploy/deploy.sh Comprehensive deployment (crates.io + npm)
deploy/test-deploy.sh Test deployment without publishing
deploy/DEPLOYMENT.md Full deployment documentation
deploy/DEPLOYMENT-QUICKSTART.md Quick deployment guide

Usage:

# Full deployment
./scripts/deploy/deploy.sh

# Dry run
./scripts/deploy/deploy.sh --dry-run

# Test deployment
./scripts/deploy/test-deploy.sh

📦 Publishing

Scripts for publishing packages to registries.

Script Description
publish/publish-all.sh Publish all packages
publish/publish-crates.sh Publish Rust crates to crates.io
publish/publish-cli.sh Publish CLI package
publish/publish-router-wasm.sh Publish router WASM package
publish/check-and-publish-router-wasm.sh Check and publish router WASM

Usage:

# Set credentials first
export CRATES_API_KEY="your-crates-io-token"
export NPM_TOKEN="your-npm-token"

# Publish all
./scripts/publish/publish-all.sh

# Publish crates only
./scripts/publish/publish-crates.sh

📊 Benchmarking

Performance benchmarking scripts.

Script Description
benchmark/run_benchmarks.sh Run core benchmarks
benchmark/run_llm_benchmarks.sh Run LLM inference benchmarks

Usage:

# Run core benchmarks
./scripts/benchmark/run_benchmarks.sh

# Run LLM benchmarks
./scripts/benchmark/run_llm_benchmarks.sh

🧪 Testing

Testing and validation scripts.

Script Description
test/test-wasm.mjs Test WASM bindings
test/test-graph-cli.sh Test graph CLI commands
test/test-all-graph-commands.sh Test all graph commands
test/test-docker-package.sh Test Docker packaging

Usage:

# Test WASM
node ./scripts/test/test-wasm.mjs

# Test graph CLI
./scripts/test/test-graph-cli.sh

Validation

Package and build verification scripts.

Script Description
validate/validate-packages.sh Validate package configs
validate/validate-packages-simple.sh Simple package validation
validate/verify-paper-impl.sh Verify paper implementation
validate/verify_hnsw_build.sh Verify HNSW build

Usage:

# Validate packages
./scripts/validate/validate-packages.sh

# Verify HNSW
./scripts/validate/verify_hnsw_build.sh

🔄 CI/CD

Continuous integration scripts.

Script Description
ci/ci-sync-lockfile.sh Auto-fix lock files in CI
ci/sync-lockfile.sh Sync package-lock.json
ci/install-hooks.sh Install git hooks

Usage:

# Install git hooks (recommended)
./scripts/ci/install-hooks.sh

# Sync lockfile
./scripts/ci/sync-lockfile.sh

🛠️ Build

Build utility scripts located in build/.

🩹 Patches

Patch files for dependencies located in patches/.

🚀 Quick Start

For Development

  1. Install git hooks (recommended):

    ./scripts/ci/install-hooks.sh
    
  2. Run tests:

    ./scripts/test/test-wasm.mjs
    

For Deployment

  1. Set credentials:

    export CRATES_API_KEY="your-crates-io-token"
    export NPM_TOKEN="your-npm-token"
    
  2. Dry run first:

    ./scripts/deploy/deploy.sh --dry-run
    
  3. Deploy:

    ./scripts/deploy/deploy.sh
    

🔐 Security

Never commit credentials! Always use environment variables or .env file.

See deploy/DEPLOYMENT.md for security best practices.