10 KiB
Ops & Deploy — Global Fix Map
🏥 Quick Return to Emergency Room
You are in a specialist desk.
For full triage and doctors on duty, return here:
- WFGY Global Fix Map — main Emergency Room, 300+ structured fixes
- WFGY Problem Map 1.0 — 16 reproducible failure modes
Think of this page as a sub-room.
If you want full consultation and prescriptions, go back to the Emergency Room lobby.
A compact hub to ship safely and keep RAG/LLM systems stable after release.
Use this folder to pick the right guardrail, verify with measurable targets, and recover fast when things wobble. No infra change required.
Open these first
- Visual recovery map → RAG Architecture & Recovery
- Retrieval knobs end-to-end → Retrieval Playbook
- Traceability and snippet schema → Retrieval Traceability · Data Contracts
- Boot order and deploy traps → Bootstrap Ordering · Deployment Deadlock · Pre-Deploy Collapse
- Live ops tools → Live Monitoring for RAG · Debug Playbook
When to use this folder
- First calls after deploy crash or return stale content.
- ΔS and citations look fine yesterday but flip today.
- Rate limits cascade, queues spike, latency climbs.
- Canary looks good then full rollout breaks retrieval.
- Index swap succeeds but answers cite old snippets.
- Retries cause duplicate side effects or charges.
- Feature flags bleed traffic into unfinished paths.
- Maintenance windows corrupt embeddings or anchors.
Acceptance targets for a safe rollout
- ΔS(question, retrieved) ≤ 0.45 across three paraphrases.
- Coverage ≥ 0.70 on the expected new section.
- λ remains convergent on 2 seeds during rollout.
- Idempotency ≥ 99.9% on retry storms.
- Zero silent index mismatches (hash + counts match).
- P95 latency stays in budget with backpressure active.
Quick routes — per-page guides
| Scenario | Fix Page |
|---|---|
| Rollout readiness | rollout_readiness_gate.md |
| Canary strategy | staged_rollout_canary.md |
| Blue/green cutover | blue_green_switchovers.md |
| Version pin & freeze | version_pinning_and_model_lock.md |
| Vector index swap | vector_index_build_and_swap.md |
| Cache warmup | cache_warmup_invalidation.md |
| Rate limits | rate_limit_backpressure.md |
| Feature flags | feature_flags_safe_launch.md |
| Idempotency | idempotency_dedupe.md |
| Retry logic | retry_backoff.md |
| Rollback plan | rollback_and_fast_recovery.md |
| Postmortems | postmortem_and_regression_tests.md |
| Change freeze | release_calendar_and_change_freeze.md |
| Incident comms | incident_comms_and_statuspage.md |
| Shadow traffic | shadow_traffic_mirroring.md |
| Maintenance window | read_only_mode_and_maintenance_window.md |
| DB migrations | db_migration_guardrails.md |
60-second ship checklist
- Freeze the world → Pin model IDs, prompt revs, index hashes.
- Warm up safely → Build index off-path, preload caches with canary.
- Shadow then canary → Mirror prod queries, step rollout 5% → 25% → 100%.
- Guard the edge → Enable backpressure, retries with jitter, idempotency keys.
- Know your exit → Keep rollback switch and comms draft ready.
Symptoms → exact fix
| What you see | Open this |
|---|---|
| Deploy points to old snippets | vector_index_build_and_swap.md · cache_warmup_invalidation.md |
| Canary fine, full rollout breaks | staged_rollout_canary.md · feature_flags_safe_launch.md |
| Wrong model after failover | version_pinning_and_model_lock.md |
| Retries duplicate charges | idempotency_dedupe.md · retry_backoff.md |
| RL storms, timeouts | rate_limit_backpressure.md |
| Need rollback now | rollback_and_fast_recovery.md · blue_green_switchovers.md |
| Maintenance corrupts anchors | read_only_mode_and_maintenance_window.md · db_migration_guardrails.md |
| Unsure if safe to ship | rollout_readiness_gate.md |
FAQ
Q: What does ΔS mean here?
A: ΔS is a stability score. It measures how much the retrieved content drifts from the expected anchor when you change the query slightly. Lower is better (≤ 0.45 is safe).
Q: What is λ convergence?
A: λ tracks whether retrieval order flips unpredictably. If λ is stable across seeds, your rollout is consistent.
Q: Why do I need idempotency keys?
A: Without them, retries can double-charge a user or run the same side-effect twice. Keys make every request “safe to retry.”
Q: How do I know if my index swap worked?
A: Check doc counts and hashes before cutover. If they mismatch, you’re pointing at an incomplete index.
Q: Canary looked fine but production broke — why?
A: Canary often hides tail-latency, cache misses, or load-based rate limits. Always test at increasing % of live traffic.
Q: Why do you mention rollback comms?
A: Technical rollback is only half. Users and stakeholders need fast updates, so pre-draft Statuspage or Slack messages are essential.
🔗 Quick-Start Downloads (60 sec)
| Tool | Link | 3-Step Setup |
|---|---|---|
| WFGY 1.0 PDF | Engine Paper | 1️⃣ Download · 2️⃣ Upload to your LLM · 3️⃣ Ask “Answer using WFGY + <your question>” |
| TXT OS (plain-text OS) | TXTOS.txt | 1️⃣ Download · 2️⃣ Paste into any LLM chat · 3️⃣ Type “hello world” — OS boots instantly |
🧭 Explore More
| Module | Description | Link |
|---|---|---|
| WFGY Core | WFGY 2.0 engine is live: full symbolic reasoning architecture and math stack | View → |
| Problem Map 1.0 | Initial 16-mode diagnostic and symbolic fix framework | View → |
| Problem Map 2.0 | RAG-focused failure tree, modular fixes, and pipelines | View → |
| Semantic Clinic Index | Expanded failure catalog: prompt injection, memory bugs, logic drift | View → |
| Semantic Blueprint | Layer-based symbolic reasoning & semantic modulations | View → |
| Benchmark vs GPT-5 | Stress test GPT-5 with full WFGY reasoning suite | View → |
| 🧙♂️ Starter Village 🏡 | New here? Lost in symbols? Click here and let the wizard guide you through | Start → |
👑 Early Stargazers: See the Hall of Fame —
Engineers, hackers, and open source builders who supported WFGY from day one.
⭐ WFGY Engine 2.0 is already unlocked. ⭐ Star the repo to help others discover it and unlock more on the Unlock Board.