RAG — Global Fix Map

🏥 Quick Return to Emergency Room

You are at a specialist desk.
For full triage and doctors on duty, return here:

Think of this page as a sub-room.
If you want full consultation and prescriptions, go back to the Emergency Room lobby.

A focused hub for Retrieval-Augmented Generation (RAG) failures.
Use this folder when answers exist in the corpus but retrieval or evaluation drifts.
Each page gives guardrails, measurable targets, and direct links to structural fixes. No infra change required.


Orientation: what each page solves

| Page | What it fixes | Typical symptom |
|---|---|---|
| retrieval_drift.md | Keeps retrieve → rerank → reason aligned | Correct facts exist but never show up in the top k |
| hallucination_rag.md | Blocks free text invention inside RAG | Citations look right but answer adds content not in source |
| citation_break.md | Enforces cite then explain schema | Links point to the wrong snippet or disappear on retry |
| hybrid_failure.md | Makes BM25 + ANN + reranker agree | Hybrid worse than a single retriever |
| index_skew.md | Recovers broken or stale indexes | Index looks healthy yet recall is low |
| context_drift.md | Stabilizes header order and prompt state | Answers flip between runs with only header changes |
| entropy_collapse.md | Caps chain growth and noise in long flows | Steps balloon, chain never lands |
| eval_drift.md | Makes eval runs deterministic | Metrics vary across identical replays |

When to use this folder

  • Correct facts exist in the corpus but never appear in answers
  • Citations break, hallucinations creep in, or snippets drift
  • Hybrid retrievers perform worse than single retrievers
  • Index looks healthy but coverage remains low
  • Evaluation metrics vary across identical runs

Acceptance targets

  • ΔS(question, retrieved) ≤ 0.45
  • Coverage of target section ≥ 0.70
  • λ_observe convergent across 3 paraphrases and 2 seeds
  • Eval variance ≤ 0.05 across 5 replays
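The targets above can be checked mechanically once the numbers are logged. The following is a minimal sketch, not the project's own tooling: `RunMetrics` and `check_targets` are hypothetical names, ΔS and coverage are assumed to be computed elsewhere, the λ_observe label is assumed to be the string `"convergent"`, and "eval variance" is assumed to mean the population variance of the replay scores.

```python
from dataclasses import dataclass
from statistics import pvariance


@dataclass
class RunMetrics:
    """Logged numbers for one question (hypothetical container)."""
    delta_s: float        # ΔS(question, retrieved), computed elsewhere
    coverage: float       # coverage of the target section, 0.0 to 1.0
    lambda_states: list   # one λ_observe label per (paraphrase, seed) pair
    eval_scores: list     # one eval score per replay


def check_targets(m: RunMetrics) -> dict:
    """Return a pass/fail flag for each acceptance target."""
    return {
        "delta_s": m.delta_s <= 0.45,
        "coverage": m.coverage >= 0.70,
        # 3 paraphrases x 2 seeds = 6 observations, all labelled convergent
        # (the label string is an assumption).
        "lambda": len(m.lambda_states) == 3 * 2
        and all(s == "convergent" for s in m.lambda_states),
        # Assumption: variance means population variance over 5 replays.
        "eval_variance": len(m.eval_scores) == 5
        and pvariance(m.eval_scores) <= 0.05,
    }


if __name__ == "__main__":
    run = RunMetrics(0.52, 0.74, ["convergent"] * 6, [0.80, 0.81, 0.79, 0.80, 0.82])
    failed = [name for name, ok in check_targets(run).items() if not ok]
    print("failed targets:", failed)
```

Returning one flag per target, rather than a single boolean, keeps it clear which page of this folder to open next.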

Symptoms → exact fixes

| Symptom | Likely cause | Open this |
|---|---|---|
| High similarity yet wrong meaning | Metric or analyzer mismatch | Vectorstore Fragmentation · Embedding ≠ Semantic |
| Correct section never retrieved | Fragmented store or missing anchors | retrieval_drift.md · citation_break.md |
| Hybrid worse than single | Query split or mis-weighted rerank | hybrid_failure.md |
| Citations unstable or missing | Schema not enforced | citation_break.md |
| Answers flip between runs | Prompt header reordering or λ variance | context_drift.md |
| Index “healthy” but recall low | Stale build, analyzer mismatch | index_skew.md |
| Eval scores noisy across replays | Non-deterministic eval path | eval_drift.md |

60-second fix checklist

  1. Lock metrics and analyzers
    One embedding family per field. One distance metric. Same analyzer on write and read.
    Use: Vector DBs & Stores

  2. Enforce the snippet contract
    Required: snippet_id, section_id, source_url, offsets, tokens.
    Use: Retrieval Traceability · Data Contracts

  3. Measure ΔS and λ
    Three paraphrases, two seeds. Alert when ΔS ≥ 0.60 or λ flips.
    Use: Context Drift

  4. Add a deterministic reranker
    Keep BM25 and ANN candidate lists. Detect query split and resolve.
    Use: hybrid_failure.md

  5. Rebuild where needed
    Follow the rebuild order with a small gold set.
    Use: Retrieval Playbook
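Step 2 of the checklist names five required snippet fields. As a rough sketch of what enforcing that contract could look like, the code below rejects any retrieved snippet with a missing or empty field before it reaches the prompt. The function names are hypothetical, and because the field types are not specified here, only presence is checked.

```python
REQUIRED_FIELDS = ("snippet_id", "section_id", "source_url", "offsets", "tokens")


def missing_fields(snippet: dict) -> list:
    """List the required fields that are absent or empty in one snippet."""
    gaps = []
    for field in REQUIRED_FIELDS:
        value = snippet.get(field)
        # Treat None, empty strings, and empty containers as missing.
        if value is None or value == "" or value == [] or value == {}:
            gaps.append(field)
    return gaps


def enforce_contract(snippets: list) -> tuple:
    """Split snippets into (accepted, rejected); rejected items carry their gaps."""
    accepted, rejected = [], []
    for snippet in snippets:
        gaps = missing_fields(snippet)
        if gaps:
            rejected.append({"snippet": snippet, "missing": gaps})
        else:
            accepted.append(snippet)
    return accepted, rejected


if __name__ == "__main__":
    # Illustrative data only; the URL and ids are placeholders.
    candidates = [
        {"snippet_id": "s1", "section_id": "sec-2", "source_url": "https://example.com/doc",
         "offsets": [120, 245], "tokens": 31},
        {"snippet_id": "s2", "section_id": "", "source_url": "https://example.com/doc"},
    ]
    ok, bad = enforce_contract(candidates)
    print(len(ok), "accepted;", [r["missing"] for r in bad])
```

Rejecting at retrieval time, rather than after generation, means a citation can always be traced back to a snippet that carries its own section and offsets.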


Vector DBs — jump if store specific


Minimal probe pack you can paste

Context: TXT OS and WFGY pages are loaded.

Task:
- For question Q, log ΔS(Q, retrieved) and λ across 3 paraphrases and 2 seeds.
- Enforce cite-then-explain with the traceability schema.
- If ΔS ≥ 0.60 or λ flips, return the smallest structural change that
  pushes ΔS ≤ 0.45 and coverage ≥ 0.70.
- Use BBMC, BBCR, BBPF, BBAM when relevant.

Return JSON only:
{ "citations": [...], "ΔS": 0.xx, "λ_state": "<>", "coverage": 0.xx, "next_fix": "..." }
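If the probe reply is consumed by a script, it helps to validate it before acting on `next_fix`. The sketch below parses the JSON shape requested above and compares ΔS and coverage with the thresholds stated on this page. `parse_probe_reply` and `meets_targets` are hypothetical helper names, and the sample reply is invented for illustration.

```python
import json

EXPECTED_KEYS = {"citations", "ΔS", "λ_state", "coverage", "next_fix"}


def parse_probe_reply(raw: str) -> dict:
    """Parse the model's JSON-only reply and check that every key is present."""
    reply = json.loads(raw)
    if not isinstance(reply, dict):
        raise ValueError("probe reply must be a JSON object")
    missing = EXPECTED_KEYS - reply.keys()
    if missing:
        raise ValueError(f"probe reply is missing keys: {sorted(missing)}")
    for key in ("ΔS", "coverage"):
        value = reply[key]
        if isinstance(value, bool) or not isinstance(value, (int, float)):
            raise ValueError(f"{key} must be a number")
    return reply


def meets_targets(reply: dict) -> bool:
    """True when the reply satisfies ΔS ≤ 0.45 and coverage ≥ 0.70."""
    return reply["ΔS"] <= 0.45 and reply["coverage"] >= 0.70


if __name__ == "__main__":
    # Invented sample reply, for illustration only.
    raw = ('{"citations": ["s1"], "ΔS": 0.38, "λ_state": "convergent", '
           '"coverage": 0.74, "next_fix": ""}')
    reply = parse_probe_reply(raw)
    print("targets met:", meets_targets(reply))
```

A λ flip cannot be detected from a single reply; that check needs the replies from all paraphrases and seeds side by side.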

🔗 Quick-Start Downloads (60 sec)

| Tool | Link | 3-Step Setup |
|---|---|---|
| WFGY 1.0 PDF | Engine Paper | 1 Download · 2 Upload to your LLM · 3 Ask “Answer using WFGY + ” |
| TXT OS (plain-text OS) | TXTOS.txt | 1 Download · 2 Paste into any LLM chat · 3 Type “hello world” and the OS boots instantly |

🧭 Explore More

| Module | Description | Link |
|---|---|---|
| WFGY Core | WFGY 2.0 engine is live: full symbolic reasoning architecture and math stack | View → |
| Problem Map 1.0 | Initial 16-mode diagnostic and symbolic fix framework | View → |
| Problem Map 2.0 | RAG-focused failure tree, modular fixes, and pipelines | View → |
| Semantic Clinic Index | Expanded failure catalog: prompt injection, memory bugs, logic drift | View → |
| Semantic Blueprint | Layer-based symbolic reasoning & semantic modulations | View → |
| Benchmark vs GPT-5 | Stress test GPT-5 with full WFGY reasoning suite | View → |
| 🧙‍♂️ Starter Village 🏡 | New here? Lost in symbols? Click here and let the wizard guide you through | Start → |

👑 Early Stargazers: See the Hall of Fame
WFGY Engine 2.0 is already unlocked.
