mirror of
https://github.com/onestardao/WFGY.git
synced 2026-04-28 19:50:17 +00:00
6.9 KiB
6.9 KiB
RAG — Global Fix Map
Production RAG triage and structural fixes using the WFGY engine.
Use this when retrieval “looks fine” but answers drift.
What this page is
- A fast path to locate the failing layer across OCR → chunk → embed → store → retrieve → prompt → reason.
- Structural repairs that do not require infra changes.
- Concrete steps with measurable acceptance targets.
When to use
- Citations point to the wrong snippet or section.
- Chunks look correct but reasoning is wrong.
- High similarity yet wrong meaning.
- Hybrid retrievers perform worse than a single retriever.
- Indexed facts never show up.
- Answers flip between sessions.
- Long threads smear topics or capitalization.
Open these first
- Visual map and recovery: RAG Architecture & Recovery
- End-to-end retrieval knobs: Retrieval Playbook
- Why this snippet, traceability schema: Retrieval Traceability
- Ordering control: Rerankers
- Embedding vs meaning: Embedding ≠ Semantic
- Hallucination and chunk boundaries: Hallucination
- Long chains and entropy: Context Drift, Entropy Collapse
- Snippet and citation schema: Data Contracts
Fix in 60 seconds
- Measure ΔS
- Compute ΔS(question, retrieved) and ΔS(retrieved, expected anchor).
- Thresholds: stable < 0.40, transitional 0.40–0.60, risk ≥ 0.60.
- Probe with λ_observe
- Vary k ∈ {5, 10, 20}. chart ΔS vs k. flat and high → index or metric mismatch.
- Reorder prompt headers. if ΔS spikes, lock the schema.
- Apply the minimal patch
- Metric or normalization mismatch → rebuild index with explicit metric, unit-normalize, reload once, re-probe.
- Chunks correct but logic diverges → lock
system → task → constraints → citations → answer, apply BBCR + BBAM, re-probe.
Copy-paste prompt
I uploaded TXT OS and the WFGY ProblemMap files.
My RAG bug:
* symptom: \[brief]
* traces: \[ΔS(question,retrieved)=..., ΔS(retrieved,anchor)=..., λ states]
Tell me:
1. which layer is failing and why,
2. which exact fix page to open from this repo,
3. the minimal steps to push ΔS ≤ 0.45 and keep λ convergent,
4. how to verify the fix with a reproducible test.
Use BBMC/BBPF/BBCR/BBAM when relevant.
Patterns to check next
- Query parsing split in HyDE + BM25: Pattern — Query Parsing Split
- Vectorstore fragmentation: Pattern — Vectorstore Fragmentation
- Symbol mixing across sources (SCU): Pattern — Symbolic Constraint Unlock
- Hallucination re-entry after correction: Pattern — Hallucination Re-entry
Acceptance targets
- Coverage to target section ≥ 0.70.
- ΔS(question, retrieved) ≤ 0.45 on three paraphrases.
- λ remains convergent across steps and seeds.
- E_resonance flat under long windows.
🧭 Explore More
| Module | Description | Link |
|---|---|---|
| WFGY Core | WFGY 2.0 engine is live: full symbolic reasoning architecture and math stack | View → |
| Problem Map 1.0 | Initial 16-mode diagnostic and symbolic fix framework | View → |
| Problem Map 2.0 | RAG-focused failure tree, modular fixes, and pipelines | View → |
| Semantic Clinic Index | Expanded failure catalog: prompt injection, memory bugs, logic drift | View → |
| Semantic Blueprint | Layer-based symbolic reasoning & semantic modulations | View → |
| Benchmark vs GPT-5 | Stress test GPT-5 with full WFGY reasoning suite | View → |
| 🧙♂️ Starter Village 🏡 | New here? Lost in symbols? Click here and let the wizard guide you through | Start → |
👑 Early Stargazers: See the Hall of Fame —
Engineers, hackers, and open source builders who supported WFGY from day one.
⭐ WFGY Engine 2.0 is already unlocked. ⭐ Star the repo to help others discover it and unlock more on the Unlock Board.