# Example 02 — Self-Reflection Trace and Triage (No.1 Hallucination & No.2 Query Parsing)
**Goal**
Turn raw traces into a **decision report** that tells you where the failure started: retrieval or generation.
No SDKs, no extra dependencies. Works with the trace produced in Example 01.
**Problem Map link**
Targets **No.1 Hallucination & Chunk Drift** and **No.2 Query Parsing / Intent Split**.
Second-order benefits for No.4 (Tail Noise) once you cut low-value evidence.
**Outcome**
- A per-question **triage label**: `retrieval_drift`, `generation_drift`, or `ok`
- A short **why** field pointing to the exact symptom
- A machine-readable JSONL report you can diff over time
---
## 1) Inputs
- `runs/trace.jsonl` from Example 01 (each line: `{"q": "...", "chunks":[{"id":"..."},...], "answer":"...", "ok": true/false }`)
- `data/chunks.json` (chunk id → text)
> If you skipped Example 01, create the same two files now. Keep chunk ids stable.
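For a quick shape check, here is a minimal sketch of both files; the ids and text are hypothetical, only the field names matter.
One line of `runs/trace.jsonl`:
```json
{"q": "What is X?", "chunks": [{"id": "p1#1"}, {"id": "p2#3"}], "answer": "X is a placeholder term. citations: [p1#1]", "ok": true}
```
And `data/chunks.json`:
```json
[
  {"id": "p1#1", "text": "X is a placeholder term used throughout the docs."},
  {"id": "p2#3", "text": "Y covers the configuration options."}
]
```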
---
## 2) Reflection rubric (simple and reproducible)
We use deterministic checks, not another LLM, to avoid circular reasoning.
**Signals**
- **Template compliance**: either contains `not in context` or a `citations:` line
- **Citation overlap**: ids mentioned in the answer must be a subset of retrieved ids
- **Evidence containment**: ≥ 1 non-trivial phrase (≥ 5 chars, letters/digits) from the answer appears verbatim in evidence
- **Question-evidence alignment**: ≥ 1 query token appears in evidence; if not, likely retrieval drift
**Labels**
- `generation_drift`: template violated **or** citation overlap = 0 **or** evidence containment = 0 while evidence contains relevant terms
- `retrieval_drift`: evidence lacks query terms (poor alignment), even if the answer looks “reasonable”
- `ok`: citations present **and** evidence containment > 0 **and** some query terms found in evidence
- `refusal_ok`: answer is exactly `not in context` and evidence truly lacks query terms
- `refusal_suspect`: `not in context` but evidence actually contains query terms (over-refusal)
---
## 3) Path A — Python reflection (single file)
Create `reflect.py`.
```python
# reflect.py -- classify traces as retrieval_drift / generation_drift / ok
import json, re, sys
from typing import Dict, List

PHRASE_RE = re.compile(r"[A-Za-z0-9][A-Za-z0-9\-\s]{4,}")  # >= 5 chars

def load_chunks(path: str) -> Dict[str, str]:
    with open(path, encoding="utf8") as f:
        items = json.load(f)
    return {c["id"]: c["text"] for c in items}

def extract_citation_ids(ans: str) -> List[str]:
    # naive parser for: citations: [p1#1, p2#3]
    m = re.search(r"citations\s*:\s*\[([^\]]*)\]", ans, re.IGNORECASE)
    if not m:
        return []
    raw = m.group(1)
    return [t.strip() for t in re.split(r"[,\s]+", raw) if t.strip()]

def phrases(s: str) -> List[str]:
    return [p.strip() for p in PHRASE_RE.findall(s)]

def any_phrase_in_evidence(ans: str, ev: str) -> bool:
    ev_low = ev.lower()
    for p in phrases(ans):
        if len(p) >= 5 and p.lower() in ev_low:
            return True
    return False

def any_query_token_in_evidence(q: str, ev: str) -> bool:
    qtok = {w for w in re.split(r"\W+", q.lower()) if len(w) >= 3}
    if not qtok:
        return False
    evtok = set(re.split(r"\W+", ev.lower()))
    return len(qtok & evtok) > 0

def reflect_one(rec: Dict, chunk_map: Dict[str, str]) -> Dict:
    q, ans = rec["q"], rec.get("answer", "")
    ids = [c["id"] for c in rec.get("chunks", []) if "id" in c]
    ev = "\n\n".join(chunk_map.get(i, "") for i in ids)
    tmpl_ok = ("not in context" in ans.lower()) or ("citations:" in ans.lower())
    cit_ids = extract_citation_ids(ans)
    cit_overlap = all(cid in ids for cid in cit_ids) if cit_ids else False
    ev_contains_answer = any_phrase_in_evidence(ans, ev)
    q_align = any_query_token_in_evidence(q, ev)
    if "not in context" in ans.lower():
        if q_align:
            label, why = "refusal_suspect", "evidence contains query terms but model refused"
        else:
            label, why = "refusal_ok", "no query terms found in evidence; refusal acceptable"
    else:
        if not tmpl_ok or (cit_ids and not cit_overlap):
            label, why = "generation_drift", "template/citations violated"
        elif not ev_contains_answer and q_align:
            label, why = "generation_drift", "answer text not grounded in evidence"
        elif not q_align:
            label, why = "retrieval_drift", "evidence lacks query terms"
        else:
            label, why = "ok", "citations present and answer grounded"
    return {
        "q": q, "label": label, "why": why,
        "chunks": ids, "citations_in_answer": cit_ids
    }

if __name__ == "__main__":
    if len(sys.argv) < 3:
        print("usage: python reflect.py runs/trace.jsonl data/chunks.json")
        sys.exit(1)
    trace_path, chunk_path = sys.argv[1], sys.argv[2]
    chunks = load_chunks(chunk_path)
    out = []
    with open(trace_path, encoding="utf8") as f:
        for line in f:
            if not line.strip():
                continue
            rec = json.loads(line)
            out.append(reflect_one(rec, chunks))
    for r in out:
        print(json.dumps(r, ensure_ascii=False))
```
Run:
```bash
python reflect.py runs/trace.jsonl data/chunks.json > runs/report.jsonl
```
**Pass criteria**
* For “What is X?” you should see `label: ok`
* For “What is Z?” you should see `label: refusal_ok`
* If you deliberately break the template or citations, you should get `generation_drift`
* If you swap chunks to unrelated text, you should get `retrieval_drift`
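For reference, a clean run over the Example 01 questions might emit lines like these (ids and wording are illustrative; the `why` strings come from the script):
```json
{"q": "What is X?", "label": "ok", "why": "citations present and answer grounded", "chunks": ["p1#1", "p2#3"], "citations_in_answer": ["p1#1"]}
{"q": "What is Z?", "label": "refusal_ok", "why": "no query terms found in evidence; refusal acceptable", "chunks": ["p1#1", "p2#3"], "citations_in_answer": []}
```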
---
## 4) Path B — Node reflection (single file, no deps)
Create `reflect.mjs`.
```js
// reflect.mjs -- classify traces as retrieval_drift / generation_drift / ok
import fs from "node:fs";

function loadChunks(path) {
  const arr = JSON.parse(fs.readFileSync(path, "utf8"));
  const m = {};
  for (const c of arr) m[c.id] = c.text;
  return m;
}

function extractCitationIds(ans) {
  const m = ans.match(/citations\s*:\s*\[([^\]]*)\]/i);
  if (!m) return [];
  return m[1].split(/[, \t\r\n]+/).map(s => s.trim()).filter(Boolean);
}

function phrases(s) {
  return (s.match(/[A-Za-z0-9][A-Za-z0-9\-\s]{4,}/g) || []).map(x => x.trim());
}

function anyPhraseInEvidence(ans, ev) {
  const evLow = ev.toLowerCase();
  return phrases(ans).some(p => p.length >= 5 && evLow.includes(p.toLowerCase()));
}

function anyQueryTokenInEvidence(q, ev) {
  const qtok = new Set(q.toLowerCase().split(/\W+/).filter(w => w.length >= 3));
  if (!qtok.size) return false;
  const evtok = new Set(ev.toLowerCase().split(/\W+/));
  for (const w of qtok) if (evtok.has(w)) return true;
  return false;
}

function reflectOne(rec, chunkMap) {
  const q = rec.q, ans = rec.answer || "";
  const ids = (rec.chunks || []).map(c => c.id).filter(Boolean);
  const ev = ids.map(i => chunkMap[i] || "").join("\n\n");
  const tmplOk = ans.toLowerCase().includes("not in context") || /citations\s*:/i.test(ans);
  const citIds = extractCitationIds(ans);
  const citOverlap = citIds.length ? citIds.every(id => ids.includes(id)) : false;
  const evContainsAnswer = anyPhraseInEvidence(ans, ev);
  const qAlign = anyQueryTokenInEvidence(q, ev);
  let label, why;
  if (ans.toLowerCase().includes("not in context")) {
    if (qAlign) { label = "refusal_suspect"; why = "evidence contains query terms but model refused"; }
    else { label = "refusal_ok"; why = "no query terms found in evidence; refusal acceptable"; }
  } else if (!tmplOk || (citIds.length && !citOverlap)) {
    label = "generation_drift"; why = "template/citations violated";
  } else if (!evContainsAnswer && qAlign) {
    label = "generation_drift"; why = "answer text not grounded in evidence";
  } else if (!qAlign) {
    label = "retrieval_drift"; why = "evidence lacks query terms";
  } else {
    label = "ok"; why = "citations present and answer grounded";
  }
  return { q, label, why, chunks: ids, citations_in_answer: citIds };
}

if (process.argv.length < 4) {
  console.error("usage: node reflect.mjs runs/trace.jsonl data/chunks.json");
  process.exit(1);
}
const [tracePath, chunkPath] = process.argv.slice(2);
const chunkMap = loadChunks(chunkPath);
const lines = fs.readFileSync(tracePath, "utf8").split(/\r?\n/).filter(Boolean);
for (const line of lines) {
  const rec = JSON.parse(line);
  console.log(JSON.stringify(reflectOne(rec, chunkMap)));
}
```
Run:
```bash
node reflect.mjs runs/trace.jsonl data/chunks.json > runs/report.jsonl
```
Pass criteria are the same as for the Python path.
---
## 5) Read the report as a triage table
Turn `runs/report.jsonl` into a quick view:
```bash
echo -e "| q | label | why |\n|---|---|---|" > runs/report.md
cat runs/report.jsonl | python - <<'PY'
import sys, json
for line in sys.stdin:
r = json.loads(line)
q = (r["q"][:60] + "…") if len(r["q"])>60 else r["q"]
print(f"| {q} | **{r['label']}** | {r['why']} |")
PY >> runs/report.md
```
You'll get a Markdown table you can paste into an issue or PR.
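With the two sample questions above, `runs/report.md` would render roughly like this (rows are illustrative):

| q | label | why |
|---|---|---|
| What is X? | **ok** | citations present and answer grounded |
| What is Z? | **refusal_ok** | no query terms found in evidence; refusal acceptable |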
---
## 6) What to do for each label
* `retrieval_drift`
  Move to **Example 03 (Intersection + Rerank)**; reduce chunk size so entity + constraints live together; drop the tail after the score knee (a sketch follows this list).
* `generation_drift`
Keep the evidence-only template at the end of the system prompt; set temperature to 0 for evaluation; if needed, require exact `citations: [id,...]`.
* `refusal_ok`
This is good. It means the model did not fabricate beyond evidence.
* `refusal_suspect`
  Increase top-k before rerank. If the query is truly absent from your corpus, consider a fallback answer or a different data source.
* `ok`
Baseline reached. Move to Eval examples to measure precision/recall and stability.
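For the tail cut mentioned under `retrieval_drift`, here is a minimal sketch of a knee cut over retrieval scores. The traces in this example do not carry scores, so you would log `(id, score)` pairs at retrieval time first; the one-step "largest drop" heuristic below is an assumption, not the only way to locate the knee.
```python
# knee_cut.py -- keep chunks up to the steepest score drop ("knee")
def knee_cut(scored):
    """scored: list of (chunk_id, score), sorted highest first."""
    if len(scored) < 3:
        return scored  # too few points to locate a knee
    drops = [scored[i][1] - scored[i + 1][1] for i in range(len(scored) - 1)]
    knee = drops.index(max(drops))  # index of the steepest drop
    return scored[: knee + 1]

print(knee_cut([("p1#1", 0.91), ("p2#3", 0.88), ("p7#2", 0.41), ("p9#9", 0.39)]))
# -> [('p1#1', 0.91), ('p2#3', 0.88)]
```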
---
## 7) Optional: LLM-assisted reflection (still deterministic output)
If you prefer a short natural-language summary per case, you can add a second pass that prompts an LLM to **summarize** the deterministic checks in 1–2 sentences.
Keep the JSON decision from the scripts above as source-of-truth, and store the LLM summary as an auxiliary field.
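A minimal sketch of that second pass, assuming a `call_llm(prompt) -> str` helper you wire to your provider (the stub below is hypothetical):
```python
# summarize_report.py -- attach an LLM summary as an auxiliary field
import json

def call_llm(prompt: str) -> str:
    # Hypothetical stub: replace with your provider's completion call.
    return "LLM summary placeholder"

def summarize(record: dict) -> dict:
    prompt = ("Summarize this RAG triage decision in 1-2 sentences. "
              "Do not change the label.\n" + json.dumps(record, ensure_ascii=False))
    record["llm_summary"] = call_llm(prompt)  # auxiliary; label/why stay source-of-truth
    return record

if __name__ == "__main__":
    with open("runs/report.jsonl", encoding="utf8") as f:
        for line in f:
            if line.strip():
                print(json.dumps(summarize(json.loads(line)), ensure_ascii=False))
```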
---
## 8) Common mistakes
* Mixing traces from different chunking pipelines → chunk ids do not resolve. Keep a single chunker per index build.
* Relying only on citation strings → models can fabricate ids. Always check overlap with retrieved ids.
* Treating refusal as a failure → under constrained evidence, refusal is the correct output.
---
## 9) Next steps
* Run **Example 03** to harden retrieval
* Use **Eval** docs to compare before/after on precision, refusal, and citation overlap
* Wire this reflection into CI: fail a build if `generation_drift` exceeds your threshold
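A minimal sketch of that CI gate, reading the report produced above (the threshold is a knob you pick):
```python
# ci_gate.py -- exit nonzero when generation_drift exceeds a threshold
import json, sys

THRESHOLD = 0  # max tolerated generation_drift cases

def main(path: str = "runs/report.jsonl") -> int:
    drifts = 0
    with open(path, encoding="utf8") as f:
        for line in f:
            if line.strip() and json.loads(line)["label"] == "generation_drift":
                drifts += 1
    print(f"generation_drift: {drifts} (threshold {THRESHOLD})")
    return 1 if drifts > THRESHOLD else 0

if __name__ == "__main__":
    sys.exit(main(*sys.argv[1:]))
```
Run it as the last CI step: `python ci_gate.py runs/report.jsonl`.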
---
### 🧭 Explore More
| Module | Description | Link |
| --------------------- | -------------------------------------------------------------------- | -------------------------------------------------------------------------------------------------- |
| WFGY Core | Standalone semantic reasoning engine for any LLM | [View →](https://github.com/onestardao/WFGY/tree/main/core/README.md) |
| Problem Map 1.0 | Initial 16-mode diagnostic and symbolic fix framework | [View →](https://github.com/onestardao/WFGY/tree/main/ProblemMap/README.md) |
| Problem Map 2.0 | RAG-focused failure tree, modular fixes, and pipelines | [View →](https://github.com/onestardao/WFGY/blob/main/ProblemMap/rag-architecture-and-recovery.md) |
| Semantic Clinic Index | Expanded failure catalog: prompt injection, memory bugs, logic drift | [View →](https://github.com/onestardao/WFGY/blob/main/ProblemMap/SemanticClinicIndex.md) |
| Semantic Blueprint | Layer-based symbolic reasoning & semantic modulations | [View →](https://github.com/onestardao/WFGY/tree/main/SemanticBlueprint/README.md) |
| Benchmark vs GPT-5 | Stress test GPT-5 with full WFGY reasoning suite | [View →](https://github.com/onestardao/WFGY/tree/main/benchmarks/benchmark-vs-gpt5/README.md) |
---
> 👑 **Early Stargazers: [See the Hall of Fame](https://github.com/onestardao/WFGY/tree/main/stargazers)**
> Engineers, hackers, and open source builders who supported WFGY from day one.
> <img src="https://img.shields.io/github/stars/onestardao/WFGY?style=social" alt="GitHub stars"> ⭐ Help reach 10,000 stars by 2025-09-01 to unlock Engine 2.0 for everyone ⭐ **[Star WFGY on GitHub](https://github.com/onestardao/WFGY)**
<div align="center">
[![WFGY Main](https://img.shields.io/badge/WFGY-Main-red?style=flat-square)](https://github.com/onestardao/WFGY)
 
[![TXT OS](https://img.shields.io/badge/TXT%20OS-Reasoning%20OS-orange?style=flat-square)](https://github.com/onestardao/WFGY/tree/main/OS)
 
[![Blah](https://img.shields.io/badge/Blah-Semantic%20Embed-yellow?style=flat-square)](https://github.com/onestardao/WFGY/tree/main/OS/BlahBlahBlah)
 
[![Blot](https://img.shields.io/badge/Blot-Persona%20Core-green?style=flat-square)](https://github.com/onestardao/WFGY/tree/main/OS/BlotBlotBlot)
 
[![Bloc](https://img.shields.io/badge/Bloc-Reasoning%20Compiler-blue?style=flat-square)](https://github.com/onestardao/WFGY/tree/main/OS/BlocBlocBloc)
 
[![Blur](https://img.shields.io/badge/Blur-Text2Image%20Engine-navy?style=flat-square)](https://github.com/onestardao/WFGY/tree/main/OS/BlurBlurBlur)
 
[![Blow](https://img.shields.io/badge/Blow-Game%20Logic-purple?style=flat-square)](https://github.com/onestardao/WFGY/tree/main/OS/BlowBlowBlow)
</div>