Update eval_benchmarking.md

2026-04-28 11:40:07 +00:00 · 2025-09-05 10:48:49 +08:00 · 547d9c6167
commit 547d9c6167
parent e722f1ed68
1 changed file with 17 additions and 0 deletions
--- a/ProblemMap/GlobalFixMap/Eval/eval_benchmarking.md
+++ b/ProblemMap/GlobalFixMap/Eval/eval_benchmarking.md
@@ -1,5 +1,22 @@
 # Eval Benchmarking — Protocols, Targets, and Reporting

+<details>
+  <summary><strong>🧭 Quick Return to Map</strong></summary>
+
+<br>
+
+  > You are in a sub-page of **Eval**.  
+  > To reorient, go back here:  
+  >
+  > - [**Eval** — model evaluation and benchmarking](./README.md)  
+  > - [**WFGY Global Fix Map** — main Emergency Room, 300+ structured fixes](../README.md)  
+  > - [**WFGY Problem Map 1.0** — 16 reproducible failure modes](../../README.md)  
+  >
+  > Think of this page as a desk within a ward.  
+  > If you need the full triage and all prescriptions, return to the Emergency Room lobby.
+</details>
+
+
 This page defines a clean, repeatable way to benchmark your pipeline and prove that a fix actually improved behavior. It uses the same WFGY instruments as everywhere else: ΔS for semantic stress, λ\_observe for stability, and E\_resonance for coherence over long windows.

 ## Open these first