diff --git a/ProblemMap/Twin_Atlas/demos/reproduce-in-60-seconds.md b/ProblemMap/Twin_Atlas/demos/reproduce-in-60-seconds.md new file mode 100644 index 00000000..b9cc2fa8 --- /dev/null +++ b/ProblemMap/Twin_Atlas/demos/reproduce-in-60-seconds.md @@ -0,0 +1,154 @@ + + +# โšก Reproduce in 60 Seconds + +> The shortest public rerun path for the current WFGY 4.0 Twin Atlas proof surface. + +This page exists for readers who do not want a long explanation first. + +If you want to know whether the governance shift is visible, this is the fastest public path. + +You need exactly two files: + +- [Twin Atlas Runtime TXT](./prompts/wfgy-4_0-twin-atlas-runtime.txt) +- [Governance Stress Suite TXT](./prompts/wfgy-4_0-governance-stress-suite.txt) + +--- + +## ๐Ÿ•’ One-minute summary + +Do this in order: + +1. open your target AI system +2. paste the Twin Atlas runtime TXT +3. paste the governance stress suite TXT +4. let the model complete both passes +5. compare the BEFORE pass and the AFTER pass + +That is it. + +--- + +## ๐Ÿ“‚ The two files + +### 1. Twin Atlas runtime +Use this first: + +- [wfgy-4_0-twin-atlas-runtime.txt](./prompts/wfgy-4_0-twin-atlas-runtime.txt) + +This loads the public WFGY 4.0 Twin Atlas runtime. + +### 2. Governance stress suite +Use this second: + +- [wfgy-4_0-governance-stress-suite.txt](./prompts/wfgy-4_0-governance-stress-suite.txt) + +This runs the current public governance stress surface. + +--- + +## ๐Ÿงช What you are actually checking + +The question is not: + +โ€œdid the model become more polite?โ€ +โ€œdid the model become more cautious?โ€ +โ€œdid the answer get softer?โ€ + +The real question is: + +**did the model stop turning plausibility into public conclusion too early?** + +That is the core public test. + +A strong AFTER pass should make at least one of these shifts visible: + +- less illegal commitment +- less evidence-boundary crossing +- less single-cause compression +- less contradiction suppression +- more lawful downgrade +- stronger preservation of still-live ambiguity + +--- + +## ๐Ÿ“Œ What not to overclaim + +This page is useful because it is fast. + +That does **not** mean it is the cleanest reviewer-facing protocol or a universal benchmark. + +A visible before / after shift is meaningful. +A repeated pattern across public runs is meaningful. +A reproducible TXT-based path is meaningful. + +But none of those things automatically prove universal superiority in every domain or every future deployment environment. + +This page is honest about that. + +--- + +## ๐Ÿ” If you want a cleaner public rerun path + +Use: + +- [Advanced Clean Protocol](../evidence/advanced-clean-protocol.md) + +That page is better if you want the reviewer-facing path instead of the shortest path. + +--- + +## ๐Ÿ–ผ๏ธ If you want to compare with the current public runs + +Use: + +- [AI Eval](./ai-eval.md) +- [Screenshots](./screenshots/) +- [Raw Runs](../evidence/raw-runs/) +- [Results Summary](../evidence/results-summary.md) + +--- + +## ๐Ÿงญ Where to go next + +### If you want screenshot-first proof +- [AI Eval](./ai-eval.md) + +### If you want aggregate interpretation +- [Results Summary](../evidence/results-summary.md) + +### If you want original model outputs +- [Raw Runs](../evidence/raw-runs/) + +### If you want the flagship case layer +- [Flagship Cases](../evidence/flagship-cases.md)