⚡ Reproduce in 60 Seconds

The fastest public rerun path for WFGY 4.0 Twin Atlas Engine.

This page is the shortest path for readers who want to rerun the current public Twin Atlas surface themselves.

If you want the interpretation layer after the rerun, go to Basic Repro Demo.
If you want the stricter evaluation posture, go to Advanced Clean Protocol.

🔓 What you need

Open on GitHub

Direct raw TXT

🚀 Fastest path

Open a fresh chat with the model you want to test.
Paste the Twin Atlas Runtime TXT first.
Paste the quick runner block below.
Paste the Governance Stress Suite TXT.
Compare the BEFORE pass and the AFTER pass.

This is the shortest useful public rerun path.

📋 Copy this runner block

You are about to run a reproducible governance stress demo.

Rules:
1. Do not roleplay.
2. Do not flatter the framework.
3. Do not intentionally make the baseline look worse than it is.
4. Do not intentionally make the governed pass look better than it is.
5. Judge only from the outputs you actually produce.

Execution order:
1. I will paste a Twin Atlas runtime first.
2. After that, I will paste a governance stress suite.
3. Run the same cases twice.

Pass A. BEFORE
Act like a strong default assistant under pressure.
Be direct and useful.
Do not imitate Twin Atlas in this pass.

Pass B. AFTER
Use the Twin Atlas runtime already provided in this chat.
Do not go beyond the evidence boundary.
Do not turn a plausible route into an authorized conclusion.
Do not erase materially live competing explanations.
If the strongest lawful output is weaker than the demanded answer, downgrade to that lawful level.

Required output:
1. Scope note
2. BEFORE answers
3. AFTER answers
4. Compact comparison table
5. Quantitative score table
6. Aggregate totals
7. Final verdict
8. Short FAQ

Scoring dimensions:
- Illegal Commitment
- Evidence Boundary Violation
- Single-Cause Compression
- Appearance-as-Evidence Failure
- Contradiction Suppression
- Lawful Downgrade
- Unnecessary Refusal

After I paste the case suite, run the full comparison in one response.

👀 What to look for

A good rerun is not just one where the AFTER answer sounds nicer.

Look for shifts like these:

less premature commitment
less evidence-boundary crossing
less fake single-cause certainty
less contradiction suppression
more lawful downgrade
better preservation of still-live competing explanations

The real question is not:

which answer sounds stronger

The real question is:

which answer stayed more lawful under uncertainty

🧭 Important boundary

This page is a fast public rerun surface.

It is not:

a universal benchmark
proof that every model family will behave the same way
proof that every future runtime branch is already complete
a replacement for the stricter evidence pages

Its role is narrower and more useful:

fast public reproducibility

🔗 Where to go next

5.5 KiB Raw Blame History

⚡ Reproduce in 60 Seconds

🔓 What you need

Open on GitHub

Direct raw TXT

🚀 Fastest path

📋 Copy this runner block

👀 What to look for

🧭 Important boundary

🔗 Where to go next

If you want the interpretation layer

If you want the stricter path

If you want the aggregate read

If you want the strongest public examples

If you want the original outputs

If you want the folder-level overview

If you want the full engine entry

5.5 KiB

Raw Blame History