diff --git a/ProblemMap/Inverse_Atlas/experiments/repro-60-seconds.md b/ProblemMap/Inverse_Atlas/experiments/repro-60-seconds.md new file mode 100644 index 00000000..5c41b68b --- /dev/null +++ b/ProblemMap/Inverse_Atlas/experiments/repro-60-seconds.md @@ -0,0 +1,360 @@ + + +# Repro in 60 Seconds + +> The fastest public way to feel what Inverse Atlas changes βš–οΈ + +This page is for one simple goal: + +**show the difference quickly** + +It is not trying to be a full academic protocol. + +It is trying to let a person reproduce the core behavioral shift of Inverse Atlas in the shortest clean way. + +The basic idea is simple: + +- ask the same question twice +- once with no Inverse Atlas +- once with Inverse Atlas +- compare whether the governed version becomes more lawful under pressure + +That is the whole point of this page. + +--- + +## Quick Links πŸ”Ž + +| Section | Link | +|---|---| +| Inverse Atlas Home | [Inverse Atlas README](../README.md) | +| Versions | [Versions](../versions.md) | +| Quick Start | [Quick Start](../quickstart.md) | +| Runtime Guide | [Runtime Guide](../runtime-guide.md) | +| Experiments Home | [Experiments](./README.md) | +| Status and Boundaries | [Status and Boundaries](../status-and-boundaries.md) | +| Runtime Layer | [Runtime Artifacts](../runtime/README.md) | +| Twin Atlas | [Twin Atlas](../../Twin_Atlas/README.md) | + +--- + +## The shortest answer 🧩 + +If you only want the fastest path, do this: + +### Window A +Ask the question normally. + +### Window B +Load one Inverse Atlas version, then ask the same question. + +### Compare +Look for whether the Inverse Atlas version is better at: + +- refusing illegal high-resolution escalation +- avoiding fake completion +- refusing cosmetic repair inflation +- staying coarse or unresolved lawfully when needed +- keeping visible confidence below what was actually earned + +That is the cleanest first reproduction path. + +--- + +## What you need first βœ… + +Before you start, you only need four things: + +- one strong language model +- one Inverse Atlas version +- one test case +- two chat windows or two clean comparison runs + +You do **not** need the full paper first. + +You do **not** need a benchmark notebook first. + +You do **not** need the Bridge layer first. + +You only need enough structure to compare one ordinary answer against one governed answer. + +--- + +## Version choice first 🌟 + +If you do not know which version to use, start with: + +### **Recommended: Advanced** + +That is the current best default for most serious users. + +If you want a lighter first try, use **Basic**. + +If you want a harder audit-style run, use **Strict**. + +--- + +## Method A Β· Fastest experience + +This is the fastest public-facing version. + +### Step 1 +Load one Inverse Atlas version into your system instructions, project instructions, or equivalent runtime layer. + +### Step 2 +Paste the demo harness. + +### Step 3 +Choose one case from the case pack. + +### Step 4 +Run the prompt and inspect the output. + +In the current artifact design, this is intended to produce: + +- a simulated baseline response +- an inverse-governed response +- a compact structural difference summary + +This is the most product-like path, and it is the best first contact for most people. + +--- + +## Method B Β· Cleaner comparison + +This is the cleanest baseline vs inverse comparison for ordinary users. + +### Window A +Use the same model with no Inverse Atlas layer. + +Ask your chosen case directly. + +### Window B +Use the same model with one Inverse Atlas version loaded. + +Ask the same case again. + +### Then compare +Do not judge only by tone. + +Judge by structure: + +- Did baseline escalate too early +- Did baseline collapse neighboring routes too quickly +- Did baseline overclaim certainty +- Did baseline offer cosmetic repair as if structural +- Did the inverse-governed answer stay lawful under the same pressure + +This comparison matters because a baseline answer may look more decisive while still being less lawful. The current paper explicitly treats this as one of the main reasons the demo harness and evaluator exist. + +--- + +## Method C Β· Stricter evaluator pass + +Use this when you want a more formal comparison layer. + +### Step 1 +In one chat, produce: + +- the baseline answer +- the inverse-governed answer + +### Step 2 +Open a second clean chat. + +### Step 3 +Load the evaluator. + +### Step 4 +Provide: + +- the original user input +- the baseline output +- the inverse output + +### Step 5 +Ask for pair evaluation. + +At the current MVP stage, the evaluator is designed to report things like: + +- legality winner +- baseline main risk +- inverse-governed main strength +- contrast on resolution behavior +- contrast on certainty behavior +- contrast on repair legality +- contrast on public-ceiling compliance + +This path is very useful for Hero Log, research notes, and later benchmark-style records. + +--- + +## Method D Β· Twin Atlas path + +This is the more complete dual-layer direction. + +### Step 1 +Use the forward Atlas first to get: + +- likely route +- likely family +- likely broken invariant region + +### Step 2 +Pass that along with the original user input into the Inverse Atlas runtime. + +### Step 3 +Let Inverse Atlas decide whether the route is actually strong enough for lawful public output. + +The key law here is: + +**forward guidance is a weak prior, not automatic authorization** + +That asymmetry must be preserved. + +The current paper is explicit on this point: the forward layer informs the inverse layer, but does not overrule it. + +--- + +## What kind of case should you use first 🎯 + +For the first reproduction, do **not** pick an easy toy prompt. + +Pick something that pressures legality. + +Good first-case properties include: + +- topic lure +- thin evidence +- neighboring-cut conflict +- fake repair temptation +- forced illegal specificity +- false completion pressure +- long-context contamination + +The current paper’s case design is intentionally built around these kinds of pressures, because they are exactly the places where legality-first governance should diverge from normal direct answering. + +--- + +## What success looks like πŸ‘€ + +A good first reproduction does **not** need to prove everything. + +It only needs to make the difference visible. + +At the MVP stage, that usually means you can already see one or more of these: + +- less illegal resolution escalation +- less false completion +- better ambiguity retention +- more lawful downgrade into COARSE or UNRESOLVED +- better refusal of cosmetic repair inflation +- better public-ceiling discipline + +That is already enough for a meaningful first win. + +--- + +## What this page is not trying to do β›” + +This page is not trying to be: + +- the full benchmark protocol +- the full evaluator manual +- the full Twin Atlas handoff specification +- the final empirical validation layer + +It is much narrower. + +Its job is simply this: + +**make the first reproduction path obvious and fast** + +That is why this page is product-style, not paper-style. + +--- + +## Recommended public entry order 🧭 + +For most public-facing readers, the cleanest order is: + +1. choose **Advanced** +2. run one strong demo case +3. compare baseline vs inverse +4. only then try evaluator mode +5. only after that try Twin Atlas mode + +That order is important. + +If you start with the hardest research path first, many people will miss the product value. + +If you start with the fast difference first, the framework becomes much easier to feel. + +--- + +## Where to go next πŸ“š + +If you want the overall experiments layer, go to: + +[Experiments](./README.md) + +If you want the runtime logic behind the artifacts, go to: + +[Runtime Guide](../runtime-guide.md) + +If you want the version strategy, go to: + +[Versions](../versions.md) + +If you want the current honesty boundary, go to: + +[Status and Boundaries](../status-and-boundaries.md) + +If you want the larger family direction, go to: + +[Twin Atlas](../../Twin_Atlas/README.md) + +--- + +## Final Note + +This page matters because a framework becomes much more real when a new user can feel it quickly. + +The 60-second reproduction path is not the whole story. + +But it is one of the most important first doors into the current Inverse Atlas MVP. + +That is why this page should stay simple, fast, and honest. 🌱