Update README.md

This commit is contained in:
PSBigBig + MiniPS 2026-03-25 21:05:31 +08:00 committed by GitHub
parent 49da1fd84b
commit 3bd8329440
No known key found for this signature in database
GPG key ID: B5690EEEBB952194

View file

@ -1,57 +1,35 @@
# 🧭 Inverse Atlas · Before AI Answers, It Must Earn the Right
> A legitimacy-first AI runtime for rigorous reasoning in the age of over-answering.
> Inverse Atlas governs whether a model may answer, how strongly it may answer, when it must stay broad,
> unresolved, or stop, and how far it may go under legitimate reasoning conditions. ⚖️
> Legitimacy-first AI runtime for rigorous reasoning.
> Inverse Atlas decides whether a model may answer, how strongly it may answer, and when it must stay `COARSE`, `UNRESOLVED`, or `STOP`. ⚖️
<img width="1536" height="1024" alt="Inverse_Atlas_Hero" src="https://github.com/user-attachments/assets/4761f80e-b134-43d0-8ce4-48c3e656e877" />
| Default AI order | Inverse Atlas order |
|---|---|
| answer first | constitute first |
| soften later | authorize before emission |
| patch after overclaim | preserve lawful uncertainty |
Most AI systems are built as if answering is the default.
**Not another safer wrapper. Not a post hoc filter. Not just a stricter prompt.**
**Inverse Atlas is a pre-generative governance layer for AI output.**
Inverse Atlas changes that order.
Instead of:
- answer first
- soften later
- repair after overclaim
Inverse Atlas asks a harder prior question:
**has this answer actually earned the right to exist at this level of resolution?**
That is the core shift.
**generation is not treated as a default right**
**generation is treated as an authorized act**
This is not just a safer tone wrapper.
It is not just a post hoc filter.
It is not just a stricter prompt.
It is a new pre-generative order for AI output.
Built for:
- vibe coders
- AI builders
- agent builders
- engineers debugging with LLMs
- anyone tired of false certainty, premature diagnosis, cosmetic repair inflation, and public overclaim
**Built for:** vibe coders, AI builders, agent builders, engineers debugging with LLMs, and anyone tired of false certainty, premature diagnosis, cosmetic repair inflation, and public overclaim.
---
## ⚡ Start in 60 Seconds
If you want the fastest way to feel what Inverse Atlas changes, use this order:
| Step | Action |
|---|---|
| 1 | Start with [Inverse Atlas Advanced](./runtime/inverse-advanced.txt) |
| 2 | Run the [Demo Harness](./runtime/inverse-demo.txt) |
| 3 | Pick one case from the [Case Pack](./runtime/inverse-cases.txt) |
| 4 | Compare baseline vs inverse-governed output |
| 5 | Score it with the [Evaluator](./runtime/inverse-eval.txt) |
| 6 | Then read the [Paper PDF](./paper/inverse-troubleshooting-atlas-pre-generative-governance-for-ai-legitimacy.pdf) |
1. Start with [Inverse Atlas Advanced](./runtime/inverse-advanced.txt)
2. Run the [Demo Harness](./runtime/inverse-demo.txt)
3. Pick one killer case from the [Case Pack](./runtime/inverse-cases.txt)
4. Compare baseline vs inverse-governed output
5. Score the difference with the [Evaluator](./runtime/inverse-eval.txt)
6. Then read the [paper PDF](./paper/inverse-troubleshooting-atlas-pre-generative-governance-for-ai-legitimacy.pdf)
If you want the supporting docs first:
**Docs**
- [Quick Start](./quickstart.md)
- [Runtime Guide](./runtime-guide.md)
- [Experiments Hub](./experiments/README.md)
@ -59,151 +37,84 @@ If you want the supporting docs first:
---
## 🚀 What Inverse Atlas Is For
## 🚀 What It Is For
Inverse Atlas is designed to reduce a specific family of AI failures:
| Failure family | What goes wrong |
|---|---|
| early illegal resolution | the model closes too early |
| false certainty | the tone outruns the support |
| neighboring-cut collapse | live alternatives disappear too soon |
| cosmetic repair inflation | surface cleanup is mislabeled as structural repair |
| public overclaim | final output exceeds the evidence ceiling |
- early illegal resolution
- false certainty under weak support
- neighboring-cut collapse
- cosmetic repair posing as structural repair
- public-facing conclusions that outrun the current evidence ceiling
In simple language:
**it does not merely help AI answer**
**it helps AI answer lawfully**
That means the system is allowed to:
- stay broad when broad is all that is justified
- stay unresolved when ambiguity is still real
- stop when a stronger answer would be illegitimate
- propose repair only when the repair is more than surface cleanup
A fluent answer is not enough.
A plausible answer is not enough.
A detailed answer is not enough.
**The answer must be earned.**
**Inverse Atlas does not merely help AI answer. It helps AI answer lawfully.**
---
## 🧩 Pick Your Runtime
### 1. [Inverse Atlas Advanced](./runtime/inverse-advanced.txt)
**Recommended default**
| Runtime | Role | Best for | Start here? |
|---|---|---|---|
| [Advanced](./runtime/inverse-advanced.txt) | recommended default | serious use, demos, comparison, first public experience | **Yes** |
| [Basic](./runtime/inverse-basic.txt) | fastest onboarding | casual first try, onboarding, quick copy-paste use | maybe |
| [Strict](./runtime/inverse-strict.txt) | audit / stress / research | hard-case review, benchmark pressure, internal audits | only if testing hard |
Use this first if you want the best overall MVP experience.
Advanced is the main product-facing runtime:
- legitimacy-first
- readable and useful
- strong enough for serious testing
- balanced between governance and practical usability
Best for:
- general serious use
- product demos
- side-by-side comparison
- first public experience
### 2. [Inverse Atlas Basic](./runtime/inverse-basic.txt)
**Fastest onboarding**
Use this if you want lower friction and more natural user-facing output.
Basic is designed for:
- easier first contact
- simpler prompting
- natural prose output
- useful lawful answers without heavy structure exposure
Best for:
- casual first try
- onboarding
- quick copy-paste use
- lightweight daily testing
### 3. [Inverse Atlas Strict](./runtime/inverse-strict.txt)
**Audit / stress / research mode**
Use this when you want the hardest legality discipline.
Strict is designed for:
- audit-style runs
- benchmark pressure
- evaluator alignment
- structural stress testing
- research demonstrations
Best for:
- hard-case review
- black-hat testing
- internal audits
- structured-output analysis
**Quick rule**
- Start with **Advanced**
- Use **Basic** for lower friction
- Use **Strict** for hardest legality discipline
---
## 🎯 Killer Demo
The killer demo is not:
| Part | What it does |
|---|---|
| [Demo Harness](./runtime/inverse-demo.txt) | compares baseline vs inverse-governed output |
| [Evaluator](./runtime/inverse-eval.txt) | scores legality, not swagger |
| [Case Pack](./runtime/inverse-cases.txt) | provides pressure-tested scenarios |
“look, the answer sounds nicer.”
| Recommended case | What it reveals |
|---|---|
| thin evidence forced confidence | overclaim under weak support |
| neighboring-cut conflict | fake closure while alternatives remain alive |
| illegal resolution demand | forced escalation beyond lawful support |
| world alignment instability | unstable frame, invalid strong output |
The killer demo is:
“look where ordinary direct generation over-resolves, overcommits, fakes repair, or speaks past its evidence ceiling, and how Inverse Atlas changes that order.”
Use:
- [Demo Harness](./runtime/inverse-demo.txt)
- [Evaluator](./runtime/inverse-eval.txt)
- [Case Pack](./runtime/inverse-cases.txt)
Recommended killer cases:
- thin evidence forced confidence
- neighboring-cut conflict
- illegal resolution demand
- world alignment instability
What the demo should reveal:
- a baseline may sound stronger while being less lawful
- confident tone does not equal authorized output
- rhetorical closure does not equal structural closure
- lawful restraint is not weakness
- ambiguity honestly preserved is often better than fake completion
This matters because many of the frameworks benefits are invisible if you only look at one final answer.
The demo makes the order change visible.
| Baseline may look stronger | Inverse Atlas may still be better because |
|---|---|
| more confident | confidence is not authorization |
| more final | rhetorical closure is not structural closure |
| more detailed | detail can exceed the evidence ceiling |
| more decisive | lawful restraint is not weakness |
---
## 📊 What the MVP Measures
This MVP does not ask you to “just trust the vibe.”
| Metric | What it asks |
|---|---|
| Legality Win Rate | does inverse beat baseline on legality more often? |
| Failure Code Reduction | do major failure patterns decrease? |
| Expected-State Match | does the runtime land in the lawful mode for the case? |
| Seven-Dimension Evaluation | does the output hold up across full legality review? |
It gives you a direct comparison surface.
<details>
<summary><strong>Evaluation details</strong></summary>
The current measurement idea is simple and inspectable:
### 1. Legality Win Rate
Across the case pack, does the inverse-governed answer win on legality more often than the baseline?
### 2. Failure Code Reduction
Does Inverse Atlas reduce major failure patterns such as:
### Major failure patterns
- illegal resolution escalation
- neighboring-cut dishonesty
- cosmetic-only repair posing as structural
- public ceiling exceedance
### 3. Expected-State Match
For each case, does the runtime land in a lawful mode such as:
- STOP
- COARSE
- UNRESOLVED
- AUTHORIZED
### Expected lawful modes
- `STOP`
- `COARSE`
- `UNRESOLVED`
- `AUTHORIZED`
### 4. Seven-Dimension Evaluation
Use the evaluator to judge:
### Evaluator dimensions
- problem frame legality
- world alignment honesty
- route judgment plausibility
@ -212,288 +123,146 @@ Use the evaluator to judge:
- repair legality
- public ceiling compliance
This is the important boundary:
</details>
**we are not claiming universal proof at MVP stage**
**we are claiming a directly inspectable legality-centered comparison surface**
**Boundary:** we are not claiming universal proof at MVP stage.
**Claim:** we are offering a directly inspectable legality-centered comparison surface.
In other words:
**not “trust us”**
**run the killer cases and inspect the deltas**
**Not “trust us.” Run the killer cases and inspect the deltas.**
---
## 🧠 Why This Exists
The first appearance of the forward atlas should be explicit:
The forward atlas, [Troubleshooting Atlas](../wfgy-ai-problem-map-troubleshooting-atlas.md), improved the first structural cut.
The forward atlas, [Troubleshooting Atlas](../wfgy-ai-problem-map-troubleshooting-atlas.md), helps the system find the likely structural region of failure.
But one problem remained:
That was a major step.
But a second problem remained:
even if the route looks promising, that does **not** automatically mean the model has earned the right to emit a strong answer yet.
> even if a route looks promising, that does not automatically mean the model has earned the right to emit a strong answer yet
That second half is the job of Inverse Atlas.
So the split is clean:
### Troubleshooting Atlas
Route-first structural orientation
It helps answer:
- where is the failure likely located
- what family or region is active
- what is the likely first structural move
### Inverse Atlas
Legitimacy-first generation governance
It helps answer:
- may the system answer yet
- how strongly may it answer
- must it remain broad, unresolved, or stop
- is the proposed repair structural or cosmetic
- is the public emission ceiling being exceeded
One layer provides the map.
The other governs the right to speak from within the map.
That is why Inverse Atlas is not a side note.
It is a second major atlas line.
| Layer | Core job | Main question |
|---|---|---|
| [Troubleshooting Atlas](../wfgy-ai-problem-map-troubleshooting-atlas.md) | route-first structural orientation | where is the failure likely located? |
| Inverse Atlas | legitimacy-first generation governance | has the system earned the right to resolve this yet? |
---
## 🛠️ What It Actually Does
## 🛠️ The 7 Legality Gates
At MVP level, Inverse Atlas governs generation through seven checks:
1. **Problem Constitution**
Has the problem been formed clearly enough for lawful reasoning?
2. **World Alignment**
Is the active world frame aligned enough for the answer to mean anything?
3. **Route / Collapse Estimate**
What is the leading structural route, and how risky would premature resolution be?
4. **Neighboring-Cut Review**
Are nearby competing routes still materially alive?
5. **Resolution Authorization**
Has the system actually earned the right to resolve at this level?
6. **Repair Legality**
Is the proposed fix structural, tentative, or merely cosmetic?
7. **Public Emission Control**
Would the final visible answer exceed what is currently supportable?
This means Inverse Atlas does not merely check style.
It governs:
- whether the model may answer
- how far it may go
- when ambiguity must be preserved
- when repair must stay tentative
- when strong output is lawful
- when stopping is the correct result
| Gate | Purpose |
|---|---|
| Problem Constitution | is the problem formed clearly enough? |
| World Alignment | is the active frame aligned enough to mean anything? |
| Route / Collapse Estimate | what is the leading route and how risky is premature resolution? |
| Neighboring-Cut Review | are competing routes still materially alive? |
| Resolution Authorization | has the system earned the right to resolve at this level? |
| Repair Legality | is the proposed fix structural, tentative, or cosmetic? |
| Public Emission Control | would the visible answer exceed current support? |
---
## 🚦 The Four Governance Modes
## 🚦 The 4 Governance Modes
Inverse Atlas uses four main output states:
| Mode | Use it when |
|---|---|
| `STOP` | the problem is too under-formed, weakly grounded, or unstable for substantive output |
| `COARSE` | broad structure is visible, but finer claims would overreach |
| `UNRESOLVED` | one route leads, but a competing route remains materially alive |
| `AUTHORIZED` | the frame, support, and separation are strong enough for strong output |
### STOP
Use when the problem is too under-formed, too weakly grounded, or too unstable for substantive output.
### COARSE
Use when broad structure is visible, but finer claims would overreach.
### UNRESOLVED
Use when one route leads, but a competing route remains materially alive.
### AUTHORIZED
Use only when the problem frame, world alignment, route separation, and requested detail are strong enough to justify strong output.
The key principle:
**AUTHORIZED is earned, not assumed.**
**Key principle:** `AUTHORIZED` is earned, not assumed.
---
## 🔥 What Actually Changes When You Use It
## 🔥 What Actually Changes
If Inverse Atlas is working, you should see less of this:
- early illegal closure
- unsupported specificity
- topic lure turning into fake diagnosis
- cosmetic rewrite being mislabeled as structural repair
- final answers that outrun evidence
And more of this:
- lawful restraint
- honest ambiguity
- cleaner uncertainty handling
- better distinction between route guess and authorized emission
- better repair honesty
- safer public output at the right resolution
This is not a cosmetic improvement layer.
It changes the order of cognition:
- orientation first
- governance second
- emission only after authorization
| Less of this | More of this |
|---|---|
| early illegal closure | lawful restraint |
| unsupported specificity | honest ambiguity |
| topic lure becoming fake diagnosis | cleaner uncertainty handling |
| cosmetic rewrite mislabeled as structural repair | better repair honesty |
| answers outrunning evidence | safer public output at the right resolution |
---
## 🧪 Included in the Current MVP
The current Inverse Atlas MVP already includes:
| Layer | Includes | Why it matters |
|---|---|---|
| Runtime | [Advanced](./runtime/inverse-advanced.txt), [Basic](./runtime/inverse-basic.txt), [Strict](./runtime/inverse-strict.txt), [Runtime Notes](./runtime/README.md) | the core operating surface |
| Demo / Evaluation | [Demo Harness](./runtime/inverse-demo.txt), [Evaluator](./runtime/inverse-eval.txt), [Case Pack](./runtime/inverse-cases.txt), [Experiments Hub](./experiments/README.md), [Showcase Cases](./experiments/showcase-cases.md), [Evidence Snapshot](./experiments/evidence-snapshot.md), [Case Studies](./experiments/case-studies/README.md), [Colab](./colab.md) | the public comparison surface |
| Theory | [Paper PDF](./paper/inverse-troubleshooting-atlas-pre-generative-governance-for-ai-legitimacy.pdf), [Paper Notes](./paper/README.md), [Figures](./figures/README.md) | the formal explanatory layer |
- [Inverse Atlas Advanced](./runtime/inverse-advanced.txt)
- [Inverse Atlas Basic](./runtime/inverse-basic.txt)
- [Inverse Atlas Strict](./runtime/inverse-strict.txt)
- [Demo Harness](./runtime/inverse-demo.txt)
- [Evaluator](./runtime/inverse-eval.txt)
- [Case Pack](./runtime/inverse-cases.txt)
- [Runtime Notes](./runtime/README.md)
- [Experiments Hub](./experiments/README.md)
- [Showcase Cases](./experiments/showcase-cases.md)
- [Evidence Snapshot](./experiments/evidence-snapshot.md)
- [Case Studies](./experiments/case-studies/README.md)
- [Colab / Reproduction Entry](./colab.md)
- [Paper PDF](./paper/inverse-troubleshooting-atlas-pre-generative-governance-for-ai-legitimacy.pdf)
- [Paper Notes](./paper/README.md)
- [Figures](./figures/README.md)
This is already enough to make the current layer:
- visible
- testable
- comparable
- discussable
- attackable in public
That matters.
Because a framework that cannot be surfaced as an inspectable object is much harder to evaluate honestly.
This is already enough to make the current layer visible, testable, comparable, and publicly attackable.
---
## 📘 Paper, Figures, and Theory
## 📘 Paper and Theory
If you want the formal layer, go here:
| Resource | Link |
|---|---|
| Paper PDF | [Read](./paper/inverse-troubleshooting-atlas-pre-generative-governance-for-ai-legitimacy.pdf) |
| Paper Notes | [Open](./paper/README.md) |
| Figures | [Open](./figures/README.md) |
- [Read the paper PDF](./paper/inverse-troubleshooting-atlas-pre-generative-governance-for-ai-legitimacy.pdf)
- [Read the paper notes](./paper/README.md)
- [See the figures](./figures/README.md)
The paper is not a footnote.
It is the formal surface that explains:
- why this is not just another checker
- why legitimacy failure is earlier than output-quality failure
- why the demo harness matters
- why the evaluator is legality-centered
- why the case pack defines the MVP benchmark seed
- why forward-layer mapping and inverse-layer governance are complementary
The paper explains why this is not just another checker, why legitimacy failure is earlier than output-quality failure, and why the demo harness, evaluator, and case pack matter together.
---
## 💬 Quick FAQ
### Is this just a stricter prompt?
No.
It changes the order of generation.
Instead of answer first and clean up later, it asks whether the answer is currently lawful enough to emit.
<details>
<summary><strong>Is this just a stricter prompt?</strong></summary>
### Which runtime should I start with?
Start with [Inverse Atlas Advanced](./runtime/inverse-advanced.txt).
It is the recommended default.
No. It changes the order of generation. Instead of answer first and clean up later, it asks whether the answer is lawful enough to emit.
</details>
<details>
<summary><strong>Which runtime should I start with?</strong></summary>
Start with [Advanced](./runtime/inverse-advanced.txt). It is the recommended default.
</details>
<details>
<summary><strong>What does the killer demo actually show?</strong></summary>
### What does the killer demo actually show?
It shows where a plausible direct-answer baseline escalates too early, overclaims certainty, skips neighboring-cut honesty, or presents cosmetic repair as structural.
### Do I need the full experiment stack to understand it?
No.
You can start with Advanced + Demo Harness + one killer case.
The experiment layer simply gives you a cleaner public comparison surface.
</details>
### Is this already claiming universal benchmark superiority?
No.
This README describes an MVP product direction with a runtime, demo, evaluator, case pack, paper, and figure set.
It does not claim that the full closed-loop WFGY 4.0 architecture is already complete.
<details>
<summary><strong>Do I need the full experiment stack to understand it?</strong></summary>
No. Start with Advanced + Demo Harness + one killer case. The experiment layer gives you a cleaner public comparison surface.
</details>
---
## ⛔ What Is Not Yet Claimed
## ⛔ Current Boundary
This page does **not** claim:
- full hallucination elimination
- universal superiority across all tasks
- a completed production operating system
- a finished forward-plus-inverse closed loop
- a fully completed WFGY 4.0 bridge implementation
The current claim is narrower and stronger:
**Inverse Atlas already exists as a real MVP artifact layer**
**the broader architecture is still ahead**
**Current claim:** Inverse Atlas already exists as a real MVP artifact layer.
---
## 🌉 Where This Goes Next
## 🏁 Position in the Atlas Family
Inverse Atlas is not the end state.
| Layer | Role |
|---|---|
| [Troubleshooting Atlas](../wfgy-ai-problem-map-troubleshooting-atlas.md) | find the likely structural region of failure |
| Inverse Atlas | govern whether the system has earned the right to resolve |
| [Twin Atlas README](../Twin_Atlas/README.md) | broader conceptual pairing direction |
It is one side of a larger architecture.
If the route-first layer keeps improving structural orientation, and the inverse layer keeps improving output legitimacy, then the next natural step is tighter pairing inside the broader twin-atlas direction.
For that conceptual pairing layer, see:
- [Twin Atlas README](../Twin_Atlas/README.md)
The larger vision is not:
“make answers look safer.”
The larger vision is:
**make generative systems know when they have, and have not, earned the right to speak strongly**
That is a much bigger shift.
---
## 🏁 Final Positioning
Inverse Atlas is not just another GitHub artifact.
It is a new governance layer for AI generation.
It says:
- not every prompt has earned an answer
- not every likely route has earned public resolution
- not every repair has earned the word structural
- not every strong tone has earned trust
The forward atlas, [Troubleshooting Atlas](../wfgy-ai-problem-map-troubleshooting-atlas.md), helps answer:
**where is the failure likely located?**
Inverse Atlas answers the second question:
**has the system actually earned the right to resolve this yet?**
Put together, those two questions create a much stronger family.
That is why Inverse Atlas is not a side feature.
It is a second major atlas line, and a necessary step toward a larger generation architecture that treats legitimacy as seriously as intelligence. ✨
Inverse Atlas is a second major atlas line in a larger generation architecture that treats legitimacy as seriously as intelligence. ✨