WFGY/Avatar/eval/blackfan-testing.md
2026-04-01 17:31:53 +08:00

507 lines
12 KiB
Markdown

<!--
AI_NOTE_START
Document role:
This page explains the blackfan testing surface of WFGY 5.0 Avatar.
What this page is for:
1. Explain why blackfan-style pressure testing matters.
2. Show that Avatar should be inspected not only under friendly conditions, but also under hostile reading and aggressive scrutiny.
3. Clarify what kinds of collapse patterns blackfan testing is trying to expose.
4. Help users understand the difference between route strength and surface charm.
5. Keep this page practical, sharp, and honest without pretending all blackfan evaluation is already exhausted.
What this page is not:
1. Not the full theory of adversarial evaluation.
2. Not the full route-recognizability checklist.
3. Not a complete public archive of all future blackfan cases.
4. Not a claim that every route is already hardened against all hostile conditions.
5. Not a replacement for workflow, multilingual, or build-level evaluation.
How to use this page:
1. Read this page as the pressure-testing counterpart to cleaner evaluation pages.
2. Use it to understand what failure shapes matter when a route is attacked.
3. Compare blackfan results with normal route behavior checks.
4. Use the linked pages later if you want broader route evaluation or tuning workflow.
5. Treat this page as a stress surface, not a universal final verdict.
Important boundary:
Blackfan testing is designed to reveal weakness under pressure.
It is not the only evaluation lens.
A route can fail here in one way and still be promising in another area, but routes that collapse too easily under scrutiny should not be overclaimed.
AI_NOTE_END
-->
# 🪓 Blackfan Testing
This page explains one of the harder evaluation surfaces inside **WFGY 5.0 Avatar**:
**blackfan testing**
The idea is simple:
A route should not only look good when everything is easy.
It should also be inspected under pressure.
That pressure may come from:
- hostile reading
- aggressive questioning
- skeptical interpretation
- nitpicking
- anti-hype framing
- attempts to expose weakness
- deliberate efforts to separate real route strength from surface charm
That is what this page is about.
---
## Why This Surface Exists
A lot of AI systems look stronger than they really are when they are evaluated only in friendly conditions.
They can seem:
- warm
- smart
- clean
- impressive
- stylish
- emotionally appealing
And then the moment someone pushes harder, the route starts to reveal weaker structure underneath.
That is why blackfan testing matters.
It asks a harder question:
**what happens when the route is not being admired**
That is often where the real strength starts showing.
Or the real weakness.
---
## What “Blackfan” Means Here
“Blackfan” here is not just random negativity.
It is a pressure style.
It means reading the route from a hostile or skeptical angle and asking things like:
- is this actually distinct, or only polished
- is this real warmth, or fake sugar
- is this route strong, or just loud
- is this explanation grounded, or merely dramatic
- is this branch reusable, or only good in one lucky setup
- does this route survive hard questioning, or collapse into vagueness
- is this confidence earned, or just performed
That is a very useful kind of stress surface.
It helps reveal whether an avatar has real structure or only good cosmetics.
---
## What Blackfan Testing Is Trying to Expose
Blackfan testing is not trying to punish the route for existing.
It is trying to reveal where weakness hides.
Some of the most important failure shapes include:
- generic collapse
- over-polish collapse
- sugar collapse
- route blur
- fake depth
- fake warmth
- false confidence
- performance theater
- branch fragility
- emotional distortion under pressure
These are all worth checking.
Because a route that only survives praise is not strong enough yet.
---
## Failure Shape 1. Generic Collapse
This happens when pressure causes the route to lose whatever made it distinct.
It starts falling back into:
- default assistant tone
- safe filler language
- vague niceness
- smooth but empty generality
- low-risk, low-identity output
A route may seem special in friendly settings, then collapse into genericness the moment it is challenged.
That is a major warning sign.
---
## Failure Shape 2. Over-Polish Collapse
This happens when pressure makes the route overcompensate through polish.
It starts sounding:
- too clean
- too quoteable
- too line-perfect
- too ready-made
- too elegantly packaged
- less alive
- less grounded
- less believable
This can fool people at first, because polished failure often looks respectable.
But under harder reading, it becomes obvious that the route is protecting itself with presentation instead of substance.
That matters.
---
## Failure Shape 3. Sugar Collapse
This happens when warmth turns weak under stress.
Instead of staying emotionally real, the route becomes:
- too soft
- too eager to comfort
- too fake-friendly
- too smoothing
- too emotionally padded
- too unwilling to risk honesty
This is especially important for companion-facing or warm routes.
A route that can only survive by becoming sweeter is often weaker than it looks.
Warmth is not the same thing as sugar.
Blackfan testing helps reveal that difference.
---
## Failure Shape 4. Route Blur
This happens when the route stops feeling like itself under pressure.
The identity becomes hard to distinguish from:
- another route
- generic AI output
- a random polished assistant
- a temporary style mask
Route blur is dangerous because it weakens everything else:
- branch identity
- reuse value
- tuning clarity
- save-worthiness
- later community legibility
If the route cannot stay itself under pressure, it becomes much harder to trust.
---
## Failure Shape 5. Fake Depth
This happens when the route sounds deep without actually saying much.
Typical signs:
- ornamental abstraction
- serious tone without local grip
- vague wisdom statements
- elegant gestures toward insight
- emotional mood standing in for explanation
- conceptual fog mistaken for sophistication
Fake depth is one of the easiest traps to miss if the route is otherwise stylish.
Blackfan testing is good at exposing it because hostile reading strips away the “sounds impressive” shield.
---
## Failure Shape 6. Performance Theater
This happens when the route starts acting strong instead of being strong.
For example:
- louder confidence
- more staged authority
- dramatic phrasing
- high-pressure wording without grounded reason
- exaggerated cleverness
- visible effort to seem special
This kind of failure often looks energetic, but it is structurally weak.
It is especially dangerous in public-writing or high-charisma branches.
A route that performs strength too hard may lose real reuse value.
---
## Failure Shape 7. Branch Fragility
Some branches look distinct, but only because the exact conditions are friendly.
Then one hostile push and the route starts falling apart.
Branch fragility often shows up as:
- identity loss
- unstable pressure
- exaggerated drift
- overreaction to challenge
- quick movement into noise or flatness
This matters because a branch that cannot handle scrutiny is much weaker as a reusable build.
A route does not need to be invincible.
But it should not disintegrate too easily.
---
## Why This Matters for Reusable Builds
Reusable builds should survive more than admiration.
They should survive at least some level of aggressive inspection.
That does not mean every route needs to become a fortress.
It does mean this:
if a build only looks good when no one questions it, then it is less reusable than it seems
Blackfan testing helps separate:
- routes worth keeping
- routes worth retuning
- routes that are still mostly surface
- branches that need stronger grounding before they deserve a name
This is one reason blackfan testing matters so much for build quality.
---
## Why This Matters for Community Later
Once people begin submitting avatars later, blackfan testing becomes even more useful.
Because a community layer will eventually need to distinguish between:
- avatars that only have a cool aesthetic
- avatars that actually have route substance
- branches that survive scrutiny
- branches that are still mostly theater
Without some kind of pressure surface, a gallery can become too easy to game.
With blackfan testing, stronger branches become easier to recognize.
That is healthier for the ecosystem.
---
## How Blackfan Testing Differs From Normal Route Checks
Normal route behavior checks ask things like:
- is the route recognizable
- is it grounded
- is it too polished
- is it reusable
- is the emotional shape in range
Blackfan testing asks something harsher:
- what breaks first when the route is attacked
These are related, but not identical.
A route may look stable in ordinary checks and still reveal a hidden weakness under pressure.
That is exactly why this surface deserves its own page.
---
## A Practical Blackfan Reading Pass
A simple blackfan pass can start with questions like:
- what is fake here
- what is too polished here
- where is the route hiding behind style
- what sounds confident but is not earned
- what becomes generic under pressure
- what becomes sugary instead of honest
- what is more theater than route
These questions are intentionally a little aggressive.
That is the point.
You are not doing comfort review here.
You are stress-testing the route.
---
## Suggested Review Format
If you want a simple structure for a blackfan pass, use something like this:
```md
## Blackfan Pass
### Route
<route name>
### Task
<what was tested>
### Generic Collapse
<low / medium / high>
### Over-Polish Collapse
<low / medium / high>
### Sugar Collapse
<low / medium / high>
### Route Blur
<low / medium / high>
### Fake Depth Risk
<low / medium / high>
### Performance Theater Risk
<low / medium / high>
### Branch Fragility
<low / medium / high>
### Notes
<short honest explanation of what broke first>
````
This is not a permanent universal law.
It is a practical pressure-reading shape.
---
## What a Stronger Route Looks Like Here
A stronger route under blackfan pressure usually shows some of these signs:
* it stays more recognizably itself
* it does not immediately fall into generic AI safety language
* it does not hide behind polished slogans
* it does not become sweeter just to survive
* it keeps some grounding
* it still feels reusable after the pressure pass
* it takes the hit without becoming total noise
This is not about perfection.
It is about structural resilience.
---
## What This Page Does Not Claim
This page helps pressure-test routes, but the boundary matters.
It does **not** claim:
* that every route must perform equally well under all hostile styles
* that blackfan pressure is the only evaluation lens
* that one bad pressure pass makes a route worthless forever
* that all future blackfan cases are already publicly documented
* that pressure testing fully replaces normal route inspection
* that Avatar is already fully hardened against every hostile condition
This page is about useful stress.
Not fake totality.
---
## Why This Makes the Product Stronger
A product becomes much more serious when it can survive not only affection, but scrutiny.
That is what this page is trying to support.
Without blackfan testing, Avatar could still look interesting.
With blackfan testing, it becomes easier to ask tougher questions like:
* is this route actually real
* is this branch only pretty
* is this persona too fragile
* is this strength earned
* is this worth saving
* is this worth showing to others later
That is a much healthier direction.
This is why blackfan testing belongs in the eval layer.
---
## Where To Go Next
### If you want the eval hub
Go to [📊 Eval Hub](./README.md)
### If you want route-level checks
Go to [🧪 Persona Behavior Checks](./persona-behavior-checks.md)
### If you want multilingual status
Go to [🌍 Multilingual Status](./multilingual-status.md)
### If you want the workflow path
Go to [🧭 Avatar Tuning Workflow](../docs/avatar-tuning-workflow.md)
### If you want the highlights map
Go to [✨ Highlights Index](../highlights/README.md)
---
## Quick Links
* [🏠 Avatar Home](../README.md)
* [📊 Eval Hub](./README.md)
* [🧪 Persona Behavior Checks](./persona-behavior-checks.md)
* [🌍 Multilingual Status](./multilingual-status.md)
* [🧭 Avatar Tuning Workflow](../docs/avatar-tuning-workflow.md)
* [✨ Highlights Index](../highlights/README.md)
* [⬆️ Back to WFGY Root](../../README.md)