mirror of
https://github.com/onestardao/WFGY.git
synced 2026-04-28 11:40:07 +00:00
Update eval_cross_agent_consistency.md
This commit is contained in:
parent
75d34ca5fd
commit
0d5e04558c
1 changed files with 5 additions and 0 deletions
|
|
@ -16,6 +16,11 @@
|
|||
> If you need the full triage and all prescriptions, return to the Emergency Room lobby.
|
||||
</details>
|
||||
|
||||
> **Evaluation disclaimer (cross agent consistency)**
|
||||
> Agreement between agents is measured with chosen prompts and roles and can still be wrong in absolute terms.
|
||||
> Consistency scores are diagnostic tools, not proof that the agreed answer is true or safe.
|
||||
|
||||
---
|
||||
|
||||
**Goal**
|
||||
Measure and enforce agreement between two independent validators: a **Scholar** (claims/citations checker) and an **Auditor** (policy/provenance/constraints gate). Produce (1) quantitative agreement (Percent Agreement & Cohen’s κ) and (2) a deterministic **conflict-resolution policy** for ship/no-ship.
|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue