mirror of
https://github.com/onestardao/WFGY.git
synced 2026-04-28 11:40:07 +00:00
Update eval_playbook.md
This commit is contained in:
parent
0d5e04558c
commit
fbe1f71dbf
1 changed files with 5 additions and 0 deletions
|
|
@ -16,6 +16,11 @@
|
|||
> If you need the full triage and all prescriptions, return to the Emergency Room lobby.
|
||||
</details>
|
||||
|
||||
> **Evaluation disclaimer (observability playbook)**
|
||||
> The signals in this playbook come from specific logging and probing setups.
|
||||
> They are tools for monitoring behavior, not proofs of safety or correctness on their own.
|
||||
|
||||
---
|
||||
|
||||
A compact playbook to **stabilize evaluation** and ensure results are reproducible.
|
||||
Use this when metrics look inconsistent, coverage drifts, or benchmarks feel untrustworthy.
|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue