mirror of
https://github.com/onestardao/WFGY.git
synced 2026-04-28 11:40:07 +00:00
Update eval_drift.md
parent a2117d2d0a
commit 83b54e16f7
1 changed file with 5 additions and 0 deletions
@@ -16,6 +16,11 @@
> If you need the full triage and all prescriptions, return to the Emergency Room lobby.
</details>

> **Evaluation disclaimer (RAG drift)**
> Drift signals here are measured inside specific RAG pipelines and datasets.
> They are debugging indicators, not proof that a system will stay stable in all real workloads.

---

When evaluation metrics **swing unpredictably** across runs (precision, recall, ΔS, coverage) even though the data and index appear unchanged.
This signals **eval drift**: your evaluation harness is not structurally stable.