Update eval_drift.md

PSBigBig × MiniPS 2026-02-26 16:34:14 +08:00 committed by GitHub
parent a2117d2d0a
commit 83b54e16f7


@@ -16,6 +16,11 @@
> If you need the full triage and all prescriptions, return to the Emergency Room lobby.
</details>
> **Evaluation disclaimer (RAG drift)**
> Drift signals here are measured inside specific RAG pipelines and datasets.
> They are debugging indicators, not proof that a system will stay stable in all real workloads.
---
When evaluation metrics (precision, recall, ΔS, coverage) **swing unpredictably** across runs even though the data and index appear unchanged,
this signals **eval drift**: your evaluation harness is not structurally stable.
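
One way to make "swing unpredictably" concrete is to repeat the same evaluation several times on unchanged data and compare the run-to-run spread of each metric against a tolerance. The sketch below is only an illustration under stated assumptions: the function name `drift_report`, the relative tolerance `rel_tol=0.05`, and the sample numbers are all hypothetical and do not come from this repository.

```python
from statistics import mean, pstdev


def drift_report(runs, rel_tol=0.05):
    """Flag metrics whose run-to-run spread exceeds a relative tolerance.

    runs:    list of dicts (metric name -> value), one dict per evaluation run
             executed against the same data and index.
    rel_tol: hypothetical threshold on stdev / |mean|; tune it per pipeline.
    """
    report = {}
    for name in runs[0]:
        values = [run[name] for run in runs]
        avg = mean(values)
        spread = pstdev(values)  # population stdev across the repeated runs
        if avg != 0:
            relative = spread / abs(avg)
        else:
            # A zero mean with non-zero spread is treated as unstable.
            relative = float("inf") if spread > 0 else 0.0
        report[name] = {
            "mean": avg,
            "stdev": spread,
            "unstable": relative > rel_tol,
        }
    return report


if __name__ == "__main__":
    # Made-up numbers: precision is steady, recall swings between runs.
    sample_runs = [
        {"precision": 0.80, "recall": 0.60},
        {"precision": 0.80, "recall": 0.75},
        {"precision": 0.80, "recall": 0.45},
    ]
    for metric, stats in drift_report(sample_runs).items():
        print(metric, stats)
```

A metric flagged as unstable here is a debugging indicator in the same sense as the disclaimer above: it says the harness produced different numbers for the same inputs, not why.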