Update eval_latency_vs_accuracy.md

PSBigBig × MiniPS 2026-02-26 15:46:25 +08:00 committed by GitHub
parent 27c9327d9f
commit bdeab6d983


@@ -16,6 +16,11 @@
> If you need the full triage and all prescriptions, return to the Emergency Room lobby.
</details>
> **Evaluation disclaimer (latency vs accuracy)**
> The trade-off curves and numbers here depend on your stack, load, and datasets.
> Treat them as shapes to look for, not fixed targets that prove one model or setting is always better.
---
This page defines how to measure, report, and optimize the trade-off between model latency and retrieval/answer accuracy. It is not enough to chase precision; stable systems must also meet latency SLOs while holding ΔS and λ within guardrails.
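
As a rough illustration of reporting both sides of this trade-off for a single configuration, the sketch below summarizes accuracy alongside latency percentiles and checks a p95 latency SLO. The names `RunResult`, `percentile`, `summarize`, and `slo_p95_ms` are hypothetical and not part of this page, and the nearest-rank percentile is just one reasonable choice. ΔS and λ are not computed here; they would need to be checked separately against their guardrails.

```python
import math
from dataclasses import dataclass


@dataclass
class RunResult:
    """One evaluated query (hypothetical record shape)."""
    latency_ms: float
    correct: bool


def percentile(values, q):
    """Nearest-rank percentile for q in (0, 100]."""
    ordered = sorted(values)
    rank = max(1, math.ceil(q / 100 * len(ordered)))
    return ordered[rank - 1]


def summarize(results, slo_p95_ms):
    """Report accuracy and latency together, plus an SLO pass/fail flag."""
    latencies = [r.latency_ms for r in results]
    p95 = percentile(latencies, 95)
    return {
        "accuracy": sum(r.correct for r in results) / len(results),
        "p50_ms": percentile(latencies, 50),
        "p95_ms": p95,
        "meets_slo": p95 <= slo_p95_ms,
    }


if __name__ == "__main__":
    # Made-up sample values, for illustration only.
    runs = [
        RunResult(100.0, True),
        RunResult(200.0, True),
        RunResult(300.0, False),
        RunResult(400.0, True),
    ]
    print(summarize(runs, slo_p95_ms=350.0))
```

Running the same summary for each candidate setting gives comparable rows, so a setting with higher accuracy that breaks the latency SLO is visible as such rather than ranked on accuracy alone.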