Update duplication_and_near_duplicate_collapse.md

This commit is contained in:
PSBigBig 2025-09-05 11:43:09 +08:00 committed by GitHub
parent a2bac767dc
commit b0ee5a03c7
No known key found for this signature in database
GPG key ID: B5690EEEBB952194

View file

@ -1,5 +1,22 @@
# Duplication and Near Duplicate Collapse — Guardrails and Fix Pattern
<details>
<summary><strong>🧭 Quick Return to Map</strong></summary>
<br>
> You are in a sub-page of **RAG_VectorDB**.
> To reorient, go back here:
>
> - [**RAG_VectorDB** — vector databases for retrieval and grounding](./README.md)
> - [**WFGY Global Fix Map** — main Emergency Room, 300+ structured fixes](../README.md)
> - [**WFGY Problem Map 1.0** — 16 reproducible failure modes](../../README.md)
>
> Think of this page as a desk within a ward.
> If you need the full triage and all prescriptions, return to the Emergency Room lobby.
</details>
Use this page when **the same passage floods your top k** under different snippet IDs or slightly different text, which blocks coverage of other relevant sections. This often happens after PDF or HTML exports, aggressive chunk overlap, HyDE variants, or multi store merges.
---