Update caption-collapse.md

PSBigBig 2025-09-05 11:29:27 +08:00 committed by GitHub
parent 5064a15a67
commit 531475c760
GPG key ID: B5690EEEBB952194

@@ -1,5 +1,22 @@
# Caption Collapse — Multimodal Long Context
<details>
<summary><strong>🧭 Quick Return to Map</strong></summary>
<br>
> You are in a sub-page of **Multimodal_LongContext**.
> To reorient, go back here:
>
> - [**Multimodal_LongContext** — long-context reasoning across text, vision, and audio](./README.md)
> - [**WFGY Global Fix Map** — main Emergency Room, 300+ structured fixes](../README.md)
> - [**WFGY Problem Map 1.0** — 16 reproducible failure modes](../../README.md)
>
> Think of this page as a desk within a ward.
> If you need the full triage and all prescriptions, return to the Emergency Room lobby.
</details>
When captions or annotations degrade over long context windows, multimodal pipelines lose the alignment between text and media, and with it their factual grounding.
This page focuses on keeping captions for images, videos, and diagrams stable across extended sessions.