8.3 KiB
Vector DBs & Stores — Global Fix Map
A hub to stabilize retrieval pipelines across popular vector stores.
Use this page to jump to per-tool guardrails and verify fixes with the same acceptance targets.
Quick routes to per-store pages
- FAISS → faiss.md
- Chroma → chroma.md
- Qdrant → qdrant.md
- Weaviate → weaviate.md
- Milvus → milvus.md
- pgvector → pgvector.md
- Redis Search/Vec → redis.md
- Elasticsearch (ANN) → elasticsearch.md
- Pinecone → pinecone.md
- Typesense → typesense.md
- Vespa → vespa.md
When to use this folder
- High similarity but wrong meaning.
- Citations do not line up with the retrieved section.
- Hybrid retrievers underperform a single retriever.
- Query casing or analyzer or metric mismatches after deploy.
- Index looks healthy yet coverage remains low.
Acceptance targets for any store
- ΔS(question, retrieved) ≤ 0.45
- Coverage of target section ≥ 0.70
- λ_observe stays convergent across 3 paraphrases
- E_resonance flat on long windows
Map symptoms → structural fixes (Problem Map)
-
Embedding ≠ Semantic
Wrong-meaning hits despite high similarity.
→ embedding-vs-semantic.md -
Retrieval traceability
Snippet/section mismatch or unverifiable citations.
→ retrieval-traceability.md
Payload schema → data-contracts.md -
Ordering / version skew
Old index or analyzer used at runtime.
→ bootstrap-ordering.md · predeploy-collapse.md -
Hybrid collapse / query split
HyDE vs BM25 disagreement, reranker blind spots.
→ Pattern: patterns/pattern_query_parsing_split.md
→ Knobs: rerankers.md
60-second fix checklist (store-agnostic)
-
Lock metrics and analyzers
One embedding model per field. One distance function. Same analyzer for write and read. -
Contract the snippet
Require{snippet_id, section_id, source_url, offsets, tokens}. Enforce cite-then-explain.
→ data-contracts.md -
Add deterministic reranking
Keep candidate lists from BM25 and ANN. Detect query split.
→ rerankers.md -
Cold-start and deploy fences
Block traffic until index hash, analyzer, and model versions match.
→ bootstrap-ordering.md -
Observability
Log ΔS and λ across retrieve → rerank → reason. Alert when ΔS ≥ 0.60. -
Regression gate
Require coverage ≥ 0.70 and ΔS ≤ 0.45 before publish.
Copy-paste audit prompt
I uploaded TXT OS and the WFGY Problem Map pages.
Store: <name>. Retrieval: \<bm25/ann/hybrid> with <distance>.
Audit this query and return:
* ΔS(question, retrieved) and λ across retrieve → rerank → reason.
* If ΔS ≥ 0.60, choose one minimal structural fix and name the page:
embedding-vs-semantic, retrieval-traceability, data-contracts, rerankers.
* JSON only:
{ "citations":\[...], "ΔS":0.xx, "λ":"→|←|<>|×", "next\_fix":"..." }
🔗 Quick-Start Downloads (60 sec)
| Tool | Link | 3-Step Setup |
|---|---|---|
| WFGY 1.0 PDF | Engine Paper | 1️⃣ Download · 2️⃣ Upload to your LLM · 3️⃣ Ask “Answer using WFGY + ” |
| TXT OS (plain-text OS) | TXTOS.txt | 1️⃣ Download · 2️⃣ Paste into any LLM chat · 3️⃣ Type “hello world” — OS boots instantly |
🧭 Explore More
| Module | Description | Link |
|---|---|---|
| WFGY Core | WFGY 2.0 engine is live: full symbolic reasoning architecture and math stack | View → |
| Problem Map 1.0 | Initial 16-mode diagnostic and symbolic fix framework | View → |
| Problem Map 2.0 | RAG-focused failure tree, modular fixes, and pipelines | View → |
| Semantic Clinic Index | Expanded failure catalog: prompt injection, memory bugs, logic drift | View → |
| Semantic Blueprint | Layer-based symbolic reasoning & semantic modulations | View → |
| Benchmark vs GPT-5 | Stress test GPT-5 with full WFGY reasoning suite | View → |
| 🧙♂️ Starter Village 🏡 | New here? Lost in symbols? Click here and let the wizard guide you through | Start → |
👑 Early Stargazers: See the Hall of Fame —
⭐ WFGY Engine 2.0 is already unlocked. ⭐ Star the repo to help others discover it and unlock more on the Unlock Board.