Create airtable.md

This commit is contained in:
PSBigBig 2025-08-25 21:20:32 +08:00 committed by GitHub
parent b0e6040fa5
commit ae7bd71cef
No known key found for this signature in database
GPG key ID: B5690EEEBB952194

View file

@ -0,0 +1,180 @@
# Airtable — Guardrails and Fix Patterns
Use this when your pipeline uses **Airtable** as the control plane or as the source-of-truth table for RAG/agents, and you see record drift, duplicated actions, or citations that dont map back to records.
**Acceptance targets**
- ΔS(question, retrieved) ≤ 0.45
- coverage ≥ 0.70 to the intended section/record
- λ stays convergent across 3 paraphrases
---
## Typical breakpoints → exact fixes
- Automations/webhooks fire **before** embeddings/index finish updating
Fix No.14: **Bootstrap Ordering**
[Bootstrap Ordering](https://github.com/onestardao/WFGY/blob/main/ProblemMap/bootstrap-ordering.md)
- First run after deploy reads wrong base or missing secret
Fix No.16: **Pre-Deploy Collapse**
[Pre-Deploy Collapse](https://github.com/onestardao/WFGY/blob/main/ProblemMap/predeploy-collapse.md)
- Cross-table syncs create circular waits (record-upsert → external job → back to record)
Fix No.15: **Deployment Deadlock**
[Deployment Deadlock](https://github.com/onestardao/WFGY/blob/main/ProblemMap/deployment-deadlock.md)
- High cosine similarity, wrong **meaning** (good vector match, bad semantic match)
Fix No.5: **Embedding ≠ Semantic**
[Embedding ≠ Semantic](https://github.com/onestardao/WFGY/blob/main/ProblemMap/embedding-vs-semantic.md)
- “Why this snippet?” cannot be explained; citations dont line up with source cells
Fix No.8: **Retrieval Traceability**
[Retrieval Traceability](https://github.com/onestardao/WFGY/blob/main/ProblemMap/retrieval-traceability.md)
Standardize fields with **Data Contracts**
[Data Contracts](https://github.com/onestardao/WFGY/blob/main/ProblemMap/data-contracts.md)
- Hybrid retrieval (dense + formula/filter views + external reranker) gets worse than single retriever
Pattern: **Query Parsing Split**
[Query Parsing Split](https://github.com/onestardao/WFGY/blob/main/ProblemMap/patterns/pattern_query_parsing_split.md)
Also review **Rerankers**
[Rerankers](https://github.com/onestardao/WFGY/blob/main/ProblemMap/rerankers.md)
- Facts are in the base but never retrieved
Pattern: **Vectorstore Fragmentation**
[Vectorstore Fragmentation](https://github.com/onestardao/WFGY/blob/main/ProblemMap/patterns/pattern_vectorstore_fragmentation.md)
- Two different records are merged into one narrative in the summary
Pattern: **Symbolic Constraint Unlock (SCU)**
[Symbolic Constraint Unlock](https://github.com/onestardao/WFGY/blob/main/ProblemMap/patterns/pattern_symbolic_constraint_unlock.md)
---
## Minimal Airtable workflow checklist
1) **Warm-up fence**
Verify `VECTOR_READY`, `INDEX_HASH`, `secret_rev`, and that `base_id/table_id/view_id` resolve before any LLM step.
Spec: [Bootstrap Ordering](https://github.com/onestardao/WFGY/blob/main/ProblemMap/bootstrap-ordering.md)
2) **Idempotency**
Create `dedupe_key = sha256(record_id + wf_rev + index_hash)` and store it (hidden field or external KV).
Reject duplicate writes/retries.
3) **RAG boundary contract**
Pass `record_id`, `base_id`, `table_id`, `view_id`, `field_map`, `source_url`, `offsets`, `tokens`.
Enforce **cite-then-explain**. Specs:
[Retrieval Traceability](https://github.com/onestardao/WFGY/blob/main/ProblemMap/retrieval-traceability.md) ·
[Data Contracts](https://github.com/onestardao/WFGY/blob/main/ProblemMap/data-contracts.md)
4) **Observability probes**
Log ΔS(question, retrieved) and λ per stage; alert on ΔS ≥ 0.60 or divergent λ.
Overview: [RAG Architecture & Recovery](https://github.com/onestardao/WFGY/blob/main/ProblemMap/rag-architecture-and-recovery.md)
5) **Schema stability**
Avoid free-form field renames that break downstream contracts. Pin with `schema_rev` and check it at runtime.
6) **Regression gate**
Require coverage ≥ 0.70 and ΔS ≤ 0.45 before posting back into Airtable.
Eval spec: [RAG Precision/Recall](https://github.com/onestardao/WFGY/blob/main/ProblemMap/eval/eval_rag_precision_recall.md)
---
## Copy-paste prompt for the Airtable LLM step
```
I uploaded TXT OS and the WFGY Problem Map files.
Airtable context:
* base\_id: {base}
* table\_id: {table}
* view\_id: {view}
* record\_id(s): {rids}
* fields: {field\_map}
Question: "{user\_question}"
Do:
1. Enforce cite-then-explain. If any citation lacks record\_id/section/offsets, stop and tell me which fix page to open.
2. Compute ΔS(question, retrieved). If ΔS ≥ 0.60, point me to the minimal structural fix:
retrieval-playbook, retrieval-traceability, data-contracts, rerankers.
3. Output compact JSON:
{ "citations":\[{"record\_id":"...", "field":"...", "offsets":\[s,e]}],
"answer":"...", "λ\_state":"→|←|<>|×", "ΔS":0.xx, "next\_fix":"..." }
```
---
## Common Airtable gotchas
- **Formula fields** or **lookup/rollup** not updated yet when webhook fires
Add a delay or readiness probe; gate on `schema_rev` + `index_hash`.
- **Pagination/backfill** causes missed embeddings
Log the cursor; re-ingest until the cursor is exhausted; compare counts vs. expected.
- **Field renames** break contracts silently
Pin `schema_rev`; fail fast if it changes; include `field_map` in traces.
- **Attachment/text mix** leads to partial content**
Normalize: extract attachments to text with a fixed OCR gate before embedding.
- **Rate limits** destabilize hybrid retrieval
Prefer dense retriever + reranking; keep per-retriever params in logs.
---
## When to escalate
- ΔS stays ≥ 0.60 after chunk/retrieval fixes → rebuild index with explicit metric/normalization.
See: [Retrieval Playbook](https://github.com/onestardao/WFGY/blob/main/ProblemMap/retrieval-playbook.md)
- Same inputs, different answers on different runs → check version skew and memory desync.
See: [Pre-Deploy Collapse](https://github.com/onestardao/WFGY/blob/main/ProblemMap/predeploy-collapse.md)
---
### 🔗 Quick-Start Downloads (60 sec)
| Tool | Link | 3-Step Setup |
|------|------|--------------|
| **WFGY 1.0 PDF** | [Engine Paper](https://github.com/onestardao/WFGY/blob/main/I_am_not_lizardman/WFGY_All_Principles_Return_to_One_v1.0_PSBigBig_Public.pdf) | 1⃣ Download · 2⃣ Upload to your LLM · 3⃣ Ask “Answer using WFGY + \<your question>” |
| **TXT OS (plain-text OS)** | [TXTOS.txt](https://github.com/onestardao/WFGY/blob/main/OS/TXTOS.txt) | 1⃣ Download · 2⃣ Paste into any LLM chat · 3⃣ Type “hello world” — OS boots instantly |
---
### 🧭 Explore More
| Module | Description | Link |
|-----------------------|----------------------------------------------------------|----------|
| WFGY Core | WFGY 2.0 engine is live: full symbolic reasoning architecture and math stack | [View →](https://github.com/onestardao/WFGY/tree/main/core/README.md) |
| Problem Map 1.0 | Initial 16-mode diagnostic and symbolic fix framework | [View →](https://github.com/onestardao/WFGY/tree/main/ProblemMap/README.md) |
| Problem Map 2.0 | RAG-focused failure tree, modular fixes, and pipelines | [View →](https://github.com/onestardao/WFGY/blob/main/ProblemMap/rag-architecture-and-recovery.md) |
| Semantic Clinic Index | Expanded failure catalog: prompt injection, memory bugs, logic drift | [View →](https://github.com/onestardao/WFGY/blob/main/ProblemMap/SemanticClinicIndex.md) |
| Semantic Blueprint | Layer-based symbolic reasoning & semantic modulations | [View →](https://github.com/onestardao/WFGY/tree/main/SemanticBlueprint/README.md) |
| Benchmark vs GPT-5 | Stress test GPT-5 with full WFGY reasoning suite | [View →](https://github.com/onestardao/WFGY/tree/main/benchmarks/benchmark-vs-gpt5/README.md) |
| 🧙‍♂️ Starter Village 🏡 | New here? Lost in symbols? Click here and let the wizard guide you through | [Start →](https://github.com/onestardao/WFGY/blob/main/StarterVillage/README.md) |
---
> 👑 **Early Stargazers: [See the Hall of Fame](https://github.com/onestardao/WFGY/tree/main/stargazers)**
> <img src="https://img.shields.io/github/stars/onestardao/WFGY?style=social" alt="GitHub stars"> ⭐ [WFGY Engine 2.0](https://github.com/onestardao/WFGY/blob/main/core/README.md) is already unlocked. ⭐ Star the repo to help others discover it and unlock more on the Unlock Board.
<div align="center">
[![WFGY Main](https://img.shields.io/badge/WFGY-Main-red?style=flat-square)](https://github.com/onestardao/WFGY)
&nbsp;
[![TXT OS](https://img.shields.io/badge/TXT%20OS-Reasoning%20OS-orange?style=flat-square)](https://github.com/onestardao/WFGY/tree/main/OS)
&nbsp;
[![Blah](https://img.shields.io/badge/Blah-Semantic%20Embed-yellow?style=flat-square)](https://github.com/onestardao/WFGY/tree/main/OS/BlahBlahBlah)
&nbsp;
[![Blot](https://img.shields.io/badge/Blot-Persona%20Core-green?style=flat-square)](https://github.com/onestardao/WFGY/tree/main/OS/BlotBlotBlot)
&nbsp;
[![Bloc](https://img.shields.io/badge/Bloc-Reasoning%20Compiler-blue?style=flat-square)](https://github.com/onestardao/WFGY/tree/main/OS/BlocBlocBloc)
&nbsp;
[![Blur](https://img.shields.io/badge/Blur-Text2Image%20Engine-navy?style=flat-square)](https://github.com/onestardao/WFGY/tree/main/OS/BlurBlurBlur)
&nbsp;
[![Blow](https://img.shields.io/badge/Blow-Game%20Logic-purple?style=flat-square)](https://github.com/onestardao/WFGY/tree/main/OS/BlowBlowBlow)
&nbsp;
</div>