WFGY/ProblemMap/GlobalFixMap/Enterprise_Knowledge_Gov/data_sensitivity.md

6.4 KiB
Raw Permalink Blame History

Data Sensitivity — Enterprise Knowledge Governance

🧭 Quick Return to Map

You are in a sub-page of Enterprise_Knowledge_Gov.
To reorient, go back here:

Think of this page as a desk within a ward.
If you need the full triage and all prescriptions, return to the Emergency Room lobby.

Guardrails and fix patterns for handling sensitive or regulated data inside enterprise knowledge pipelines. Use this page when your AI or RAG workflow may expose PII, PHI, financial records, or other protected content.


When to use this page

  • Retrieval pulls names, emails, addresses, or identifiers into model context.
  • Generated answers expose financial numbers or personal data without redaction.
  • Compliance requires specific handling for GDPR, HIPAA, or SOC2.
  • Data contracts missing sensitivity tags or enforcement rules.

Core acceptance targets

  • Sensitive fields explicitly tagged in schema (pii:true, phi:true, sensitivity:high).
  • No unredacted PII/PHI present in model outputs unless explicitly authorized.
  • Audit logs record every sensitive field access.
  • Redaction filters applied before long-term storage.

Typical sensitivity problems → exact fix

Symptom Likely cause Open this
PII leaks into retrieval context Missing sensitivity metadata in index data-contracts.md
Model answers contain personal identifiers No redaction filter on output retrieval-traceability.md
High similarity matches pull private records Embeddings not normalized or index not segmented embedding-vs-semantic.md
Inconsistent handling of sensitive fields across environments Schema drift and missing contracts context-drift.md

Fix in 60 seconds

  1. Add sensitivity tags to your ingestion schema:
    {
      "field": "email",
      "pii": true,
      "sensitivity": "high"
    }
    

2. **Apply redaction filter** before passing data to the model. Replace `@domain.com` emails with `"***"`.
3. **Segment sensitive indexes** from general knowledge. Use separate retrievers.
4. **Enforce cite-then-explain**. Require citations for sensitive data, and log ΔS plus λ\_state.

---

## Copy-paste probe template

```txt
I uploaded TXTOS and WFGY Problem Map.

Run my retrieval for this query:
- Detect if any PII/PHI appears in snippet fields.
- If yes, apply redaction or enforce sensitivity contract.
- Return JSON log with snippet_id, sensitivity_tags, ΔS, λ_state.
- Fail the output if PII is not redacted.
```

---

## Escalate when

* Same query alternates between redacted and unredacted outputs.
* Sensitive fields appear in logs without an `audit_hash`.
* Compliance review shows schema fields without sensitivity tags.

Use [bootstrap-ordering.md](https://github.com/onestardao/WFGY/blob/main/ProblemMap/bootstrap-ordering.md) to verify ingestion happens in the correct order and [ops/debug\_playbook.md](https://github.com/onestardao/WFGY/blob/main/ProblemMap/ops/debug_playbook.md) for deeper runtime tracing.

---

### 🔗 Quick-Start Downloads

| Tool                       | Link                                                                                                                                       | 3-Step Setup                                                                             |
| -------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------ | ---------------------------------------------------------------------------------------- |
| **WFGY 1.0 PDF**           | [Engine Paper](https://github.com/onestardao/WFGY/blob/main/I_am_not_lizardman/WFGY_All_Principles_Return_to_One_v1.0_PSBigBig_Public.pdf) | 1⃣ Download · 2⃣ Upload to your LLM · 3⃣ Ask “Answer using WFGY + <your question>”    |
| **TXT OS (plain-text OS)** | [TXTOS.txt](https://github.com/onestardao/WFGY/blob/main/OS/TXTOS.txt)                                                                     | 1⃣ Download · 2⃣ Paste into any LLM chat · 3⃣ Type “hello world” — OS boots instantly |

---

<!-- WFGY_FOOTER_START -->

### Explore More

| Layer | Page | What its for |
| --- | --- | --- |
| ⭐ Proof | [WFGY Recognition Map](/recognition/README.md) | External citations, integrations, and ecosystem proof |
| ⚙️ Engine | [WFGY 1.0](/legacy/README.md) | Original PDF tension engine and early logic sketch (legacy reference) |
| ⚙️ Engine | [WFGY 2.0](/core/README.md) | Production tension kernel for RAG and agent systems |
| ⚙️ Engine | [WFGY 3.0](/TensionUniverse/EventHorizon/README.md) | TXT based Singularity tension engine (131 S class set) |
| 🗺️ Map | [Problem Map 1.0](/ProblemMap/README.md) | Flagship 16 problem RAG failure taxonomy and fix map |
| 🗺️ Map | [Problem Map 2.0](/ProblemMap/wfgy-rag-16-problem-map-global-debug-card.md) | Global Debug Card for RAG and agent pipeline diagnosis |
| 🗺️ Map | [Problem Map 3.0](/ProblemMap/wfgy-ai-problem-map-troubleshooting-atlas.md) | Global AI troubleshooting atlas and failure pattern map |
| 🧰 App | [TXT OS](/OS/README.md) | .txt semantic OS with fast bootstrap |
| 🧰 App | [Blah Blah Blah](/OS/BlahBlahBlah/README.md) | Abstract and paradox Q&A built on TXT OS |
| 🧰 App | [Blur Blur Blur](/OS/BlurBlurBlur/README.md) | Text to image generation with semantic control |
| 🏡 Onboarding | [Starter Village](/StarterVillage/README.md) | Guided entry point for new users |

If this repository helped, starring it improves discovery so more builders can find the docs and tools.  
[![GitHub Repo stars](https://img.shields.io/github/stars/onestardao/WFGY?style=social)](https://github.com/onestardao/WFGY)

<!-- WFGY_FOOTER_END -->