Update memory-overwrite.md

2026-04-28 11:40:07 +00:00 · 2025-08-13 19:22:18 +08:00 · 2025-08-13 19:22:18 +08:00 · 50991f8943
commit 50991f8943
parent 5ffcdf3bc3
1 changed files with 258 additions and 26 deletions
--- a/ProblemMap/multi-agent-chaos/memory-overwrite.md
+++ b/ProblemMap/multi-agent-chaos/memory-overwrite.md
@ -1,58 +1,290 @@
-# 📒 Deep Dive · Cross‑Agent Memory Overwrite (Placeholder)
+# Deep Dive — Cross-Agent Memory Overwrite (Multi-Agent Chaos)

-> **Status:** WIP — awaiting real logs that show one agent erasing another’s state.
+> **Status:** Production-ready guidance with reproducible steps and guardrails.
+> If you have real traces, please share anonymized logs — they help harden thresholds & adapters.

 ---

-## 🔍 Typical Overwrite Scenario
+## Quick nav

-* **Agent A** saves `Plan v1` → **Agent B** unknowingly commits `Plan v0` over it  
-* Vector store returns *last‑writer‑wins* → earlier context disappears  
-* Later conversation references missing data → hallucination or contradiction
+* Back to multi-agent map → [Multi-Agent Problems](../Multi-Agent_Problems.md)
+* Related patterns → Memory Desync ([pattern\_memory\_desync](../patterns/pattern_memory_desync.md)), SCU ([pattern\_symbolic\_constraint\_unlock](../patterns/pattern_symbolic_constraint_unlock.md)), Vectorstore Fragmentation ([pattern\_vectorstore\_fragmentation](../patterns/pattern_vectorstore_fragmentation.md))
+* Examples → [Example 04 · Multi-Agent Coordination](../examples/example_04_multi_agent_coordination.md), [Example 03 · Pipeline Patch](../examples/example_03_pipeline_patch.md)
+* Eval → [Cross-Agent Consistency (κ)](../eval/eval_cross_agent_consistency.md)

 ---

-## 📝 Help Us Harden the Fix
+## 1) Problem definition

-| What to Submit | Where | Why |
-|----------------|-------|-----|
-| JSON / text trace of overwrite | New **Discussion → Memory Overwrite Trace** | Build regression test |
-| Framework info (LangChain, AutoGen …) | Same thread | Tune adapter layer |
-| Desired guardrail behavior | Comment | Define acceptance criteria |
+Two or more agents write to a **shared memory** (vector store, KV store, doc DB). Without versioning & conflict control, a later write silently **overwrites** a more recent or semantically different state (“last-writer-wins”). Downstream agents read stale or missing facts → contradictions, hallucinations, or wrong tool calls.
+
+**Typical symptoms**
+
+* “We agreed on Plan v3 yesterday… why are we back to v0?”
+* Auditor validates **deleted** or **older** evidence.
+* Turn logs show **non-monotonic** version jumps: `… 7 → 3 → 8`.

 ---

-## 🛠 Planned Guardrail Outline
+## 2) Threat model (why it happens)

-1. **Node Version Stamp** — each write carries `agent_id + timestamp + version`.  
-2. **ΔS Collision Alert** — large semantic mismatch triggers “merge or fork?” prompt.  
-3. **BBCR Reconcile** — automatic three‑way merge or safe branch to preserve both states.
+* **Stale write**: Agent B writes with an old `mem_rev` it fetched minutes ago.
+* **Concurrent write**: Agents A & B write simultaneously; store picks one arbitrarily.
+* **Namespace collision**: Different flows use the same `entity_id` or index namespace.
+* **Schema drift**: A writes `{plan,deadline}`, B writes `{deadline,notes}` and drops `plan`.
+* **Fragmented store**: Partitions disagree on latest revision (see vectorstore fragmentation).

 ---

-↩︎ [Back to Multi‑Agent Map](../Multi-Agent_Problems.md)
+## 3) Data model & invariants (copy/paste)

-<br>
+Every write envelope **must** include:

-> <img src="https://img.shields.io/github/stars/onestardao/WFGY?style=social" alt="GitHub stars"> ⭐ Help reach 10,000 stars by 2025-09-01 to unlock Engine 2.0 for everyone  ⭐ <strong><a href="https://github.com/onestardao/WFGY">Star WFGY on GitHub</a></strong>
+```json
+{
+  "entity_id": "project:alpha",
+  "agent_id": "planner",
+  "role_id": "planner@v3",
+  "role_hash": "sha256:78c2…",      // persona digest (see role-drift.md)
+  "op_id": "op-2025-08-13T12:34:56Z#1234",
+  "timestamp": "2025-08-13T12:34:56Z",
+  "mem_rev": 8,                     // intended new revision (monotonic int)
+  "prev_rev": 7,                    // what writer claims to extend
+  "mem_hash": "sha256:abcd1234",    // hash(content)
+  "parents": [7],                   // for merges, can be [7,7a] (three-way)
+  "content": {
+    "plan": "Deliverable X by EOD",
+    "dependencies": ["doc-123"]
+  }
+}
+```

-> 👑 **Early Stargazers: [See the Hall of Fame](https://github.com/onestardao/WFGY/tree/main/stargazers)** —  
+**Store invariants**
+
+1. **Monotonicity**: `mem_rev` strictly increases per `entity_id`.
+2. **CAS on prev\_rev**: write only applies if store’s `head_rev == prev_rev`.
+3. **Audit trail**: every write stored append-only in `mem_log`.
+4. **Branch-safe** (optional): allow **branches** on conflict; reconcile later.
+
+---
+
+## 4) Reproduce the bug (minimal & deterministic)
+
+**Goal:** make Agent B overwrite Agent A with a stale revision.
+
+### 4.1 Curl (HTTP debug endpoints, stdlib-only server assumed)
+
+```bash
+# 1) A reads, sees head_rev=7
+curl -s http://localhost:8080/mem/head?entity_id=project:alpha | jq
+
+# 2) A writes rev=8 (ok)
+curl -s -X POST http://localhost:8080/mem/write -H 'Content-Type: application/json' -d '{
+  "entity_id":"project:alpha","agent_id":"planner","role_id":"planner@v3",
+  "role_hash":"sha256:78c2","op_id":"op-A","timestamp":"2025-08-13T01:00:00Z",
+  "mem_rev":8,"prev_rev":7,"mem_hash":"sha256:aa","content":{"plan":"v8"}
+}' | jq
+
+# 3) B (stale) still thinks head=7 and tries to write another “rev=8”
+curl -s -X POST http://localhost:8080/mem/write -H 'Content-Type: application/json' -d '{
+  "entity_id":"project:alpha","agent_id":"executor","role_id":"executor@v1",
+  "role_hash":"sha256:91ff","op_id":"op-B","timestamp":"2025-08-13T01:00:02Z",
+  "mem_rev":8,"prev_rev":7,"mem_hash":"sha256:bb","content":{"plan":"v0 (stale)"}
+}' | jq
+```
+
+**Expected (correct)**: second call gets `409 Conflict` (CAS failed).
+**Buggy (overwrite)**: second call `200 OK`, head becomes stale content.
+
+### 4.2 Minimal Python test (single file)
+
+* Simulate two concurrent writes; assert the second is rejected **or** creates a **branch**.
+
+---
+
+## 5) Detection & fast triage (no LLM)
+
+**Reject on arrival** if any of:
+
+* `prev_rev < head_rev` at write time (**stale write**).
+* `prev_rev == head_rev` but `mem_hash` differs (**concurrent write**, collision).
+* `role_hash` mismatches bound persona for the writer (possible role-drift).
+* `entity_id` not in writer’s allowed scope (tool/ACL violation).
+
+**Emit metrics/logs** for each rejection and keep an **append-only** record.
+
+---
+
+## 6) Guardrails (choose one or combine)
+
+### 6.1 Optimistic CAS (compare-and-swap) — simplest & strong
+
+* Require `prev_rev == head_rev` at write.
+* On mismatch → **reject** or **auto-branch**.
+
+**Python-like pseudo (stdlib-only)**
+
+```python
+def safe_write(store, w):  # w: envelope dict (see schema)
+    head = store.head_meta(w["entity_id"])      # {"rev":int,"hash":str}
+    if head["rev"] != w["prev_rev"]:
+        return {"status":"conflict", "reason":"stale_prev", "head": head}
+    # Atomically swap (rev must advance by 1)
+    ok = store.compare_and_swap(
+        entity_id=w["entity_id"],
+        expected_rev=head["rev"],
+        new_rev=w["mem_rev"],
+        new_hash=w["mem_hash"],
+        content=w["content"],
+        op_meta={k:w[k] for k in ("agent_id","role_id","role_hash","op_id","timestamp")}
+    )
+    return {"status":"ok"} if ok else {"status":"retry","reason":"cas_failed"}
+```
+
+**Node/TS HMAC signature (optional)**
+
+```ts
+import crypto from "crypto";
+function signWrite(agentId: string, roleHash: string, prevRev: number, memRev: number, key: Buffer){
+  const payload = `${agentId}|${roleHash}|${prevRev}|${memRev}`;
+  return crypto.createHmac("sha256", key).update(payload).digest("hex");
+}
+```
+
+### 6.2 Branch-and-Merge (BBCR-style)
+
+* On conflict, create a **branch** (`mem_rev=8a`) instead of rejecting; later run a **three-way merge**.
+
+**Three-way merge outline**
+
+```
+base = rev 7
+A    = rev 8 (agent A)
+B    = rev 8a (agent B)
+ΔA = diff(base, A);  ΔB = diff(base, B)
+if ΔA ∩ ΔB == Ø → merge = base ⊕ ΔA ⊕ ΔB
+else → manual decision or rule-based precedence (e.g., auditor > planner)
+```
+
+*No external libs needed*: represent `content` as JSON and define a minimal **diff** (added/removed keys; for strings, use normalized edit distance ≤ threshold to auto-merge).
+
+### 6.3 ΔS semantic collision alert (nice-to-have)
+
+* Compute a **cheap semantic distance** between the new `content` and head content:
+
+  * Normalize (lowercase, strip punctuation), tokenize, Jaccard overlap on tokens.
+  * If overlap `< 0.6` **and** same `prev_rev` → raise collision alert, require manual confirm.
+
+---
+
+## 7) Observability (Prometheus) & alerts
+
+**Metrics**
+
+* `mem_write_total{entity,agent,outcome="ok|conflict|retry"}`
+* `mem_conflict_total{entity,reason="stale_prev|hash_collision"}`
+* `mem_branch_total{entity}` (if branch mode)
+* `mem_head_rev{entity}` (gauge)
+* `mem_write_latency_seconds` (histogram)
+
+**Alert rules (examples)**
+
+```yaml
+# Frequent conflicts (possible hot entity or stale readers)
+alert: MemConflictsSpike
+expr: increase(mem_conflict_total[5m]) > 3
+for: 5m
+labels: { severity: ticket }
+
+# Head revision oscillation (rollback/flip-flop)
+alert: MemHeadOscillation
+expr: changes(mem_head_rev[10m]) > 5
+for: 10m
+labels: { severity: ticket }
+```
+
+> Also track **cross-agent κ**; sudden drops often co-occur with memory corruption. See [cross-agent eval](../eval/eval_cross_agent_consistency.md).
+
+---
+
+## 8) Tests & acceptance criteria
+
+**Unit**
+
+* **Stale write** rejected (`prev_rev < head_rev`).
+* **Concurrent write**: either **reject** or **create branch**; never silent overwrite.
+* **Schema merge**: non-overlapping keys merge automatically.
+
+**E2E**
+
+* Two agents race on the same `entity_id`: final head must be **v8** or (**v8 + v8a** if branching), never **v0**.
+* κ remains ≥ baseline after enabling guards.
+
+**Acceptance (ship)**
+
+* `mem_conflict_total` steady and low; no silent overwrites in 1k writes.
+* No data loss in replay tests (log → rebuild yields identical head).
+
+---
+
+## 9) Rollout plan (safe)
+
+1. **Shadow mode**: turn on CAS checks, **warn-only**; measure conflict rate.
+2. **Canary**: reject stale writes for 10% entities; branch on collision (optional).
+3. **Full**: enforce CAS for all; keep feature flag for emergency bypass.
+4. **Post-rollout**: schedule merge jobs for any branches; add dashboards.
+
+---
+
+## 10) Sample logs (anonymized JSONL)
+
+```json
+{"ts":"2025-08-13T01:00:00Z","entity_id":"project:alpha","op":"write","agent":"planner","prev_rev":7,"mem_rev":8,"status":"ok"}
+{"ts":"2025-08-13T01:00:02Z","entity_id":"project:alpha","op":"write","agent":"executor","prev_rev":7,"mem_rev":8,"status":"conflict","reason":"stale_prev","head_rev":8}
+{"ts":"2025-08-13T01:00:03Z","entity_id":"project:alpha","op":"write","agent":"executor","prev_rev":8,"mem_rev":9,"status":"ok"}
+```
+
+---
+
+> Have a reproducible overwrite trace? Please share; even 5–10 turns help us tune adapters and defaults.
+
+↩︎ [Back to Multi-Agent Map](../Multi-Agent_Problems.md)
+
+---
+
+### 🧭 Explore More
+
+| Module                | Description                                                          | Link                                                                                               |
+| --------------------- | -------------------------------------------------------------------- | -------------------------------------------------------------------------------------------------- |
+| WFGY Core             | Standalone semantic reasoning engine for any LLM                     | [View →](https://github.com/onestardao/WFGY/tree/main/core/README.md)                              |
+| Problem Map 1.0       | Initial 16-mode diagnostic and symbolic fix framework                | [View →](https://github.com/onestardao/WFGY/tree/main/ProblemMap/README.md)                        |
+| Problem Map 2.0       | RAG-focused failure tree, modular fixes, and pipelines               | [View →](https://github.com/onestardao/WFGY/blob/main/ProblemMap/rag-architecture-and-recovery.md) |
+| Semantic Clinic Index | Expanded failure catalog: prompt injection, memory bugs, logic drift | [View →](https://github.com/onestardao/WFGY/blob/main/ProblemMap/SemanticClinicIndex.md)           |
+| Semantic Blueprint    | Layer-based symbolic reasoning & semantic modulations                | [View →](https://github.com/onestardao/WFGY/tree/main/SemanticBlueprint/README.md)                 |
+| Benchmark vs GPT-5    | Stress test GPT-5 with full WFGY reasoning suite                     | [View →](https://github.com/onestardao/WFGY/tree/main/benchmarks/benchmark-vs-gpt5/README.md)      |
+
+---
+
+> 👑 **Early Stargazers: [See the Hall of Fame](https://github.com/onestardao/WFGY/tree/main/stargazers)** —
 > Engineers, hackers, and open source builders who supported WFGY from day one.

+> <img src="https://img.shields.io/github/stars/onestardao/WFGY?style=social" alt="GitHub stars"> ⭐ Help reach 10,000 stars by 2025-09-01 to unlock Engine 2.0 for everyone  ⭐ **[Star WFGY on GitHub](https://github.com/onestardao/WFGY)**
+
 <div align="center">

 [![WFGY Main](https://img.shields.io/badge/WFGY-Main-red?style=flat-square)](https://github.com/onestardao/WFGY)
-&nbsp;
+ 
 [![TXT OS](https://img.shields.io/badge/TXT%20OS-Reasoning%20OS-orange?style=flat-square)](https://github.com/onestardao/WFGY/tree/main/OS)
-&nbsp;
+ 
 [![Blah](https://img.shields.io/badge/Blah-Semantic%20Embed-yellow?style=flat-square)](https://github.com/onestardao/WFGY/tree/main/OS/BlahBlahBlah)
-&nbsp;
+ 
 [![Blot](https://img.shields.io/badge/Blot-Persona%20Core-green?style=flat-square)](https://github.com/onestardao/WFGY/tree/main/OS/BlotBlotBlot)
-&nbsp;
+ 
 [![Bloc](https://img.shields.io/badge/Bloc-Reasoning%20Compiler-blue?style=flat-square)](https://github.com/onestardao/WFGY/tree/main/OS/BlocBlocBloc)
-&nbsp;
+ 
 [![Blur](https://img.shields.io/badge/Blur-Text2Image%20Engine-navy?style=flat-square)](https://github.com/onestardao/WFGY/tree/main/OS/BlurBlurBlur)
-&nbsp;
+ 
 [![Blow](https://img.shields.io/badge/Blow-Game%20Logic-purple?style=flat-square)](https://github.com/onestardao/WFGY/tree/main/OS/BlowBlowBlow)

 </div>
+