JSON Mode and Tool Calls · Prompt Assembly

A field guide to keep JSON mode stable and tool calls safe. Use this page to clamp the schema, stop looped tools, and keep outputs auditable across providers.

Open these first

Visual map and recovery: RAG Architecture & Recovery
End to end retrieval knobs: Retrieval Playbook
Traceability schema for snippets: Retrieval Traceability
Contract the payload: Data Contracts
Reasoning collapse and recovery: Logic Collapse
Prompt injection fences: Prompt Injection
Multi agent conflict map: Multi-Agent Problems
Live ops and debug: Live Monitoring for RAG, Debug Playbook

When to use this page

JSON mode returns invalid or partial objects.
Tool calls arrive with missing or extra fields.
The model mixes tool output with prose.
Tools loop or stall without timeouts.
Role text bleeds into user turns and corrupts parsing.
Outputs differ between seeds for the same inputs.

Acceptance targets

JSON parse success rate ≥ 0.99 across three paraphrases.
Tool call validity ≥ 0.98 by schema check.
Zero side effects on failed parse.
λ remains convergent across three paraphrases and two seeds.
ΔS(question, retrieved) ≤ 0.45 when the answer cites corpus evidence.

Map symptoms to structural fixes

Invalid JSON or mixed prose. → Lock format with a contract and a validator. See Data Contracts, Logic Collapse
Tool arguments drift or contain free text. → Use strict argument schemas and echo the schema in every step. See Prompt Injection
Tools loop or wait on each other. → Add timeouts and split memories by namespace and revision. See Multi-Agent Problems
Answers flip between runs for the same input. → Reorder headers and clamp variance with BBAM. Verify with Retrieval Traceability
Citations missing or point to the wrong snippet. → Enforce cite then explain. See Retrieval Playbook

Fix in 60 seconds

Clamp the JSON contract Define a single object shape. Disallow extra fields. Reject prose outside the object.
Echo the tool schema In every step the model must restate the allowed tools and their argument shapes. Reject any call that does not match.
Add timeouts and retries Each tool has timeout_ms and at most N retries with exponential backoff. Abort on chain length overrun.
Observability probes Log parse success, tool validity, ΔS, and λ. Trip a circuit if parse rate drops under 0.98 in a five minute window.
No side effects before validation Validate JSON and tool calls first. Only then commit external writes.

Copy paste blocks

A. JSON answer contract

You must output a single JSON object. No Markdown. No code fences. No commentary.

Schema:
{
  "answer": "string",
  "citations": [{"source_url": "string", "snippet_id": "string"}],
  "tool_calls": [{"name": "tool_name", "args": { ... }}],
  "metrics": {"lambda_state": "->|<-|<>|x", "delta_s": 0.00}
}

Rules:
- Do not include fields that are not in the schema.
- Strings must not contain unescaped newlines.
- If you cannot satisfy the schema, return:
  {"answer": "", "citations": [], "tool_calls": [], "metrics": {"lambda_state":"x","delta_s":1.0}}

B. Tool protocol guard

Allowed tools:
1) "web_fetch": args = {"url": "string"}
2) "vector_search": args = {"query": "string", "k": 5}
3) "write_kv": args = {"key":"string","value":"string","ttl_sec": 600}

Contract:
- Each tool call must match the exact args shape.
- Never put narrative text into args.
- Max tool calls in one turn: 3
- Per call timeout_ms: 15000
- Retries: up to 2 with capped backoff

C. Validator stub

Step 1: parse JSON strictly. If parse fails, stop and return a fix tip.
Step 2: check extra fields. If any, reject.
Step 3: validate each tool call against the schema list.
Step 4: only after validation, run tools in order.
Step 5: log {parse_ok, tool_valid, delta_s, lambda_state}.

D. Role header clamp

System:
- Policies and schema live here only. User turns contain tasks and questions only.
- Cite then explain. Refuse to answer without citations when the task requires evidence.
- Echo the current tool schema before any tool call.

User:
- Provides task and question.

Deep diagnostics

Three paraphrase test Ask the same question three ways. Parse rate and tool validity should remain ≥ 0.98. If not, tighten the schema or reduce optional fields.
Anchor triangulation Compare ΔS to the expected anchor section and to a decoy. If ΔS is close for both, rework chunking and rebuild index. See Retrieval Playbook
Chain length audit If tool chains exceed safe length and entropy rises, split the plan and reconnect with a BBCR bridge. See Logic Collapse

Eval gates before ship

JSON parse success ≥ 0.99 on a 50 case set.
Tool validity ≥ 0.98 with negative cases included.
Coverage ≥ 0.70 and ΔS ≤ 0.45 on evidence tasks.
λ convergent across two seeds.
Live probes green for one hour with zero side effects on rejects.

🔗 Quick-Start Downloads (60 sec)

Tool	Link	3-Step Setup
WFGY 1.0 PDF	Engine Paper	1️⃣ Download · 2️⃣ Upload to your LLM · 3️⃣ Ask “Answer using WFGY + <your question>”
TXT OS (plain-text OS)	TXTOS.txt	1️⃣ Download · 2️⃣ Paste into any LLM chat · 3️⃣ Type “hello world” — OS boots instantly

🧭 Explore More

Module	Description	Link
WFGY Core	WFGY 2.0 engine is live: full symbolic reasoning architecture and math stack	View →
Problem Map 1.0	Initial 16-mode diagnostic and symbolic fix framework	View →
Problem Map 2.0	RAG-focused failure tree, modular fixes, and pipelines	View →
Semantic Clinic Index	Expanded failure catalog: prompt injection, memory bugs, logic drift	View →
Semantic Blueprint	Layer-based symbolic reasoning & semantic modulations	View →
Benchmark vs GPT-5	Stress test GPT-5 with full WFGY reasoning suite	View →
🧙‍♂️ Starter Village 🏡	New here? Lost in symbols? Click here and let the wizard guide you through	Start →

👑 Early Stargazers: See the Hall of Fame — Engineers, hackers, and open source builders who supported WFGY from day one.

⭐ WFGY Engine 2.0 is already unlocked. ⭐ Star the repo to help others discover it and unlock more on the Unlock Board.

11 KiB Raw Blame History Unescape Escape

JSON Mode and Tool Calls · Prompt Assembly

Open these first

When to use this page

Acceptance targets

Map symptoms to structural fixes

Fix in 60 seconds

Copy paste blocks

A. JSON answer contract

B. Tool protocol guard

C. Validator stub

D. Role header clamp

Deep diagnostics

Eval gates before ship

🔗 Quick-Start Downloads (60 sec)

🧭 Explore More

11 KiB

Raw Blame History