better doc

2026-05-02 16:20:12 +00:00 · 2026-02-27 01:11:57 +08:00 · 2026-02-27 01:11:57 +08:00 · 665831c774
commit 665831c774
parent aea8844bac
46 changed files with 1217 additions and 3505 deletions
--- a/docs/en/s03-todo-write.md
+++ b/docs/en/s03-todo-write.md
@ -1,159 +1,87 @@
 # s03: TodoWrite

-> A TodoManager lets the agent track its own progress, and a nag reminder injection forces it to keep updating when it forgets.
+`s01 > s02 > [ s03 ] s04 > s05 > s06 | s07 > s08 > s09 > s10 > s11 > s12`

-## The Problem
+> *"Plan before you act"* -- visible plans improve task completion.

-When an agent works on a multi-step task, it often loses track of what it
-has done and what remains. Without explicit planning, the model might repeat
-work, skip steps, or wander off on tangents. The user has no visibility
-into the agent's internal plan.
+## Problem

-This is worse than it sounds. Long conversations cause the model to "drift"
-- the system prompt fades in influence as the context window fills with
-tool results. A 10-step refactoring task might complete steps 1-3, then
-the model starts improvising because it forgot steps 4-10 existed.
+On multi-step tasks, the model loses track. It repeats work, skips steps, or wanders off. Long conversations make this worse -- the system prompt fades as tool results fill the context. A 10-step refactoring might complete steps 1-3, then the model starts improvising because it forgot steps 4-10.

-The solution is structured state: a TodoManager that the model writes to
-explicitly. The model creates a plan, marks items in_progress as it works,
-and marks them completed when done. A nag reminder injects a nudge if the
-model goes 3+ rounds without updating its todos.
-
-Note: the nag threshold of 3 rounds is low for visibility. Production systems tune higher. From s07, this course switches to the Task board for durable multi-step work; TodoWrite remains available for quick checklists.
-
-## The Solution
+## Solution

 ```
-+----------+      +-------+      +---------+
-|   User   | ---> |  LLM  | ---> | Tools   |
-|  prompt  |      |       |      | + todo  |
-+----------+      +---+---+      +----+----+
-                      ^               |
-                      |   tool_result |
-                      +---------------+
-                            |
-                +-----------+-----------+
-                | TodoManager state     |
-                | [ ] task A            |
-                | [>] task B  <- doing  |
-                | [x] task C            |
-                +-----------------------+
-                            |
-                if rounds_since_todo >= 3:
-                  inject <reminder> into tool_result
+--------+      +-------+      +---------+
+|  User  | ---> |  LLM  | ---> | Tools   |
+| prompt |      |       |      | + todo  |
+--------+      +---+---+      +----+----+
+                    ^                |
+                    |   tool_result  |
+                    +----------------+
+                          |
+              +-----------+-----------+
+              | TodoManager state     |
+              | [ ] task A            |
+              | [>] task B  <- doing  |
+              | [x] task C            |
+              +-----------------------+
+                          |
+              if rounds_since_todo >= 3:
+                inject <reminder> into tool_result
 ```

 ## How It Works

-1. The TodoManager validates and stores a list of items with statuses.
-   Only one item can be `in_progress` at a time.
+1. TodoManager stores items with statuses. Only one item can be `in_progress` at a time.

 ```python
 class TodoManager:
-    def __init__(self):
-        self.items = []
-
    def update(self, items: list) -> str:
-        validated = []
-        in_progress_count = 0
+        validated, in_progress_count = [], 0
        for item in items:
            status = item.get("status", "pending")
            if status == "in_progress":
                in_progress_count += 1
-            validated.append({
-                "id": item["id"],
-                "text": item["text"],
-                "status": status,
-            })
+            validated.append({"id": item["id"], "text": item["text"],
+                              "status": status})
        if in_progress_count > 1:
            raise ValueError("Only one task can be in_progress")
        self.items = validated
        return self.render()
 ```

-2. The `todo` tool is added to the dispatch map like any other tool.
+2. The `todo` tool goes into the dispatch map like any other tool.

 ```python
 TOOL_HANDLERS = {
-    "bash":  lambda **kw: run_bash(kw["command"]),
-    # ...other tools...
-    "todo":  lambda **kw: TODO.update(kw["items"]),
+    # ...base tools...
+    "todo": lambda **kw: TODO.update(kw["items"]),
 }
 ```

-3. The nag reminder injects a `<reminder>` tag into the tool_result
-   messages when the model goes 3+ rounds without calling `todo`.
+3. A nag reminder injects a nudge if the model goes 3+ rounds without calling `todo`.

 ```python
-def agent_loop(messages: list):
-    rounds_since_todo = 0
-    while True:
-        if rounds_since_todo >= 3 and messages:
-            last = messages[-1]
-            if (last["role"] == "user"
-                    and isinstance(last.get("content"), list)):
-                last["content"].insert(0, {
-                    "type": "text",
-                    "text": "<reminder>Update your todos.</reminder>",
-                })
-        # ... rest of loop ...
-        rounds_since_todo = 0 if used_todo else rounds_since_todo + 1
+if rounds_since_todo >= 3 and messages:
+    last = messages[-1]
+    if last["role"] == "user" and isinstance(last.get("content"), list):
+        last["content"].insert(0, {
+            "type": "text",
+            "text": "<reminder>Update your todos.</reminder>",
+        })
 ```

-4. The system prompt instructs the model to use todos for planning.
-
-```python
-SYSTEM = f"""You are a coding agent at {WORKDIR}.
-Use the todo tool to plan multi-step tasks.
-Mark in_progress before starting, completed when done.
-Prefer tools over prose."""
-```
-
-## Key Code
-
-The TodoManager and nag injection (from `agents/s03_todo_write.py`,
-lines 51-85 and 158-187):
-
-```python
-class TodoManager:
-    def update(self, items: list) -> str:
-        validated = []
-        in_progress_count = 0
-        for item in items:
-            status = item.get("status", "pending")
-            if status == "in_progress":
-                in_progress_count += 1
-            validated.append({
-                "id": item["id"],
-                "text": item["text"],
-                "status": status,
-            })
-        if in_progress_count > 1:
-            raise ValueError("Only one in_progress")
-        self.items = validated
-        return self.render()
-
-# In agent_loop:
-if rounds_since_todo >= 3:
-    last["content"].insert(0, {
-        "type": "text",
-        "text": "<reminder>Update your todos.</reminder>",
-    })
-```
+The "one in_progress at a time" constraint forces sequential focus. The nag reminder creates accountability.

 ## What Changed From s02

-| Component      | Before (s02)     | After (s03)              |
-|----------------|------------------|--------------------------|
-| Tools          | 4                | 5 (+todo)                |
-| Planning       | None             | TodoManager with statuses|
+| Component      | Before (s02)     | After (s03)                |
+|----------------|------------------|----------------------------|
+| Tools          | 4                | 5 (+todo)                  |
+| Planning       | None             | TodoManager with statuses  |
 | Nag injection  | None             | `<reminder>` after 3 rounds|
 | Agent loop     | Simple dispatch  | + rounds_since_todo counter|

-## Design Rationale
-
-Visible plans improve task completion because the model can self-monitor progress. The nag mechanism creates accountability -- without it, the model may abandon plans mid-execution as conversation context grows and earlier instructions fade. The "one in_progress at a time" constraint enforces sequential focus, preventing context-switching overhead that degrades output quality. This pattern works because it externalizes the model's working memory into structured state that survives attention drift.
-
 ## Try It

 ```sh
@ -161,8 +89,6 @@ cd learn-claude-code
 python agents/s03_todo_write.py
 ```

-Example prompts to try:
-
 1. `Refactor the file hello.py: add type hints, docstrings, and a main guard`
 2. `Create a Python package with __init__.py, utils.py, and tests/test_utils.py`
 3. `Review all Python files and fix any style issues`