better doc

2026-05-05 17:56:42 +00:00 · 2026-02-27 01:11:57 +08:00 · 2026-02-27 01:11:57 +08:00 · 665831c774
commit 665831c774
parent aea8844bac
46 changed files with 1217 additions and 3505 deletions
--- a/docs/zh/s03-todo-write.md
+++ b/docs/zh/s03-todo-write.md
@ -1,147 +1,86 @@
 # s03: TodoWrite (待办写入)

-> TodoManager 让智能体能追踪自己的进度, 而 nag reminder 注入机制在它忘记更新时强制提醒。
+`s01 > s02 > [ s03 ] s04 > s05 > s06 | s07 > s08 > s09 > s10 > s11 > s12`
+
+> *"Plan before you act"* -- 先列计划, 完成率翻倍。

 ## 问题

-当智能体处理多步骤任务时, 它经常丢失对已完成和待办事项的追踪。没有显式的计划, 模型可能重复工作、跳过步骤或跑偏。用户也无法看到智能体内部的计划。
-
-这个问题比听起来更严重。长对话会导致模型 "漂移" -- 随着上下文窗口被工具结果填满, 系统提示的影响力逐渐减弱。一个 10 步的重构任务可能完成了 1-3 步, 然后模型就开始即兴发挥, 因为它忘了第 4-10 步的存在。
-
-解决方案是结构化状态: 一个模型显式写入的 TodoManager。模型创建计划, 工作时将项目标记为 in_progress, 完成后标记为 completed。nag reminder 机制在模型连续 3 轮以上不更新待办时注入提醒。
-
-注: nag 阈值 3 轮是为教学可见性设的低值, 生产环境通常更高。从 s07 起, 课程转向 Task 看板处理持久化多步工作; TodoWrite 仍可用于轻量清单。
+多步任务中, 模型会丢失进度 -- 重复做过的事、跳步、跑偏。对话越长越严重: 工具结果不断填满上下文, 系统提示的影响力逐渐被稀释。一个 10 步重构可能做完 1-3 步就开始即兴发挥, 因为 4-10 步已经被挤出注意力了。

 ## 解决方案

 ```
-+----------+      +-------+      +---------+
-|   User   | ---> |  LLM  | ---> | Tools   |
-|  prompt  |      |       |      | + todo  |
-+----------+      +---+---+      +----+----+
-                      ^               |
-                      |   tool_result |
-                      +---------------+
-                            |
-                +-----------+-----------+
-                | TodoManager state     |
-                | [ ] task A            |
-                | [>] task B  <- doing  |
-                | [x] task C            |
-                +-----------------------+
-                            |
-                if rounds_since_todo >= 3:
-                  inject <reminder> into tool_result
+--------+      +-------+      +---------+
+|  User  | ---> |  LLM  | ---> | Tools   |
+| prompt |      |       |      | + todo  |
+--------+      +---+---+      +----+----+
+                    ^                |
+                    |   tool_result  |
+                    +----------------+
+                          |
+              +-----------+-----------+
+              | TodoManager state     |
+              | [ ] task A            |
+              | [>] task B  <- doing  |
+              | [x] task C            |
+              +-----------------------+
+                          |
+              if rounds_since_todo >= 3:
+                inject <reminder> into tool_result
 ```

 ## 工作原理

-1. TodoManager 验证并存储一组带状态的项目。同一时间只允许一个项目处于 `in_progress` 状态。
+1. TodoManager 存储带状态的项目。同一时间只允许一个 `in_progress`。

 ```python
 class TodoManager:
-    def __init__(self):
-        self.items = []
-
    def update(self, items: list) -> str:
-        validated = []
-        in_progress_count = 0
+        validated, in_progress_count = [], 0
        for item in items:
            status = item.get("status", "pending")
            if status == "in_progress":
                in_progress_count += 1
-            validated.append({
-                "id": item["id"],
-                "text": item["text"],
-                "status": status,
-            })
+            validated.append({"id": item["id"], "text": item["text"],
+                              "status": status})
        if in_progress_count > 1:
            raise ValueError("Only one task can be in_progress")
        self.items = validated
        return self.render()
 ```

-2. `todo` 工具和其他工具一样添加到 dispatch map 中。
+2. `todo` 工具和其他工具一样加入 dispatch map。

 ```python
 TOOL_HANDLERS = {
-    "bash":  lambda **kw: run_bash(kw["command"]),
-    # ...other tools...
-    "todo":  lambda **kw: TODO.update(kw["items"]),
+    # ...base tools...
+    "todo": lambda **kw: TODO.update(kw["items"]),
 }
 ```

-3. nag reminder 在模型连续 3 轮以上不调用 `todo` 时, 向 tool_result 消息中注入 `<reminder>` 标签。
+3. nag reminder: 模型连续 3 轮以上不调用 `todo` 时注入提醒。

 ```python
-def agent_loop(messages: list):
-    rounds_since_todo = 0
-    while True:
-        if rounds_since_todo >= 3 and messages:
-            last = messages[-1]
-            if (last["role"] == "user"
-                    and isinstance(last.get("content"), list)):
-                last["content"].insert(0, {
-                    "type": "text",
-                    "text": "<reminder>Update your todos.</reminder>",
-                })
-        # ... rest of loop ...
-        rounds_since_todo = 0 if used_todo else rounds_since_todo + 1
+if rounds_since_todo >= 3 and messages:
+    last = messages[-1]
+    if last["role"] == "user" and isinstance(last.get("content"), list):
+        last["content"].insert(0, {
+            "type": "text",
+            "text": "<reminder>Update your todos.</reminder>",
+        })
 ```

-4. 系统提示指导模型使用 todo 进行规划。
-
-```python
-SYSTEM = f"""You are a coding agent at {WORKDIR}.
-Use the todo tool to plan multi-step tasks.
-Mark in_progress before starting, completed when done.
-Prefer tools over prose."""
-```
-
-## 核心代码
-
-TodoManager 和 nag 注入 (来自 `agents/s03_todo_write.py`,
-第 51-85 行和第 158-187 行):
-
-```python
-class TodoManager:
-    def update(self, items: list) -> str:
-        validated = []
-        in_progress_count = 0
-        for item in items:
-            status = item.get("status", "pending")
-            if status == "in_progress":
-                in_progress_count += 1
-            validated.append({
-                "id": item["id"],
-                "text": item["text"],
-                "status": status,
-            })
-        if in_progress_count > 1:
-            raise ValueError("Only one in_progress")
-        self.items = validated
-        return self.render()
-
-# In agent_loop:
-if rounds_since_todo >= 3:
-    last["content"].insert(0, {
-        "type": "text",
-        "text": "<reminder>Update your todos.</reminder>",
-    })
-```
+"同时只能有一个 in_progress" 强制顺序聚焦。nag reminder 制造问责压力 -- 你不更新计划, 系统就追着你问。

 ## 相对 s02 的变更

-| 组件           | 之前 (s02)       | 之后 (s03)                   |
-|----------------|------------------|--------------------------|
-| Tools          | 4                | 5 (+todo)                |
-| 规划           | 无               | 带状态的 TodoManager     |
-| Nag 注入       | 无               | 3 轮后注入 `<reminder>`  |
-| Agent loop     | 简单分发         | + rounds_since_todo 计数器|
-
-## 设计原理
-
-可见的计划能提高任务完成率, 因为模型可以自我监控进度。nag 机制创造了问责性 -- 没有它, 随着对话上下文增长和早期指令淡化, 模型可能在执行中途放弃计划。"同一时间只允许一个 in_progress" 的约束强制顺序聚焦, 防止上下文切换开销降低输出质量。这个模式之所以有效, 是因为它将模型的工作记忆外化为结构化状态, 使其能够在注意力漂移中存活。
+| 组件           | 之前 (s02)       | 之后 (s03)                     |
+|----------------|------------------|--------------------------------|
+| Tools          | 4                | 5 (+todo)                      |
+| 规划           | 无               | 带状态的 TodoManager           |
+| Nag 注入       | 无               | 3 轮后注入 `<reminder>`        |
+| Agent loop     | 简单分发         | + rounds_since_todo 计数器     |

 ## 试一试

@ -150,7 +89,7 @@ cd learn-claude-code
 python agents/s03_todo_write.py
 ```

-可以尝试的提示:
+试试这些 prompt (英文 prompt 对 LLM 效果更好, 也可以用中文):

 1. `Refactor the file hello.py: add type hints, docstrings, and a main guard`
 2. `Create a Python package with __init__.py, utils.py, and tests/test_utils.py`