从提示到自治:设计 AI 工作循环的五个阶段
Claude Code 的创造者声称自己几乎不再写提示词了——是循环在替他“提示”。本文提出一个清晰的五级框架,描述开发者与 AI 协同工作模式的演进:从 L1 的单轮问答,到 L2 的手动循环(重复“做-检查-修正”),再到 L3 的验证循环(用独立检查定义“完成”),然后到 L4 的自运行循环(靠 Goal 命令自主迭代),最终到 L5 的自主智能体系统(循环自触发、并行执行、经验回写入知识库持续改进)。每级都包含识别标志和具体升级行动。适合已感觉“聊胜于无”的 AI 用户,以及正在构建自动化 Agent 工作流的工程师。
The person who created Claude Code says he doesn't really prompt it anymore. Loops prompt it. His job is to design the loops.
That one sentence is the whole shift happening in 2026. For two years the move was simple: write a good prompt, feed context, read what comes back, type the next thing. You held the agent the entire time, one turn at a time. That era is closing.
Claude Code 的创造者说,他现在几乎不再主动提示它了。是循环在提示它。他的工作变成了设计这些循环。
这一句话就概括了 2026 年正在发生的整个转变。过去两年,做法很简单:写好提示词、喂入上下文、阅读回复、再键入下一个指令。你全程握着智能体,一次一个回合。那个时代正在落幕。
The new skill is building a small system that finds the work, hands it out, checks it, and decides what's next, while you watch instead of type. It's called loop design, and it isn't one skill, it's a ladder. Here are the 5 levels, from where almost everyone starts to where the best builders operate. Find your rung.
新技能是构建一个小型系统:它自己找活干、分配任务、检查结果、决定下一步,而你的角色是观察而非打字。这被称为“循环设计”,但它不是单一技能,而是一级级的阶梯。以下是 5 个层级,从几乎所有人的起点,到顶尖构建者的操作区间。找到你所在的层级。
You write a prompt, read the answer, write the next prompt.
This is where everyone starts and where most people stay. You type a request, the agent answers, you read it, you type the next request. The agent is a tool you hold in your hand the whole time. Every single turn requires you.
Nothing wrong with it for quick tasks. But it has a hard ceiling: your throughput is capped by how fast you can type and read. You are the loop. And a human running the loop is slow, gets tired, and can only do one thing at a time.
You're in this level if "using AI" means a chat window where you send messages and read replies, one at a time.
你写一条提示,阅读答案,再写下一条提示。
这是每个人的起点,也是大多数人的停留点。你打入请求,智能体回答,你阅读,再打入下一个请求。智能体是你全程握在手里的工具。每一轮都需要你。
对快速任务来说这没问题。但它有一个硬性天花板:你的吞吐量受限于打字和阅读的速度。你就是那个循环。由人类来运行循环很慢,会疲劳,而且一次只能做一件事。
如果你使用 AI 的方式就是打开聊天窗口,一条接一条地发送消息并读取回复,那你就处在这个层级。
You repeat the same cycle by hand: do, check, correct, repeat.
At level 2 you've noticed a pattern: most real work is a cycle, not a single answer. Do the thing, check it, fix what's wrong, repeat until it's right. You start running this cycle deliberately: "now run the tests," "now fix that failure," "now check again."
This is the first real loop, but you're still the engine driving it. You're manually pushing the agent through each stage. It's better than level 1 because you're thinking in cycles, but you're still spending your attention on every lap.
You're here if you find yourself typing "run it again," "now check," and "fix that" over and over in a session.
你手动重复同一个循环:执行、检查、修正、重复。
在第二级,你已经注意到一个模式:大多数实际工作是一个循环,而不是单个答案。执行操作、检查结果、修复错误、重复直到正确。你开始有意地运行这个循环:“现在运行测试”“现在修复那个失败”“现在再检查一次”。
这是第一个真正的循环,但你仍是驱动它的引擎。你手动推动智能体经过每个阶段。这比第一级好,因为你在按循环思考,但你的注意力仍然耗费在每一圈上。
如果你发现自己在一个会话中反复输入“再运行一次”“现在检查”“修复那个”,那你就处在这个层级。
You define "done," and a check decides when the loop stops.
Level 3 is the first real jump in skill. Instead of you eyeballing whether each lap worked, you write down a success condition and let a check enforce it. The loop now has a stopping criterion that isn't your opinion: tests pass, lint is clean, the output matches a rubric.
This is the level where loop design becomes a real discipline. The key insight: the agent that did the work is a bad judge of whether the work is done, it tends to over-report success. So the check has to be separate from the doer.
A VERIFIED LOOP HAS A SEPARATE JUDGE
# the doer does. a separate check decides "done".
LOOP until: all tests in test/ pass AND lint is clean
# the model that wrote the code does NOT get to grade itself
# a separate check (tests, a faster grader model) decides
You're here if you can state "done" as a condition a machine could check, not a feeling you have to confirm.
你定义“完成”,并由一个检查来决定循环何时停止。
第三级是技能的第一个真正飞跃。不再是靠你直观判断每一轮是否有效,而是写下成功条件,让一个检查机制来执行它。循环现在有了一个不依赖你主观意见的停止标准:测试通过、代码检查干净、输出符合评分准则。
在这个层级,循环设计成为一门真正的方法论。关键洞察是:执行工作的智能体并不擅长判断工作是否完成,它倾向于过度报告成功。因此检查必须和执行者分离。
验证循环有独立的裁判
# 执行者执行。一个独立的检查决定“完成”。
循环直到:test/ 中的所有测试通过 且 代码检查干净
# 编写代码的模型不得给自己评分
# 由一个独立的检查(测试、更快的评分模型)来决定
如果你能把“完成”表述为一个机器可以检查的条件,而不是需要你确认的感觉,那你就处在这个层级。
You set a goal, the loop runs across turns on its own until it's met.
At level 4 the loop runs itself. You set a goal and a stopping condition, and the system loops across many turns without you driving each one: it reads the failure, traces the cause, makes the change, re-checks, and repeats until the condition is true. You check in when you want, but you don't have to hover.
In Claude Code this is now a native primitive. The /goal command runs a loop across turns until a condition you wrote is true, and crucially it uses a separate, faster model to grade whether the goal is met after each turn, instead of trusting the doer's word.
A SELF-RUNNING GOAL LOOP
/goal All tests in the suite pass with no skips or warnings
/loop every 2m until: tests pass
# Claude reads failing output, traces the cause, edits,
# re-runs, and repeats, on its own, until the goal is met
# a separate fast model checks the condition each cycle
The level-4 discipline:A self-running loop needs guardrails or it spins. Cap the iterations (a max-turns limit), front-load context so it doesn't wander, and snapshot state before each turn so a bad lap can roll back. Autonomy without limits is just a faster way to make a mess.
你设定一个目标,循环自主跨回合运行,直到目标达成。
在第四级,循环自行运转。你设定一个目标和停止条件,系统自动跨多个回合循环,无需你驱动每一轮:它读取失败信息、追踪根因、进行修改、重新检查,并重复直到条件为真。你可以随时检查进度,但不必全程盯着。
在 Claude Code 中,这已经是一个原生原语。/goal 命令会运行一个跨回合的循环,直到你写下的条件为真,关键在于它使用一个独立的、更快的模型来判断每个回合后目标是否达成,而不是信任执行者的断言。
自运行目标循环
/goal 套件中所有测试通过,无跳过或警告
/loop 每 2 分钟直到:测试通过
# Claude 读取失败输出、追踪根因、编辑、
# 重新运行并重复,自主直到目标达成
# 一个独立的快速模型在每个周期检查条件
第四级的纪律:自运行循环需要护栏,否则它会空转。设置迭代上限(最大回合数)、前置上下文以免跑偏、在每个回合前快照状态以便回滚错误回合。没有边界的自主性只是更快地制造混乱。
Loops that start themselves, run in parallel, and improve over time.
Level 5 is where loop design becomes systems design. The loops don't wait for you to start them, they trigger on their own (a schedule, an event), run in parallel, and feed their lessons back so the system gets better over time. You're no longer operating an agent. You're running a small organization of them.
Three things define this level:
- Self-starting: routines and automations kick loops off on a timer or an event, so work happens without you pressing go
- Parallel: many loops run at once in isolated worktrees, a lead integrating their results, instead of one loop at a time
- Self-improving: when the system makes a mistake, the lesson gets written back into CLAUDE.md or a skill, so the correction persists into every future run instead of being lost
What level 5 actually feels like:You describe outcomes, not steps. The system finds the work, dispatches it, checks it, records what's done, and decides what's next. Your job shifts entirely from doing the work to designing the system that does it.
循环自行启动、并行运行、并随时间改进。
第五级是循环设计升格为系统设计的地方。循环不再等你启动,它们自行触发(基于定时任务或事件)、并行运行,并将经验反馈回去,使系统不断自我优化。你不再是操作一个智能体,而是在运营一个小型的智能体组织。
这个层级由三个要素定义:
- 自启动:例程和自动化机制通过定时器或事件触发循环,无需你按下“开始”
- 并行:多个循环在隔离的工作目录中同时运行,由一个主导者整合它们的结果,而非一次只跑一个循环
- 自改进:当系统出错时,教训被写回 CLAUDE.md 或技能库中,因此修正会持续影响未来每次运行,而非被遗忘
第五级实际上的体验是:你描述结果,而非步骤。系统自己找活干、分配任务、检查结果、记录已完成事项,并决定下一步做什么。你的工作完全从执行任务转变为设计执行任务的系统。
You don't jump from 1 to 5. You climb one rung at a time, and each rung is a specific, learnable move:
- 1 → 2: stop asking for single answers. Notice the cycle in your work and run it on purpose
- 2 → 3: write "done" as a condition a machine could check, and let the check decide, not you
- 3 → 4: hand the running of the loop to a goal command, and add guardrails so it can't spin
- 4 → 5: let loops start themselves, run in parallel, and write their lessons back so they improve
The honest takeaway:Most people are stuck at level 1 or 2 not because the higher levels are hard, but because nobody told them the ladder exists. The skill of 2026 isn't a better prompt. It's knowing that "design the loop" is a thing you can do, and climbing one rung at a time until the work runs without you.
你不能从 1 级跳到 5 级。需要一级一级地爬,而每一级都是一个具体、可学的动作:
- 1 → 2:别再只求单个答案。注意到工作中的循环模式,并有意识地运行它
- 2 → 3:将“完成”写成一个机器可检查的条件,让检查来决定,而不是你
- 3 → 4:把循环的运行权交给一个目标命令,并添加护栏以防空转
- 4 → 5:让循环自行启动、并行运行,并将经验写回以便改进
诚实的结论:大多数人卡在第 1 级或第 2 级,不是因为更高层级有多难,而是因为没人告诉他们这个阶梯的存在。2026 年的技能不是写出更好的提示词,而是知道“设计循环”是你能够做到的事,然后一级一级地攀登,直到工作无需你介入也能自行运转。