/goal：一个命令改变了我使用 Claude Code 的方式

两年来，Defileo 一直是每次 Claude Code 会话的瓶颈。

输入 prompt，等待输出，review，输入下一个 prompt，等待，review，重复。每个项目数百次 tiny approvals。模型做工作，他保持节奏。没有他坐在那里按键，什么都不动。

上周停止了。他输入一个命令，走到厨房，煮咖啡，回来，工作完成了。

/goal 是什么

/goal 是第一个真正将人类从循环中移除的功能，它改变 Defileo 使用 Claude Code 的方式比任何模型升级都多。

普通 Claude Code 是对话。你问，它行动，你回应，它再行动。每轮都是 disguised as work 的 check-in。

/goal 翻转了这一点。你定义完成是什么样子，Claude 在 closed loop 中运行，模型自己决定工作何时完成。它写、测试、遇到错误、修复、对照你的标准检查自己的输出，要么继续要么因为目标达成而停止。

你不再在循环中。你设置循环运行，然后去了别的地方。

这听起来小。它不是。每个你曾经用过的生产力工具都围绕你的注意力设计。/goal 是第一个围绕你的 absence 设计的。

每个 /goal prompt 应该回答三个问题

需要做什么
如何知道完成了
什么是禁区

骨架：

/goal [task] until [measurable completion condition] without [hard constraints]

弱 prompt：

/goal improve the app performance

没有 finish line。Claude 会 tweak 几样东西，决定它改进了，停止。你不会知道是否有什么改变。

强 prompt：

/goal optimize image loading across all product pages until 
Google PageSpeed scores 85+ on mobile, without changing any 
HTML structure or removing existing images

现在 Claude 有 target（PageSpeed 85+）、scope（仅 product pages）和 clear limits（不碰 HTML 或图片）。它可以运行、针对 PageSpeed 测试、修复不工作的、self-verify 直到 target 达成。

核心机制：Claude 自己评分

/goal 工作是因为 Claude 给自己的工作评分。这就是整个机制。一个小 evaluator 检查每个输出对照你设定的标准，决定目标是否达成，要么停止要么继续。

陷阱一旦看到就明显：如果你的标准 weak，Claude 在 weak work 上宣布胜利。

"Improve performance" 对照 Claude 自己的改进定义评分
"Add tests" 在任何测试存在时通过
"Make it more user-friendly" 无法评分，模型会 hand you something 并称它完成

标准需要是陌生人能在 30 秒内不读代码验证的东西。 PageSpeed 分数。测试通过数。不再出现的特定错误状态。端到端完成的特定用户流。

如果成功条件无法测量，循环无法关闭。如果循环无法关闭，你回到瓶颈位置。

像为初级工程师写 bug report 一样写标准：concrete、observable、falsifiable。

长任务结构

几分钟以上的任务，one-liner 不够。用这个结构：

/goal [what needs to be achieved]

context:
[anything Claude needs to understand about your project, stack, 
or constraints that is not obvious from the codebase]

done when:
- [measurable condition]
- [measurable condition]  
- [measurable condition]

do not:
- [hard limit]
- [hard limit]

progress tracking:
log completed steps to progress.md as you go

progress tracking 是大多数人 skip 但比意识到更重要的那行。长运行会 drift。模型花上下文在自己的输出上，忘记早期决策，自相矛盾。progress 文件给 Claude external memory，给你检查运行的方式而不打断它。

多小时的 goal，再加 plan.md 和 decisions.md。plan 文件是 Claude upfront 承诺的方法。decisions 文件是为什么做特定选择。一起保持长自主运行 coherent，给你回来时的 paper trail。

不只是编码

/goal 的价值至少一半是编码之外。

研究和分析：

/goal find every credible source published in the last 90 days 
on [topic] until I have a structured summary with at least 20 
distinct sources, citing each one with author and date, without 
including any source from a site flagged as low credibility

Claude 运行研究循环、验证来源、构建 summary、对照数量和质量条件检查它，hand you finished document。

内容生产：

/goal write a 1500-word article on [topic] until it passes three 
internal review passes (clarity, originality, voice match against 
the reference samples in /writing-samples) without using any of 
the banned phrases in style-guide.md

Claude 起草、self-review、重写、再 self-review，直到 piece 通过自己的 quality bar。

运维和维护：

/goal audit every dependency in package.json until I have a report 
showing version, last update date, known vulnerabilities, and a 
recommendation to keep, update, or replace each one, without 
actually changing any dependency yet

过去花半天的任务，现在在你做别的事时运行。

每个领域的模式相同：定义 deliverable、定义 bar、定义 limits、走开。

大多数人一个月内不会 figured out 的 unlock

单个 /goal 在一个项目上运行有用。三个 /goal 在三个项目上并行运行 while you sleep 是不同类别的 leverage。

打开三个终端窗口。每个是不同的项目。每个得到不同的 /goal。去睡觉。醒来三个完成的任务。

这工作因为每个循环独立。没有共享上下文，没有跨项目干扰。运行这个 setup 的 operator 不是三倍于 prompt-by-prompt 用户快。他们在运行 prompt-by-prompt 用户物理上无法匹配的三个并行流。

约束不是 Claude。约束是你能多清楚地写三个不同的验收标准。

诚实部分

/goal 在需要实时外部验证的事情上挣扎。需要人类看 UI 并反应的任务。成功是 aesthetic 而非可测量的任务。依赖 Claude 无法测试的服务的任务。

也在有 hidden dependencies 的目标上挣扎。如果完成任务需要你没给 Claude access 的数据，循环会 spin 直到放弃或 hallucinate 解决方案。诚实答案是 upfront 给 Claude access 或选不同工具。

它也不会把你从 bad plan 中拯救出来。明确定义的目标指向错误 objective 只是更快把你带到错误地方。自主性放大你设定的任何方向。

规则：任何有 measurable finish line 且活在 Claude 能 touch 的环境内的是 fair game。其他东西，留在循环中。

设置

安装（如果没有）：

npm install -g @anthropic-ai/claude-code

启动 Claude Code，然后设定目标：

/goal [your task]

长运行不想 approval prompts：

claude --approval-mode full-auto

有用命令：

/pause          暂停当前 goal
/goal clear     取消并重置
/plan           执行前生成 step-by-step plan

整个设置不到两分钟。

新角色

两年来 AI 的生产性使用一直是"我能多快 prompt"。打字速度、prompt 质量、一次能 juggling 多少标签。瓶颈总是你。

/goal 把工作移出你的手。你不再是 operator。你是 brief operator 然后走开的人。

这是不同角色。前面更慢，后面更快。更少打字，更多思考。对以多忙衡量生产力的人更糟。对其他人 dramatically better。

花两分钟 extra 写 goal。加 measurable finish line。加 constraints。然后合上 laptop 让循环完成。

Welcome to being useless, it is the most productive thing you have done all year.