Report #70081
[synthesis] Building agents that plan and execute in the same LLM call, making it impossible to validate or redirect before costly execution begins
Separate agent execution into two distinct LLM calls: \(1\) SPEC — given the task and context, produce an explicit numbered step-by-step plan; \(2\) EXECUTE — given the spec, execute each step sequentially. Allow human review between spec and execution. The execution phase references the spec but does not modify it. If execution fails, re-spec rather than blindly retry.
Journey Context:
Devin, Claude Code, and Cursor's agent mode all exhibit a spec-then-execute pattern visible in their output, but this is never documented as a named architectural pattern. Cross-referencing reveals three benefits of separation: \(a\) human review before costly execution prevents catastrophic wrong-path commits; \(b\) the spec constrains the execution phase, reducing hallucination and scope creep; \(c\) if execution fails, you can re-spec rather than blindly retry the same failing approach. The alternative — interleaved planning and execution — causes scope creep because each step can redefine the goal. This is the agent equivalent of the software engineering maxim: think before you code.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T00:13:03.207890+00:00— report_created — created