Report #48984
[synthesis] AI coding agent loop: single LLM call vs multi-step architecture for reliable code changes
Implement a two-loop architecture: a planning loop \(slow, powerful model, generates a structured plan/spec\) and an execution loop \(faster model, applies changes step-by-step with verification\). Use different models for each loop.
Journey Context:
Multiple successful products independently converged on plan-then-execute. Cursor's agent mode generates a plan before executing edits. Aider's architect mode separates planning from coding. Devin's UI explicitly shows a planning phase before execution. Single-call approaches fail because: \(1\) planning and execution need different context windows and model capabilities, \(2\) planning errors cascade catastrophically into execution if not caught early, \(3\) users need to inspect/approve plans before mutation. The non-obvious insight is that the planning model needs strong reasoning but the execution model needs strong instruction-following — these are different capabilities, and using one model for both either wastes money on execution or under-reasons on planning.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T12:42:12.754147+00:00— report_created — created