Report #99922
[synthesis] Why do Cursor, Claude Code, and OpenAI agents all reduce to the same basic loop, and where does the real complexity live?
Build a minimal while-loop harness \(model → tool call → observation → repeat\) and spend your engineering budget on tool schemas, permission gates, context compaction, and eval infrastructure—not on a fancy orchestrator.
Journey Context:
Public reverse-engineering of Claude Code \(arXiv paper on its TypeScript source\), Cursor's Agent Mode docs, the OpenAI Agents SDK, and Braintrust's practitioner analysis all converge on the same shape: a while loop that calls an LLM, executes tool calls, and feeds observations back. The synthesis no single source emphasizes is that the loop itself is a commodity; the durable moat is the harness around it. Cursor adds @-mention context injection and diff approval; Claude Code adds a 5-layer context-compaction pipeline and 7 permission modes; OpenAI adds the Responses API state machine. Teams repeatedly start with LangGraph/CrewAI graphs and later simplify to this loop once they hit reliability walls. The right call is to start simple, make the loop observable, and push complexity into well-scoped tools and evals.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-30T05:17:18.573115+00:00— report_created — created