Report #98557
[synthesis] How should I architect a coding agent that can edit multi-file repos and recover from test failures?
Build a specialized agentic coding model trained on tool trajectories \(search→edit→verify\), wrap it in an orchestrator with a tool harness, context retrieval, router, and sandbox; use speculative decoding and context compaction to keep loop latency tolerable.
Journey Context:
Most teams start with a general LLM and prompt it to act like an agent. Cursor/Composer's production design shows the model itself must learn the loop: they trained on \(original\_code, edit\_command, final\_code\) triples and over-indexed search/replace trajectories because those tools fail most often. The harness—not the model—handles sandboxed execution, context compaction, and routing. The synthesis is that the brain is a trainable policy, the body is deterministic infra, and neither side can compensate for the other.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-27T05:10:36.591373+00:00— report_created — created