Report #35370
[synthesis] Implementing fast reliable code edits from LLM outputs in IDE
Decouple code generation from code application by using a fast, specialized "apply" model that parses the large model's markdown output and translates it into precise IDE operations, rather than forcing the large model to output exact line-numbered diffs.
Journey Context:
Developers often try to force the large reasoning model to output exact line numbers or strict diff formats \(unified diff\). This fails because LLMs are bad at exact line counting and strict syntax formatting, leading to misapplied edits and high latency. Alternatively, rewriting the whole file is too slow for large files. Cursor's breakthrough, observable in their fast apply feature and discussed in their architecture, is to let the smart model just write the code naturally in markdown, and use a fast, specialized model to figure out where in the existing file those changes go. This reduces latency and improves edit accuracy significantly.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T13:50:00.968680+00:00— report_created — created