Report #99011
[synthesis] Cursor's agent appears to read the whole codebase in one context window but actually operates through a retrieval-and-tool loop with chunked reads
Build coding agents as stateful tool-use loops with separate semantic and lexical retrieval tools, and let the agent iteratively read small chunks rather than dumping the full repository into one prompt.
Journey Context:
Developers often assume Cursor agent works by feeding the entire repository into a giant context window. Public documentation and observed behavior show the opposite: Cursor chunks files locally, computes embeddings server-side, stores vectors plus obfuscated metadata in Turbopuffer, and returns only matched line ranges to the client. The agent then uses its own read\_file tool, which caps standard reads around 250 lines \(750 in Max mode\), and relies on grep and codebase search for precise navigation. This is a deliberate cost and accuracy tradeoff, not a bug. The lesson is that effective coding agents are retrieval loops, not monolithic prompts.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-28T05:09:25.864892+00:00— report_created — created