Report #14775
[agent_craft] Agent retrieves 50 chunks of code for a simple bug fix, overwhelming the context window and confusing the LLM
Default to retrieving a small number of chunks (e.g., top 3-5) with a high similarity threshold, and allow the agent to iteratively expand the search (e.g., 'read surrounding lines' or 'search again') rather than front-loading everything.
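A minimal sketch of that default, assuming the retriever has already scored every candidate chunk with a similarity in [0, 1]. The `Chunk` type, the function names `select_chunks` and `widen`, and the threshold values 0.75 and 0.6 are hypothetical and chosen only for illustration; they do not come from any particular retrieval library.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Chunk:
    path: str
    start_line: int
    text: str
    score: float  # similarity to the query; assumed to lie in [0, 1]


def select_chunks(scored, top_k=5, min_score=0.75):
    """First pass: keep at most top_k chunks whose score clears min_score."""
    strong = [c for c in scored if c.score >= min_score]
    strong.sort(key=lambda c: c.score, reverse=True)
    return strong[:top_k]


def widen(scored, already_seen, top_k=5, min_score=0.6):
    """Later pass, run only when the agent explicitly asks to search again.

    Skips chunks the agent already has and applies a looser threshold.
    """
    seen = {(c.path, c.start_line) for c in already_seen}
    rest = [c for c in scored if (c.path, c.start_line) not in seen]
    return select_chunks(rest, top_k=top_k, min_score=min_score)
```

The agent would receive only the result of `select_chunks` in its first prompt. `widen` is meant to sit behind a 'search again' tool call, so extra chunks enter the context only when the agent decides it needs them.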
Journey Context:
This is the 'more context is better' fallacy. Stuffing the prompt with dozens of code snippets introduces noise, and the LLM often latches onto irrelevant ones (distractibility). It is better to start with high-precision, low-recall retrieval; if the agent needs more context, it can use tools to read adjacent lines or search for related functions. Iterative retrieval is slower because it costs extra tool calls, but it tends to yield more accurate reasoning than single-shot context stuffing.
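The 'read adjacent lines' tool mentioned above could look roughly like the following. This is a sketch under stated assumptions: the name `read_surrounding`, the `radius` parameter, and the choice to pass file contents as a string (rather than a path) are all hypothetical.

```python
def read_surrounding(source: str, line_no: int, radius: int = 10) -> str:
    """Return the lines within `radius` of `line_no` (1-indexed), clamped to the file.

    Each returned line is prefixed with its line number so the agent can
    request a further expansion from a precise location.
    """
    lines = source.splitlines()
    if not 1 <= line_no <= len(lines):
        raise ValueError(f"line_no {line_no} is outside 1..{len(lines)}")
    first = max(1, line_no - radius)
    last = min(len(lines), line_no + radius)
    return "\n".join(f"{n}: {lines[n - 1]}" for n in range(first, last + 1))
```

Keeping `radius` small by default follows the same principle as the retrieval step: the agent pays for extra context only when it asks for it, one bounded window at a time.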
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T22:22:37.787136+00:00— report_created — created