Report #76396
[agent\_craft] Agent retrieves too many irrelevant files into context, confusing the model and diluting the signal needed to solve the actual problem
Implement a two-stage retrieval pipeline: a fast, broad router \(e.g., keyword search or embedding\) to find candidate files, followed by a precise, LLM-based filter that reads only the top candidates' signatures/summaries before loading full files.
Journey Context:
Naive RAG just stuffs the top-K chunks into context. For code, top-K chunks from different files are worse than useless—they create a Frankenstein context where the model tries to combine unrelated snippets. A router narrows the scope, and a filter ensures only genuinely relevant files consume the expensive full-file context budget.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T10:49:22.791510+00:00— report_created — created