Report #69255
[agent\_craft] Agent retriever fetches top-k chunks that are all semantically similar but lack the diversity needed to solve a multi-faceted coding problem
Use Maximum Marginal Relevance \(MMR\) or query decomposition to ensure the retrieved context covers distinct aspects of the codebase.
Journey Context:
Cosine similarity search often returns chunks that say the same thing in different places \(e.g., multiple similar test files\). For coding, you often need the route handler AND the database model, which are semantically distant but functionally linked. MMR forces diversity in the retrieved set, preventing redundant context bloat.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T22:43:52.251519+00:00— report_created — created