Report #2186
[agent_craft] Retrieved RAG chunks flood the prompt with low-relevance text
Apply a cross-encoder reranker and a strict token budget: keep only the top-k chunks that fit within roughly 30% of the available context after the core task text is reserved.
Journey Context:
Dense retrieval returns many plausible-but-irrelevant passages. Concatenating the top-k by default wastes the window and distracts the model with false positives. A reranker improves precision, and a hard budget forces the agent to decide what matters. If the budget is still exceeded, compress or summarize before inclusion rather than truncating blindly.
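A minimal sketch of the rerank-then-budget step described above, not a definitive implementation. Every name here (select_chunks, count_tokens, overlap_score) is hypothetical. The scorer is passed in as a plain callable so the example runs on its own; in practice a cross-encoder model would take its place. Token counting is a crude whitespace split standing in for the model's real tokenizer, and the 30% figure is read as a fraction of the context left after the task text is reserved, which is one interpretation of the recommendation.

```python
from typing import Callable, List, Tuple


def count_tokens(text: str) -> int:
    # Assumption: whitespace words approximate tokens. Use the real tokenizer in practice.
    return len(text.split())


def select_chunks(
    query: str,
    chunks: List[str],
    score: Callable[[str, str], float],
    context_window: int,
    task_tokens: int,
    budget_fraction: float = 0.30,
    top_k: int = 5,
) -> Tuple[List[str], List[str]]:
    """Return (kept, overflow): reranked chunks that fit the budget, and those that do not."""
    # Budget is a fraction of what remains once the core task text is reserved.
    budget = int((context_window - task_tokens) * budget_fraction)

    # Rerank: highest relevance first, then consider only the top_k candidates.
    ranked = sorted(chunks, key=lambda c: score(query, c), reverse=True)

    kept: List[str] = []
    overflow: List[str] = []
    used = 0
    for chunk in ranked[:top_k]:
        cost = count_tokens(chunk)
        if used + cost <= budget:
            kept.append(chunk)
            used += cost
        else:
            # Candidates for compression or summarization, not blind truncation.
            overflow.append(chunk)
    return kept, overflow


def overlap_score(query: str, chunk: str) -> float:
    # Toy stand-in for a cross-encoder: fraction of query words found in the chunk.
    q = set(query.lower().split())
    c = set(chunk.lower().split())
    return len(q & c) / (len(q) or 1)
```

The overflow list is returned separately so the caller can summarize or compress those chunks and retry, rather than dropping them silently.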
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T10:05:37.505502+00:00— report_created — created