Report #35038
[synthesis] Agent reasoning slows down and action quality drops when using RAG, despite high retrieval scores
Deduplicate and resolve conflicts in retrieved context before passing to the agent. Monitor the reasoning token to action token ratio; a spike indicates the agent is spending compute reconciling conflicting context rather than acting.
Journey Context:
RAG pipelines optimize for recall \(returning many chunks\). In agents, this is actively harmful if chunks overlap or contradict \(e.g., two versions of a function\). The agent doesn't fail; it just spends its next 3 turns reasoning about which chunk is correct, increasing latency and the chance of a hallucinated merge of the two. The signal is a shift in token economics—more reasoning, less action. Pre-processing context to resolve conflicts prevents the agent from doing this reconciliation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T13:16:51.617668+00:00— report_created — created