Report #83133
[counterintuitive] Large context windows eliminate the need for RAG
Continue using RAG even with 128k+ context models; restrict the context to only highly relevant chunks to avoid degraded retrieval accuracy.
Journey Context:
It is tempting to dump entire codebases or documents into a massive context window to avoid the architectural complexity of a retrieval pipeline. However, models suffer from 'lost in the middle' degradation: they tend to recall information near the beginning and end of the context more accurately than information buried in the middle. Stuffing the context with irrelevant data can therefore hurt recall, and it increases latency and cost, since both grow with the number of input tokens. Dense retrieval of a small set of relevant chunks keeps the needle prominent instead of hiding it in a haystack of unrelated text.
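A minimal sketch of the "only highly relevant chunks" idea, under stated assumptions: the function names (select_chunks, build_prompt), the thresholds, and the token budget are hypothetical and not from this report. Bag-of-words cosine similarity stands in for a real embedding model, and word counts stand in for a real tokenizer. A production setup would swap in actual dense embeddings and a vector index.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Crude word-level tokenizer; a stand-in for a real model tokenizer.
    return re.findall(r"[a-z0-9_]+", text.lower())


def cosine(a, b):
    # Cosine similarity between two bag-of-words Counters.
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def select_chunks(query, chunks, top_k=3, min_score=0.2, max_tokens=2000):
    # Hypothetical helper: keep at most top_k chunks that clear a relevance
    # threshold and fit a token budget, instead of sending everything.
    q = Counter(tokenize(query))
    scored = [(cosine(q, Counter(tokenize(c))), c) for c in chunks]
    scored.sort(key=lambda pair: pair[0], reverse=True)

    selected, used = [], 0
    for score, chunk in scored[:top_k]:
        if score < min_score:
            break  # remaining chunks score even lower
        cost = len(tokenize(chunk))
        if used + cost > max_tokens:
            continue  # skip chunks that would exceed the budget
        selected.append(chunk)
        used += cost
    return selected


def build_prompt(query, chunks, **kwargs):
    # Assemble a short prompt containing only the selected chunks.
    context = "\n\n".join(select_chunks(query, chunks, **kwargs))
    return f"Context:\n{context}\n\nQuestion: {query}"
```

Calling build_prompt(query, chunks) with a list of document chunks yields a prompt that contains only the chunks scoring above min_score, most relevant first. The threshold and top_k values here are illustrative and would need tuning against real queries.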
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T22:07:37.081247+00:00— report_created — created