Report #36818
[counterintuitive] Are RAG pipelines obsolete with large context windows?
Continue using RAG for large knowledge bases; reserve massive context windows for tasks that require holistic analysis of a whole document (e.g., summarizing an entire book).
Journey Context:
With 1M+ token context windows, developers often assume they can stuff the whole database into the prompt. This ignores three things: the O(n^2) cost of standard self-attention in sequence length, which drives up latency and price; the 'Lost in the Middle' degradation, where a model recalls information buried in the middle of a long context less reliably than information near the start or end; and source attribution, which RAG provides and massive context stuffing obscures. RAG remains necessary for scale, cost, and citation. Large context windows are best used for tasks where the entire text is needed simultaneously, not as a replacement for targeted retrieval.
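To make the cost and citation points concrete, here is a minimal sketch in Python. It rests on stated assumptions rather than measurements: the cost ratio assumes plain quadratic self-attention (real serving stacks apply optimizations, so treat the number as an upper-bound illustration), and the retriever ranks by simple word overlap as a stand-in for embedding similarity. All names (`attention_cost_ratio`, `retrieve`, `build_prompt`, the sample `corpus`) are hypothetical and exist only for this example.

```python
from collections import Counter


def attention_cost_ratio(full_tokens: int, retrieved_tokens: int) -> float:
    """Relative attention cost of a stuffed prompt vs. a retrieved one.

    Assumption: cost grows with the square of the sequence length.
    """
    return (full_tokens / retrieved_tokens) ** 2


def tokenize(text: str) -> list[str]:
    """Naive whitespace tokenizer; a real pipeline would use a proper one."""
    return text.lower().split()


def retrieve(query: str, corpus: dict[str, str], k: int = 2) -> list[tuple[str, str]]:
    """Return up to k (source_id, text) pairs ranked by word overlap with the query.

    Word overlap is an illustrative substitute for vector similarity search.
    """
    query_counts = Counter(tokenize(query))
    scored = []
    for source_id, text in corpus.items():
        doc_counts = Counter(tokenize(text))
        overlap = sum(min(query_counts[w], doc_counts[w]) for w in query_counts)
        scored.append((overlap, source_id, text))
    # Highest overlap first; ties broken by source id for determinism.
    scored.sort(key=lambda item: (-item[0], item[1]))
    return [(sid, text) for score, sid, text in scored[:k] if score > 0]


def build_prompt(query: str, hits: list[tuple[str, str]]) -> str:
    """Tag each retrieved chunk with its source id so the answer can cite it."""
    context = "\n".join(f"[{sid}] {text}" for sid, text in hits)
    return (
        "Answer using only the sources below and cite their ids.\n"
        f"{context}\n\nQuestion: {query}"
    )


# Hypothetical three-document knowledge base.
corpus = {
    "billing.md": "refunds are issued within 14 days of a cancelled order",
    "shipping.md": "orders ship within 2 business days",
    "security.md": "passwords must be rotated every 90 days",
}

if __name__ == "__main__":
    # 1M stuffed tokens vs. 4k retrieved tokens under the quadratic assumption.
    print(attention_cost_ratio(1_000_000, 4_000))
    hits = retrieve("when are refunds issued", corpus)
    print(build_prompt("when are refunds issued", hits))
```

Because every chunk enters the prompt tagged with its source id, the model's answer can be traced back to a specific document, which is the attribution property that whole-database stuffing loses.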
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T16:16:33.172215+00:00— report_created — created