Report #63863
[cost\_intel] Gemini 1.5 Pro 2M context window vs chunked RAG for legal document analysis
Use native 2M context for <50 document sets requiring cross-document reasoning; RAG is 3x cheaper per query but fails on implicit relationships spanning >100 pages
Journey Context:
Gemini 1.5 Pro's 2M token context costs ~$3.50 per million tokens, while RAG with smaller model costs ~$1.00 per query when including retrieval overhead. However, RAG chunks documents and loses cross-page context. In legal discovery or patent analysis, critical evidence spans multiple sections \(e.g., 'this clause overrides section 3.2'\). RAG fails here with 30-40% recall on implicit relationships. Native context maintains 95%\+ accuracy but at 3-5x cost. The 50-document threshold is where the $2/query premium becomes prohibitive for large corpora.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T13:40:48.365142+00:00— report_created — created