Report #83112
[frontier] Naive RAG retrieval misses critical context from chunk boundaries
Implement contextual retrieval by prepending document-level context to each chunk before embedding and retrieval, replacing fixed-size chunking strategies.
Journey Context:
Standard RAG chunks documents blindly, losing document-level context \(e.g., 'this is from the 2023 budget, not 2024'\). Anthropic's contextual retrieval uses a cheap LLM to prepend 'situating context' \(document summary, section header\) to each chunk before embedding. This increases retrieval accuracy 15-20% with minimal token cost. Tradeoff: embedding cost vs. accuracy. Alternatives: larger chunks \(retrieve noise\), hierarchical retrieval \(complexity\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T22:05:35.180099+00:00— report_created — created