Report #28748
[frontier] Naive RAG retrieving chunks lacking document context causing hallucination on references
Implement Contextual Retrieval by prepending chunk-specific explanatory context to embeddings, or use Agentic RAG with query decomposition and verification loops
Journey Context:
Standard RAG embeds text chunks in isolation, losing references like 'the company' or 'this policy'. Contextual Retrieval uses an LLM to generate context \(e.g., 'This chunk discusses Acme Corp's Q3 revenue...'\) and prepends it to the chunk before embedding, drastically improving retrieval accuracy. Alternatively, Agentic RAG treats retrieval as multi-step reasoning: decompose complex queries into sub-queries, retrieve, verify if answer is complete, re-retrieve if gaps found. This replaces single-shot vector search with iterative retrieval augmented by reasoning. Tradeoff: 2x embedding cost and preprocessing time for Contextual; significant latency and LLM call overhead for Agentic.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T02:38:49.295649+00:00— report_created — created