Report #38992
[frontier] RAG chunks lose contextual metadata causing retrieval failures in agent tool use
Implement Contextual Retrieval: prepend chunk-specific context (synthesized by an LLM) to each chunk before vectorization, and use reranking to filter noise
Journey Context:
Standard chunking destroys document-level context: a chunk may say 'the policy states' with no indication of which policy it means. Contextual Retrieval generates a short description for each chunk (for example, 'This chunk is from Section 2 of the HR handbook regarding vacation days') and prepends it to the chunk text before embedding, so a query that names the document or topic can still match the chunk. This improves retrieval accuracy for agent tools.
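A minimal sketch of the idea, under stated assumptions: `Chunk`, `template_context`, `contextualize`, and `retrieve` are hypothetical names, not part of any library. The LLM call is replaced by a fixed template, and the embedding and reranking steps are replaced by a toy word-overlap score, so this only illustrates why prepending context helps and is not a production pipeline.

```python
from dataclasses import dataclass
from typing import Callable, List, Set


@dataclass
class Chunk:
    doc_title: str   # document the chunk was cut from
    section: str     # section label within that document
    text: str        # raw chunk text, as produced by the chunker


def template_context(chunk: Chunk) -> str:
    # Assumption: stands in for an LLM that writes chunk-specific context.
    return f"This chunk is from {chunk.section} of the {chunk.doc_title}."


def contextualize(
    chunks: List[Chunk],
    describe: Callable[[Chunk], str] = template_context,
) -> List[str]:
    # Prepend the generated context to each chunk before it is indexed.
    return [f"{describe(c)}\n{c.text}" for c in chunks]


def tokens(text: str) -> Set[str]:
    # Lowercase words with surrounding punctuation stripped.
    return {w.strip(".,'\"()").lower() for w in text.split()} - {""}


def overlap_score(query: str, passage: str) -> int:
    # Toy relevance score: number of shared words (not a real embedding).
    return len(tokens(query) & tokens(passage))


def retrieve(query: str, passages: List[str], top_k: int = 1) -> List[int]:
    # Return indices of the top_k passages; ties keep their original order.
    order = sorted(
        range(len(passages)),
        key=lambda i: overlap_score(query, passages[i]),
        reverse=True,
    )
    return order[:top_k]


chunks = [
    Chunk("IT security guide", "Section 5 (passwords)",
          "The policy states that passwords rotate every 90 days."),
    Chunk("HR handbook", "Section 2 (vacation days)",
          "The policy states that employees accrue 1.5 days per month."),
]
query = "HR handbook vacation policy"

raw_passages = [c.text for c in chunks]
contextual_passages = contextualize(chunks)

raw_hit = retrieve(query, raw_passages)                # both chunks tie on "policy"
contextual_hit = retrieve(query, contextual_passages)  # HR chunk now stands out
```

With the raw text, both chunks match the query only on the word "policy", so the ranking cannot tell them apart. After the context is prepended, the HR chunk also matches "HR", "handbook", and "vacation". In a real setup, `describe` would call an LLM with the full document and the chunk, the overlap score would be replaced by an embedding model, and a reranker would reorder the top results.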
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T19:55:23.761674+00:00— report_created — created