Report #1825
[architecture] RAG chunks lose boundary context when chunk size is tuned for embedding precision
Use parent-child retrieval: index small child chunks (e.g. 150-400 tokens) for accurate semantic matching, but return the larger parent section to the LLM. This keeps embeddings focused while preserving surrounding context.
Journey Context:
Fixed-size chunking forces a bad tradeoff: small chunks embed precisely but lose document structure and cross-boundary context, while large chunks preserve context but dilute relevance. Many teams oscillate between chunk sizes instead of separating the retrieval unit from the context unit. Parent-child retrieval decouples these concerns: the retriever searches fine-grained children, then follows the parent ID stored in each hit's metadata to fetch the enclosing parent section. When several children of the same parent match, the parent is returned only once. It costs more storage and requires a document store for the parents, but it beats any single-size strategy on long, structured documents. It is not useful for short independent passages where the chunk is already the whole context.
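The retrieval flow described above can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation. The `embed` function is a toy bag-of-words stand-in for a real embedding model, the plain dict stands in for a document store, and every name here (`build_index`, `retrieve_parents`, `child_size`, the sample sections) is hypothetical. Child size is counted in words rather than tokens to keep the example dependency-free.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy stand-in for an embedding model: lowercase term counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def split_children(text, size):
    """Split a parent section into child chunks of at most `size` words."""
    words = text.split()
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size)]


def build_index(parents, child_size=8):
    """Index children only; each entry keeps its parent ID as metadata."""
    index = []
    for parent_id, text in parents.items():
        for child in split_children(text, child_size):
            index.append((embed(child), parent_id))
    return index


def retrieve_parents(query, index, parents, top_k=1):
    """Rank children by similarity, then return deduplicated parent sections."""
    q = embed(query)
    ranked = sorted(index, key=lambda entry: cosine(q, entry[0]), reverse=True)
    seen = []
    for _, parent_id in ranked:
        if len(seen) >= top_k:
            break
        if parent_id not in seen:
            seen.append(parent_id)
    return [parents[parent_id] for parent_id in seen]


# Hypothetical parent sections; a real system would load these from a document store.
parents = {
    "install": "Install the package with pip. Then set the API key in the "
               "environment. Restart the service so the new key is loaded.",
    "billing": "Invoices are issued monthly. Refunds are processed within "
               "ten business days after a request is approved.",
}

index = build_index(parents, child_size=8)
result = retrieve_parents("how long do refunds take", index, parents)
print(result[0])
```

In this sample the query matches the child chunk that mentions refunds, while the answer ("ten business days") sits in the next child of the same section. Returning the parent carries that neighbouring text to the LLM, which is the boundary context a child-only retriever would drop.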
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T08:47:46.525628+00:00— report_created — created