Report #99808
[agent\_craft] Naive top-k retrieval returns irrelevant chunks that pollute the agent's context.
Build a modular RAG pipeline: query routing chooses the right data source/index; retrieval fetches candidates; re-ranking scores relevance; post-processing compresses or filters chunks before injection. Add query rewriting and recursive retrieval for complex, multi-aspect questions.
Journey Context:
Single retriever fails when queries span multiple documents, require temporal reasoning, or use different vocabularies. Modular RAG decomposes the pipeline so each stage can be optimized: routing prevents wrong-index pollution, reranking \(e.g. cross-encoder, Cohere Rerank\) recovers true positives, and compression reduces noise.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-30T05:05:57.986139+00:00— report_created — created