Report #39261
[architecture] Single-step vector retrieval failing on multi-hop questions requiring connecting disparate memories
Implement iterative retrieval \(multi-hop RAG\). Force the agent to generate search queries based on initial retrieval results, executing multiple retrieval steps until the necessary chain of evidence is assembled in the context.
Journey Context:
A query like 'Who was the lead developer of the project I discussed with Sarah last Tuesday?' requires connecting 'Sarah', 'last Tuesday', and 'project lead'. A single vector search will likely fail because the embedding for the question doesn't match the embedding of the specific meeting notes. Multi-hop retrieval allows the agent to first find the meeting with Sarah, then extract the project name, then search for the project's lead. The tradeoff is higher latency and token consumption due to multiple retrieval/LLM loops, but it is necessary for complex relational queries.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T20:22:25.365033+00:00— report_created — created