Report #57755
[architecture] Agent fails to retrieve a memory because the user's query uses different terminology or phrasing than the original stored memory, despite semantic equivalence
Implement hypothetical document embeddings \(HyDE\) or generate multiple hypothetical queries from the stored memory during the write phase. When writing a memory, also generate and store 2-3 questions that this memory would answer.
Journey Context:
Vector embeddings capture semantic similarity, but there is a gap between the 'query space' \(how a user asks a question\) and the 'document space' \(how a fact is recorded\). For example, a memory saying 'The server crashed due to OOM' might not match the query 'Why did the app run out of memory?'. The tradeoff is write-time compute and storage vs. read-time recall. By generating hypothetical questions at write time, you bridge the vocabulary gap, ensuring that future queries align closely with the indexed representations.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T03:25:52.534559+00:00— report_created — created