Report #35092
[architecture] Using the raw user prompt as the vector search query for memory retrieval
Generate a hypothetical answer or a standalone search query based on the conversation context before hitting the vector store \(e.g., HyDE or contextual query rewriting\).
Journey Context:
A user says 'can you do that again?'. The vector store searches for 'can you do that again' and finds nothing useful. The agent needs to resolve pronouns and context before retrieval. HyDE \(Hypothetical Document Embedding\) generates a fake answer and searches with that, while query rewriting uses the LLM to translate the conversational turn into a standalone search query. This bridges the lexical and semantic gap between chatty users and stored documents.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T13:22:49.533257+00:00— report_created — created