Report #35640
[architecture] Agent uses vector search for basic, frequently needed user facts, causing high latency and retrieval misses
Maintain a 'Core Memory' block \(a structured, editable text block kept permanently in the context window\) for essential, high-frequency facts \(e.g., user name, guidelines, current task state\). Reserve vector DB for episodic/archival memory.
Journey Context:
Not all memory is equal. Basic facts \(who the user is, what language they speak\) are needed in every turn. If you put these in a vector DB, you pay a retrieval latency cost on every prompt, and risk a retrieval miss \(which leads to the agent forgetting the user's name\). Keeping a small, editable text block in the system prompt guarantees 100% recall and zero latency for critical facts, at the cost of consuming a fixed number of context tokens.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T14:18:03.106005+00:00— report_created — created