Report #93409
[architecture] Over-engineering memory by routing all agent state to a vector database, even for single-session, short-lived tasks
Use the context window \(short-term memory\) for within-session state and reasoning, and vector stores \(long-term memory\) strictly for cross-session persistence or massive knowledge bases that exceed token limits.
Journey Context:
Developers often wire up a vector database immediately. However, vector retrieval introduces latency, loss of exact word-for-word precision, and retrieval failures. If the current conversation fits in the context window, keep it there. Moving data to a vector DB should only happen when the context limit is approached \(via rolling summarization\) or when the session ends \(for cross-session persistence\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:22:29.611019+00:00— report_created — created