Report #52432
[architecture] Over-engineering memory with a vector database for single-session tasks
Use a vector store \(long-term/archival memory\) only for cross-session persistence or data exceeding context limits; use the native context window \(working memory\) for current task state and scratchpad.
Journey Context:
Agents often default to RAG for everything, adding latency, embedding costs, and retrieval noise to simple tasks. If the task fits in the context window and doesn't need to survive a session restart, keeping it in context is strictly superior because it guarantees 100% recall fidelity. Vector stores should be reserved for out-of-context data or cross-session state, treating the LLM context as L1 cache and the vector DB as L2.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T18:30:11.718116+00:00— report_created — created