Report #21213
[architecture] Over-reliance on vector DB for short-term state
Keep recent, highly relevant conversational state in the active context window (or a fast FIFO scratchpad); only offload to the vector store (long-term memory) when the context exceeds its limit or a distinct task or session ends.
Journey Context:
Developers often write every message to a vector DB immediately. This causes retrieval noise (similarity search returns message N when you actually need its neighbors N-1 and N+1) and loses sequential coherence. Short-term memory should be contiguous, so that turns stay in the order they occurred. Long-term memory (the vector DB) is for cross-session or disconnected facts. Offloading should be an explicit agent action (archival), not an automatic side effect of receiving a message.
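The split described above could be sketched roughly as follows. This is a minimal illustration, not a reference implementation: `Message`, `ShortTermMemory`, `InMemoryArchive`, the `max_messages` limit, and the `keep_last` parameter are all hypothetical names chosen for the example, and `InMemoryArchive` is only a list-backed stand-in for a real vector store (no embedding or similarity search is performed). The limit is counted in messages for simplicity; a real agent would more likely count tokens.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class Message:
    role: str
    text: str


class InMemoryArchive:
    """Hypothetical stand-in for a vector store; it only keeps a list."""

    def __init__(self):
        self.records = []

    def add(self, messages, session_id):
        # A real store would embed and index here; this one just records.
        self.records.append((session_id, list(messages)))


class ShortTermMemory:
    """Contiguous FIFO buffer of recent turns, kept in arrival order."""

    def __init__(self, archive, max_messages=4):
        self.archive = archive
        self.max_messages = max_messages
        self.window = deque()

    def append(self, message):
        # Appending never touches the archive: no automatic side effect.
        self.window.append(message)

    def over_limit(self):
        return len(self.window) > self.max_messages

    def archive_oldest(self, session_id, keep_last):
        """Explicit archival: move all but the newest keep_last messages
        to long-term storage as one contiguous chunk."""
        count = len(self.window) - keep_last
        if count <= 0:
            return 0
        chunk = [self.window.popleft() for _ in range(count)]
        self.archive.add(chunk, session_id)
        return count

    def end_session(self, session_id):
        # A finished task or session archives whatever is still in the window.
        return self.archive_oldest(session_id, keep_last=0)

    def context(self):
        return list(self.window)


archive = InMemoryArchive()
memory = ShortTermMemory(archive, max_messages=4)
for i in range(6):
    memory.append(Message("user", f"message {i}"))
    if memory.over_limit():
        # The caller (the agent) decides when to archive.
        memory.archive_oldest("session-1", keep_last=2)
```

Archiving the oldest turns as a single chunk, rather than one record per message, keeps neighboring messages together in long-term storage, which is one way to reduce the N versus N-1/N+1 retrieval problem described above.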
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T14:00:46.076109+00:00 - report_created - created