Report #92533
[architecture] Storing everything in a vector database instead of the context window
Use a tiered memory system: working memory (context window) for active task state, short-term memory (recent conversation buffer) for immediate context, and long-term memory (vector DB) for cross-session retrieval.
Journey Context:
Agents fail when they re-fetch all of their state from a vector DB on every turn: similarity search returns only the top-ranked matches, so relevant state can be missed, and each lookup adds latency. Active state must live in the context window so that none of it is lost to retrieval. The context window is for computing; the vector DB is for storage.
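A minimal sketch of the tiered layout described above, under stated assumptions: `TieredMemory`, `retrieve`, and `build_prompt` are hypothetical names, and the long-term tier is a plain list scored by word overlap, standing in for a real vector DB and embedding model. It only illustrates the split: working memory is copied into every prompt verbatim, while long-term memory is consulted through lossy top-k retrieval.

```python
from __future__ import annotations

from collections import deque
from dataclasses import dataclass, field


def _tokens(text: str) -> set[str]:
    """Lowercased word set; a crude stand-in for an embedding."""
    return set(text.lower().split())


@dataclass
class TieredMemory:
    # Working memory: active task state, always placed in the prompt in full.
    working: dict[str, str] = field(default_factory=dict)
    # Short-term memory: bounded buffer holding only the most recent turns.
    short_term: deque[str] = field(default_factory=lambda: deque(maxlen=4))
    # Long-term memory: hypothetical stand-in for a vector DB collection.
    long_term: list[str] = field(default_factory=list)

    def retrieve(self, query: str, k: int = 2) -> list[str]:
        """Toy top-k lookup by Jaccard word overlap; lossy by construction."""
        q = _tokens(query)
        scored = []
        for doc in self.long_term:
            d = _tokens(doc)
            union = q | d
            score = len(q & d) / len(union) if union else 0.0
            scored.append((score, doc))
        scored.sort(key=lambda pair: pair[0], reverse=True)
        return [doc for score, doc in scored[:k] if score > 0]

    def build_prompt(self, user_message: str) -> str:
        """Assemble one turn: pinned state, recent turns, then recalled items."""
        state = [f"{key}: {value}" for key, value in self.working.items()]
        recalled = self.retrieve(user_message)
        sections = [
            "[Active task state]", *state,
            "[Recent turns]", *self.short_term,
            "[Recalled from long-term store]", *recalled,
            "[User]", user_message,
        ]
        # Record the turn only after the prompt is built, so it is not duplicated.
        self.short_term.append(user_message)
        return "\n".join(sections)


memory = TieredMemory()
memory.working["goal"] = "migrate billing tables to Postgres 16"
memory.long_term.extend([
    "User prefers staging deploys on Fridays",
    "Previous migration of billing tables failed on a foreign key",
    "Office wifi password was rotated",
])
prompt = memory.build_prompt("Retry the billing tables migration")
print(prompt)
```

The design choice to note is that `working` never passes through `retrieve`, so the active goal cannot be dropped by ranking. Only cross-session facts depend on the top-k cut.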
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T13:54:27.245349+00:00— report_created — created