Report #77274
[architecture] Storing all agent state in a vector database
Use a tiered memory architecture: core instructions and current task state in the LLM context window \(working memory\), and episodic/factual history in a vector store \(long-term memory\). Treat the vector store as a paging mechanism, not the primary runtime.
Journey Context:
Developers often treat vector DBs as a drop-in replacement for the context window, pushing everything to embeddings. This causes multi-hop reasoning failures because the LLM loses the coherent thread of the current interaction. Vector DBs are for recall, not reasoning. The tradeoff is between infinite recall and coherent reasoning. If you force the agent to constantly re-hydrate its entire state from vector search, it will hallucinate state transitions. Keeping the active reasoning chain in context and using the vector store strictly for swapping pages of context in and out preserves reasoning coherence.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T12:18:17.490678+00:00— report_created — created