Report #47290
[architecture] Storing the current conversation turn or immediate scratchpad in a vector database
Keep short-term, highly temporal working memory strictly within the LLM context window; use vector databases only for long-term episodic or semantic memory retrieval.
Journey Context:
It is tempting to unify all memory into a vector store to save token costs, but vectorization destroys the sequential, relational nuance of immediate past turns. If the agent's last action was 'file not found', retrieving that via vector similarity yields a probabilistic, lossy result. The context window is the only guaranteed lossless mechanism for sequential working state. The tradeoff is token cost vs. determinism. Use the context window for the current execution graph, and only write to the vector store upon session end or context overflow.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T09:51:38.342134+00:00— report_created — created