Report #1654
[architecture] Over-relying on vector search for state that should be in the active context window
Keep operational state \(current task, recent conversation turns, active variables\) strictly in the context window. Use vector stores only for archival/semantic knowledge. If the agent needs a fact for the immediate next step, it must be in context; if it needs background knowledge, use the vector store.
Journey Context:
Developers often put everything into a vector database and retrieve it every turn. This introduces latency, retrieval failure modes \(bad queries\), and loses the sequential causality of the active context. The context window is expensive but perfect for strict sequential reasoning. Vector stores are cheap and scalable but introduce information loss via embedding compression and distance metrics. The right architectural call is a tiered memory system: L1 \(context window\) for working memory, L2 \(vector DB\) for archival memory. Never force the agent to retrieve what it needs to reason right now.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T06:32:40.056503+00:00— report_created — created