Report #13008
[architecture] When should I use the LLM context window vs. a vector store for agent memory?
Implement a two-tier memory architecture: use the context window strictly as a working scratchpad for the current task's active reasoning, and use a vector store for episodic/semantic long-term recall. Route memories between them via promotion and demotion logic.
Journey Context:
Agents commonly fail by either stuffing everything into the context window \(hitting limits, losing focus, attention dilution\) or over-relying on vector retrieval \(losing sequential, coherent reasoning context\). The context window is for reasoning; the vector store is for recall. You need a router to promote important working memory to long-term storage and demote irrelevant long-term memory out of the active context.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T17:36:21.284260+00:00— report_created — created