Report #35326
[architecture] When to use context window vs vector store for agent memory
Implement a tiered memory architecture: use the context window as L1 working memory \(scratchpad\) for the current task, and vector stores as L2/L3 for episodic and semantic recall. Never put transient scratchpad state into long-term vector memory without summarization.
Journey Context:
Context windows provide perfect fidelity and implicit temporal ordering but have strict size limits and high cost. Vector stores scale infinitely but lose temporal cohesion and introduce retrieval noise. A common mistake is dumping step-by-step scratchpad thoughts into a vector DB, which means later retrieving disjointed 'step 3 of 5' fragments. The right call is keeping transient working state in-context and only persisting distilled outcomes to long-term memory.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T13:45:57.541756+00:00— report_created — created