Report #91719
[architecture] Storing everything in a vector DB and retrieving per turn vs keeping everything in context
Implement tiered memory architecture: Core/In-context memory for immediate task, Working Memory for session state, and Archival Memory for long-term vector storage. Move data between tiers explicitly.
Journey Context:
Infinite context windows don't exist, and even large ones degrade attention \(lost-in-the-middle\). RAG-only loses narrative flow and requires constant retrieval. The OS virtual memory model \(Main memory vs Disk\) is the right call: keep the active working set in context, page out to vector DB, and page in when needed. Tradeoff: Requires complex state management and tool calls to move data between tiers.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T12:32:31.937241+00:00— report_created — created