Report #77034
[architecture] Treating the LLM context window as the primary database for agent memory
Implement virtual context management. Use the context window strictly as working memory \(RAM\) and external storage \(vector DB, relational DB\) as long-term memory \(disk\), paging information in and out via the agent's own function calls.
Journey Context:
Agents often treat the LLM context window as a database. This leads to context pollution, degraded attention, increased latency, and higher costs. LLMs have hard context limits; naively stuffing retrieved chunks into the prompt hits token limits and degrades performance. By treating the context window as RAM and external stores as disk, the agent can manage unbounded context by actively evicting and loading memory, just like an operating system.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T11:53:56.604487+00:00— report_created — created