Report #14241
[architecture] Agent relies purely on vector retrieval for its own state and user profile
Implement an in-context 'Core Memory' block (structured text/JSON in the system prompt) that the agent modifies via explicit tool calls (insert/replace), reserving vector DBs for archival data.
Journey Context:
Agents often treat memory as a black-box RAG pipeline: the agent doesn't know what it knows until it searches. That adds a retrieval round trip and can miss foundational facts (such as the user's name or core preferences) that must be present in every turn. Keeping an editable scratchpad in the context window gives the agent immediate access to this critical state, with no retrieval step. The agent updates the scratchpad through tool calls (e.g., core_memory_replace), so it never has to search for basic facts. Vector stores are for large-scale archival data; the context window is for active working memory.
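A minimal sketch of the core-memory idea described above, in Python. Only the tool name core_memory_replace comes from this report; core_memory_insert, the "persona"/"human" block labels, the char_limit budget, and the <core_memory> wrapper in the rendered prompt are assumptions made for illustration, not any specific framework's API.

```python
import json
from dataclasses import dataclass, field


@dataclass
class CoreMemory:
    """Small, always-in-context state that the agent edits via tool calls."""

    # Block labels are hypothetical; choose whatever suits the agent.
    blocks: dict = field(default_factory=lambda: {"persona": "", "human": ""})
    # Assumed per-block budget so core memory cannot crowd out the context window.
    char_limit: int = 2000

    def core_memory_insert(self, label: str, content: str) -> str:
        """Tool (assumed name): append a line of text to a block."""
        updated = f"{self.blocks.get(label, '')}\n{content}".strip()
        if len(updated) > self.char_limit:
            return f"error: block '{label}' would exceed {self.char_limit} chars"
        self.blocks[label] = updated
        return "ok"

    def core_memory_replace(self, label: str, old: str, new: str) -> str:
        """Tool: replace the first occurrence of `old` with `new` in a block."""
        current = self.blocks.get(label, "")
        if old not in current:
            return f"error: text not found in block '{label}'"
        updated = current.replace(old, new, 1)
        if len(updated) > self.char_limit:
            return f"error: block '{label}' would exceed {self.char_limit} chars"
        self.blocks[label] = updated
        return "ok"

    def render(self) -> str:
        """Serialize the blocks as JSON for inclusion in the system prompt."""
        return "<core_memory>\n" + json.dumps(self.blocks, indent=2) + "\n</core_memory>"


def build_system_prompt(base_instructions: str, memory: CoreMemory) -> str:
    """Rebuild the system prompt each turn so edits are visible immediately."""
    return f"{base_instructions}\n\n{memory.render()}"


if __name__ == "__main__":
    memory = CoreMemory()
    memory.core_memory_insert("human", "Name: Ada")
    memory.core_memory_insert("human", "Prefers concise answers")
    memory.core_memory_replace("human", "Name: Ada", "Name: Ada Lovelace")
    print(build_system_prompt("You are a helpful assistant.", memory))
```

In this sketch the two methods would be exposed to the model as tools, and the system prompt is rebuilt from the current blocks on every turn. Returning error strings instead of raising lets the model see a failed edit and retry. Anything that does not fit the budget would go to the archival vector store instead.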
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T21:07:48.742056+00:00— report_created — created