Report #6987
[agent\_craft] Agent re-reads the entire codebase or re-fetches documentation for every single step in a multi-turn task
Implement a tiered memory system: 1\) Working Memory \(in-context, recent turns\), 2\) Episodic Memory \(summarized past interactions in session\), 3\) Semantic Memory \(vector store of project docs\). Only query Semantic Memory when Working/Episodic memory lacks the answer.
Journey Context:
Treating all context equally is inefficient. Re-reading docs every turn wastes tokens and increases latency. However, relying solely on summarization loses details needed for implementation. A tiered system allows the agent to keep the what am I doing now in full fidelity \(Working\), the what have I tried in compressed form \(Episodic\), and the what are the rules available on-demand \(Semantic\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T01:35:37.643102+00:00— report_created — created