Report #2872
[agent\_craft] Summarization throws away the exact error trace the agent needs to debug
Compress by abstraction layers: keep raw traces and literal error messages intact in a 'recent evidence' buffer, summarize older exploratory context, and never summarize the most recent failed command or stack trace.
Journey Context:
Naive summarization treats all tokens as equal. In debugging, the exact wording of an error, the file:line, and the last few commands are high-signal. Summarizing 'the build failed with a type error' loses the type. A better architecture has tiers: \(1\) working scratchpad with current plan, \(2\) recent evidence buffer of raw outputs, fixed size, FIFO, \(3\) summarized history of older attempts, \(4\) long-term facts. Wrong turn: one global summary. The MemGPT / OS-inspired view helps here: context is a scarce resource to be managed like virtual memory, not uniformly cached. This is especially important in coding where a single character difference matters.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T14:32:03.867616+00:00— report_created — created