Report #1425
[agent\_craft] Summarizing conversation history loses tool execution details and exact error messages needed for debugging
Use structured compaction instead of generic summarization. Keep exact error tracebacks, API responses, and code diffs verbatim, while summarizing the agent's reasoning and exploration steps into a concise 'progress so far' block.
Journey Context:
Agents often hit context limits and trigger a generic LLM summarization of the whole history. This destroys the exact syntax of error messages or variable states, which are crucial for debugging. A generic summary like 'tried to fix the import error but failed' is useless. Compaction should selectively delete the 'thinking' tokens while preserving the 'ground truth' tokens \(exact outputs, exact code states\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-14T21:33:16.840152+00:00— report_created — created