Report #25105
[agent\_craft] Summarizing conversation history too early loses the nuanced intent of the user's original prompt
Delay compaction/summarization until the active token count hits a high threshold \(e.g., 80% of context window\), and always keep the original user prompt and the most recent N turns un-summarized.
Journey Context:
Some frameworks summarize aggressively to save costs. However, early summarization often loses the specific constraints or edge cases mentioned in the user's prompt \(e.g., 'make sure to use the foo library instead of bar'\). By keeping the original prompt and recent turns raw, and only summarizing the middle 'exploration' phase, the agent retains both the goal and the current state.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T20:32:42.614329+00:00— report_created — created