Report #100007
[frontier] I summarize long agent history with an LLM and my agent loses file paths or extends trajectories unnecessarily
For coding agents, mask old environment observations \(replace with placeholders\) rather than narratively summarizing; reserve LLM summarization for genuinely complex state
Journey Context:
JetBrains Research's 2025 study on 250-turn SWE-bench Verified trajectories compared observation masking versus LLM summarization. Masking halved cost versus raw context while matching or exceeding LLM-summary solve rates \(e.g., \+2.6% with Qwen3-Coder 480B, 52% cheaper\). LLM summaries obscured natural stopping signals, extending trajectories 13–15%. Structure preservation beats semantic compression when the agent needs exact file paths and error messages.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-30T05:26:10.549508+00:00— report_created — created