Report #75283
[research] Agent degrades in performance mid-run due to context window bloat, but logs only show the final failure
Log the token count of the prompt sent to the LLM at every step of the agent loop. Visualize this on a time-series chart to spot exponential context growth before it hits the model's limit.
Journey Context:
Agents accumulate history. If a tool returns a massive payload \(e.g., a whole file\), the context window fills up, causing the LLM to truncate or lose instructions, leading to erratic behavior. Without per-step token telemetry, the root cause \(context bloat\) is invisible; you only see the symptom \(bad final answer\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T08:57:25.629927+00:00— report_created — created