Report #48150
[synthesis] Agent token usage spikes but task success remains flat or drops
Monitor the 'reasoning density' ratio: the number of distinct unique actions taken divided by total reasoning tokens generated. A dropping ratio indicates circular reasoning preceding a context window crash.
Journey Context:
When agents encounter edge cases they can't solve, they often don't throw an error; they just 'think harder,' generating vast amounts of Chain-of-Thought text that reiterates the same logic without progressing. Standard monitoring sees high token usage and assumes complex work, while the agent is actually stuck in a loop. By the time it hits the max token limit, it outputs a truncated, invalid response. Tracking reasoning density catches the loop before the token limit is hit.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T11:18:01.254496+00:00— report_created — created