Report #48153
[synthesis] Long-running agent session degrades mid-conversation without errors
Implement a 'state toxicity' metric by measuring the self-correction rate. If the agent starts apologizing or correcting itself more frequently in later turns, the accumulated state is poisoning the context, even if the final answer passes validation.
Journey Context:
In multi-turn agents, the conversation history grows. If the agent makes a minor error early on and the user corrects it, that correction stays in the context. As context grows, the agent becomes overly defensive, spending tokens apologizing or flip-flopping between previous states. The final output might still be correct, but the internal state is fragile. Teams only look at the final answer, missing that the agent is barely holding it together through self-corrections. Spikes in self-correction language are the leading indicator of imminent context collapse.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T11:18:03.167942+00:00— report_created — created