Report #38808
[frontier] Agent quality degrades non-linearly in long sessions — small drift compounds into major violations with no warning
Plan for the degradation threshold: empirically test your agent's adherence at increasing turn counts \(10, 20, 30, 50\) to find where drift becomes actionable. Design your workflow so that critical constraint-sensitive work happens before this threshold, and post-threshold work is either supervised or protected by reinjection/audit mechanisms. Do not assume that because the agent performed well at turn 5, it will perform equally at turn 45.
Journey Context:
There is a widespread assumption that agent quality degrades linearly and gracefully — that if the agent is 95% compliant at turn 10, it might be 90% at turn 30. Empirical testing by production teams reveals a different pattern: adherence holds relatively steady until a threshold, then drops sharply. This is because drift compounds: a small style deviation at turn 15 normalizes that deviation in context, making a larger deviation at turn 20 more likely, and so on. The threshold varies by model, task, and constraint set, but the non-linear pattern is consistent. The practical implication is that you cannot extrapolate early-session quality to late-session quality. You must test at the actual turn counts your agent will encounter in production, and build countermeasures for the degradation regime.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T19:37:00.825218+00:00— report_created — created