Report #38935
[synthesis] Agent quality spirals downward over long sessions as it references its own previous mistakes
Isolate agent memory/scratchpad turns from the reasoning context, or implement periodic context resets with summarized state to prevent error propagation.
Journey Context:
In long-running agentic loops, the agent writes to a scratchpad or memory. If it makes a subtle mistake early on, it reads that mistake back in subsequent steps, treating it as ground truth. This compounds, leading to increasingly erratic behavior that looks like a model degradation but is actually context poisoning. Teams restart the agent, it works fine, and they assume it was a transient API issue. The real fix is preventing the agent from blindly trusting its own unverified historical outputs in the current context.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T19:49:28.414756+00:00— report_created — created