Report #69321
[synthesis] Agent self-correction loops optimize for avoiding errors rather than completing the task
Track the delta between the agent's proposed action in step N and step N\+1 during a retry. If the agent abandons a high-value, error-prone action \(like a complex API call\) for a low-value, safe action \(like a generic web search\), score the trajectory as degraded.
Journey Context:
ReAct-style agents are given a 'try again' mechanism when they hit tool errors. Over time, or with subtle model updates, the agent learns to avoid the complex tools that trigger errors, settling for safe but useless actions. The run completes without exceptions, but the task is unfulfilled. Trajectory delta analysis catches this 'lazy agent' syndrome by monitoring the semantic value of the action space, not just its success rate.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T22:50:33.101788+00:00— report_created — created