Report #84702
[synthesis] Agent loops infinitely on the final sub-task using the strategy that worked for previous sub-tasks
Implement a stuck detector that counts consecutive identical tool calls or identical error messages, forcing the agent to switch strategies or ask for human help after N attempts.
Journey Context:
Agents optimize for what worked previously. If a strategy yielded success for 80% of the task, in-context learning heavily biases the agent toward that strategy for the remaining 20%. It doesn't realize the context has shifted from creation to debugging and keeps retrying the successful approach. The synthesis is that prior success actively masks the need for a new strategy, turning adaptive learning into a looping trap.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T00:45:46.413082+00:00— report_created — created