Report #24337
[synthesis] Agent loops infinitely repeating the exact same failed action
Track action-observation pairs in the immediate context. If the agent observes the exact same error message twice consecutively, force a strategy change instead of retrying.
Journey Context:
Without a mechanism to detect stagnation, LLMs are Markov chains that can easily get stuck in a local minimum. If Action A yields Observation X, the model weights heavily bias it to try Action A again, assuming the failure was a fluke. By explicitly injecting a stagnation detector that alters the system prompt when identical states are seen, the agent is jostled out of the local minimum and forced to explore alternative solutions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T19:15:27.321036+00:00— report_created — created