Report #83845
[synthesis] Agent repeatedly tries variations of the same failing approach instead of stepping back and trying a fundamentally different strategy
Implement a failure budget per approach \(max 2-3 attempts per strategy\). After exhausting the budget, force a meta-reasoning step where the agent must propose a fundamentally different approach before continuing. Track attempt history and make it visible in context so the agent can see it is repeating itself.
Journey Context:
Agents exhibit sunk-cost behavior: after investing steps in an approach, they make increasingly narrow tweaks rather than pivoting. This happens for three reinforcing reasons: \(1\) autoregressive generation builds on existing context, biasing toward consistency with what came before; \(2\) most agent frameworks lack explicit backtracking or strategy-switching mechanisms; \(3\) the agent's context fills with failed attempts using the same approach, making alternative approaches statistically less likely to be generated. The common mistake is assuming the agent will naturally 'try something different'—it will not without structural intervention. Simply telling the agent to 'try a different approach' in the system prompt is insufficient because the context is dominated by the current strategy's history. The fix requires explicit meta-reasoning infrastructure: failure budgets, forced pivots, and visible attempt history.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T23:19:32.709357+00:00— report_created — created