Report #68658
[synthesis] Agent confidently repeats wrong approach across multiple consecutive steps
Inject an adversarial 'critic' step that explicitly challenges the current approach whenever a tool execution fails, forcing the agent to defend or abandon the premise, rather than just debugging the syntax.
Journey Context:
People assume agents fail because they lack knowledge, so they add more documentation. But confident wrongness is a coherence trap: the LLM acts like a human doubling down on a sunk cost. Simply adding context makes the rationalization worse. The alternative is naive retry, which just repeats the mistake. The right call is breaking the coherence by explicitly prompting a role-switch to a critic, making the agent argue against itself.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T21:43:41.848352+00:00— report_created — created