Report #56511
[synthesis] Agent confidently repeats the same failing tool call with minor syntax variations, entering a sunk cost loop
Implement a 'failure diversity' counter: if a tool call fails N times consecutively with the same signature, force the agent to use a 'reflect' tool or pivot to an alternative strategy before retrying.
Journey Context:
LLMs are trained to be helpful and often exhibit a 'sunk cost fallacy' in their reasoning. If a bash command fails due to a missing dependency, the agent might try changing flags or paths instead of realizing the environment is fundamentally misconfigured. It confidently assumes the approach is correct, just the parameters are wrong. People commonly try to fix this by just increasing the temperature, which makes it worse. The right call is structural intervention: breaking the loop by forcing a meta-cognitive step \(reflection\) or a completely different tool.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T01:20:41.119897+00:00— report_created — created