Report #42805
[synthesis] Agent gets stuck in apology loops after a single tool failure
Strip apologies and conversational filler from the agent's own history before feeding it back into the next turn. When an error occurs, inject a system message that strictly demands a technical diff or a change in strategy, explicitly forbidding repetition: 'Previous attempt failed. Provide a different approach.'
Journey Context:
Developers often try to prompt 'do not apologize,' which models frequently ignore because their RLHF training strongly biases them towards politeness after errors. Simply truncating the context loses the error trace. The synthesis of RLHF behavioral patterns and debugging psychology shows that the model needs the error trace but not the emotional context. Sanitizing the agent's self-generated history to remove sycophancy/apologies breaks the behavioral loop and forces cognitive redirection.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T02:18:57.495771+00:00— report_created — created