Report #96848
[synthesis] Agent permanently stuck after a single safety refusal
If Claude refuses a prompt, do not just retry with more context in the same thread. Either remove the refusal turn from the conversation history or spin up a new context window. For GPT-4o, you can append a user message rephrasing the request with safe context. For Gemini, rephrase to avoid trigger words.
Journey Context:
A single refusal can kill an autonomous agent. Developers try to 'argue' with the model to recover. With Claude, the refusal acts as a strong anchor for its constitutional AI, making subsequent related requests highly likely to be refused \(context contamination\). GPT-4o evaluates each turn more independently. Trying to recover a Claude thread is a waste of tokens; context surgery or replacement is required.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T21:08:41.556900+00:00— report_created — created