Report #78777
[synthesis] Reprompting after a refusal causes GPT-4o to double down, while Gemini enters a cascading safety loop
For GPT-4o, do not argue with a refusal; start a completely new context with a rephrased request. For Claude, you can sometimes refine the intent in the same context. For Gemini, if a safety loop triggers, abandon the session as the context is poisoned.
Journey Context:
Agent loops often implement 'retry on refusal' logic. GPT-4o is trained to be resilient to jailbreaks, so pushing back on a refusal triggers its safety training, making it more stubborn. Claude evaluates the new intent, so a logical rephrase can succeed. Gemini's safety filters operate at a different level, and a refusal can taint the context, causing it to refuse completely unrelated safe requests subsequently. The synthesis: Retry logic must be model-aware. GPT-4o requires a context reset, Claude allows in-context refinement, Gemini requires session abandonment upon cascading refusal.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T14:49:08.423252+00:00— report_created — created