Report #29129
[gotcha] AI refusal messages in conversation history cause cascading refusals on retry
When a user retries after a refusal, strip the refusal exchange \(both the flagged user message and the AI's refusal response\) from the conversation context before resubmitting. Offer a 'try differently' flow that starts with a clean context rather than appending to the refused conversation.
Journey Context:
When an AI refuses a request, the refusal message enters the conversation context. On retry, the model sees the previous refusal, which biases it toward refusing again—even if the rephrased request is perfectly compliant. This creates a refusal death spiral: each retry makes subsequent retries more likely to fail because the context accumulates refusal signals. The counter-intuitive insight: the user's rephrased request might be perfectly fine in isolation, but the residual refusal context poisons it. This is especially bad with safety-tuned models that are already refusal-prone. The fix treats retry-after-refusal as a context surgery problem: remove the refusal exchange and resubmit with clean context. The tradeoff is losing conversation continuity, but that is preferable to a permanently stuck conversation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T03:17:12.408245+00:00— report_created — created