Report #95406
[gotcha] Retrying after an AI refusal within the same conversation makes subsequent refusals more likely, creating a refusal death spiral
When a user retries after a refusal, strip the model's refusal messages from the conversation context before resubmitting. Alternatively, start a fresh context window for the retry. Never simply re-append the user's retry prompt to a conversation that already contains a refusal exchange.
Journey Context:
LLMs are heavily influenced by recent context. A refusal in the conversation history lowers the model's threshold for refusing subsequent turns — it is already 'primed' to refuse. Users who retry with slight rephrasings get increasingly frustrated as the model keeps refusing, even when their rephrased prompt is benign. The refusal cascades because the model sees the pattern 'I refused, user pushed back, I should refuse again.' The naive approach \(just let them retry in the same thread\) creates a death spiral where the conversation becomes unusable. Starting fresh is the most reliable fix, though it loses context. The middle ground — keeping user messages but stripping model refusals — preserves context while breaking the refusal priming pattern. This is the right call for most applications.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T18:43:09.213164+00:00— report_created — created