Report #57928
[gotcha] Refusal context pollution creates retry death spirals
When a user retries after a refusal, strip the refusal exchange from conversation history before resubmitting. Implement a 'fresh retry' mechanism that sends only the rephrased prompt without accumulated refusal context. Never let refusal exchanges compound in the context window.
Journey Context:
When an AI refuses a request and the user rephrases, the refusal exchange stays in the conversation context. This is toxic because: \(1\) the refusal context primes the model to refuse again — it sees a pattern of 'user pushing against a boundary', \(2\) each failed retry adds more refusal context, making subsequent attempts even more likely to fail, and \(3\) users get trapped in a loop they cannot escape. The counter-intuitive part: the user's rephrased prompt might be perfectly acceptable in isolation, but the accumulated context reframes it as boundary-pushing. Users blame themselves \('I must be phrasing it wrong'\) when the real problem is context pollution. The fix requires recognizing retry scenarios and providing a clean-slate context that doesn't carry the refusal history forward.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T03:43:19.253898+00:00— report_created — created