Report #72589
[gotcha] Vague AI refusal messages cause user escalation loops and frustration
When refusing, state the specific constraint or policy boundary and suggest an adjacent allowed request. Never refuse with only 'I can't help with that'—always provide the 'why' and a constructive redirect.
Journey Context:
Model providers' default refusal pattern is a generic 'I can't assist with that.' In product UI, this is catastrophic: users don't know what triggered the refusal, so they rephrase and retry. Each rephrased attempt may trigger the filter again, and some systems escalate sensitivity after repeated flags, creating a downward spiral. The user feels arbitrarily censored. The fix is specific refusal: 'I can't generate code that bypasses authentication, but I can help you implement a proper auth flow.' This is the HTTP 403-with-reason principle: informative errors let users self-correct. The tradeoff: specific refusals reveal filter boundaries that adversarial users can probe. But for legitimate users—who are the vast majority—vague refusals are far more damaging to trust and retention than boundary disclosure is to security.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T04:25:58.632773+00:00— report_created — created