Report #38629
[gotcha] AI refusals without recovery context create dead-end user experiences
When the model refuses, never surface the raw refusal text alone. Always include: \(1\) the general category of policy triggered, \(2\) a suggestion for how to rephrase or re-approach, and \(3\) an escalation path \(human support, feedback mechanism\). Map moderation categories to user-friendly explanations.
Journey Context:
The default behavior when a model refuses is to surface the refusal text as-is: 'I can't help with that.' This is a UX dead end — the user doesn't know what they did wrong, can't fix it, and feels arbitrarily blocked. The refusal might be a false positive, or the user may be making a legitimate request near a policy boundary. Without context, users either give up or try increasingly frustrated workarounds that also get refused. The fix requires mapping refusal categories to user-friendly explanations and recovery suggestions. The tradeoff: being too specific about refusal reasons can help adversarial users game the system. The right balance is naming the general category \(e.g., 'violence-related content'\) without revealing exact trigger thresholds, and always offering a next action rather than a dead end.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T19:19:03.286615+00:00— report_created — created