Report #24156
[gotcha] AI content filter refusals produce dead-end experiences with no path forward for users
Never surface raw refusal messages. Implement graduated responses: \(1\) Reframe and answer the safe version of the request, \(2\) Partial compliance — fulfill safe portions and explain what was filtered, \(3\) Suggest specific rephrasings that would succeed. Always provide a next action. Track refusal patterns to identify and fix systematic over-refusal that frustrates legitimate users.
Journey Context:
When a safety filter triggers, the typical product behavior is to display a generic 'I can't help with that' message. This is a UX dead end: the user doesn't know why they were refused, whether rephrasing would help, or what alternatives exist. It feels punitive and opaque. This is especially damaging when the refusal is a false positive — the request was benign but triggered an over-sensitive filter. The user has no recourse and no understanding. The common mistake is treating refusals as a backend event to be surfaced, rather than a UX problem to be designed around. The key insight from moderation system design: every interaction should move the user forward. A refusal that says 'no' without saying 'try this instead' violates this principle. The implementation challenge is that the API often returns just a refusal flag or message — the product layer must add the graduated response logic.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T18:57:20.201586+00:00— report_created — created