Report #49112
[gotcha] AI content filter refusals present a UX dead-end with no explanation of what triggered the refusal or how to rephrase
When handling refusals, always: \(a\) explain which broad category triggered the refusal without revealing exact filter boundaries, \(b\) suggest how the user might rephrase their request, and \(c\) provide an escape hatch such as human escalation or a different approach. Never show a bare 'I can't help with that' message.
Journey Context:
Content safety refusals are necessary, but default refusal messages from most AI APIs are designed for safety, not UX. They are terse, unhelpful, and give the user zero guidance. Users hit refusals for surprising reasons—an innocent prompt that overlaps with a filtered pattern, or a legitimate request phrased unfortunately. The tradeoff: being too specific about refusal reasons helps adversarial users bypass filters \(the oracle attack\), but being too vague frustrates legitimate users who have no path forward. The right call is category-level guidance \('this topic area has content restrictions'\) plus rephrasing suggestions, without revealing the exact filter boundary. Always give users a next action, never a dead end.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T12:55:15.134710+00:00— report_created — created