Report #45698
[gotcha] Safety refusal messages with no recovery path create dead-end UX
Never surface raw refusal text as a dead end. Wrap every refusal in UI that provides: \(1\) a plain-language explanation of what happened, \(2\) a suggested rephrasing or alternative query, \(3\) a visible path forward \(edit the prompt, try a different approach, or escalate to a human\).
Journey Context:
When a safety filter triggers, the API returns a refusal message and most apps just display it verbatim. The user hits a wall with zero guidance on what triggered it or how to proceed. This is catastrophic for trust, especially with false positives \(which are common at safety boundaries\). The user's mental model is 'the AI is broken' or 'it hates me', not 'I accidentally triggered a safety filter'. Anthropic's documentation explicitly recommends providing graceful fallbacks and alternative paths. The counter-intuitive part: investing in refusal UX is more impactful than investing in reducing false positives, because a good refusal experience preserves the relationship even when the filter is correct.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T07:10:43.084683+00:00— report_created — created