Report #60836
[gotcha] Users get stuck in AI refusal loops with no way to understand or escape
When displaying a refusal, never just show a generic 'I cannot help with that' message. Instead: (1) show the specific category of refusal if available (safety, policy, capability), (2) suggest concrete alternative phrasings or approaches, (3) provide a modify-prompt action that pre-fills an edited version, (4) if the refusal might be a false positive, offer a try-rephrasing option that adds context. Track consecutive refusals and after 2-3, offer to reset the conversation or escalate to human support.
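The steps above could be sketched roughly as follows. This is a minimal illustration, not a reference implementation: the names `RefusalCategory`, `RefusalView`, `RefusalTracker`, the action strings, and the message copy are all hypothetical, and it assumes the application already knows that a turn was refused and (optionally) which category applied.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List, Optional


class RefusalCategory(Enum):
    SAFETY = "safety"
    POLICY = "policy"
    CAPABILITY = "capability"
    UNKNOWN = "unknown"  # used when the backend gives no category


# Hypothetical user-facing copy; real wording is a product decision.
MESSAGES = {
    RefusalCategory.SAFETY: "This request was declined for safety reasons.",
    RefusalCategory.POLICY: "This request conflicts with a usage policy.",
    RefusalCategory.CAPABILITY: "The assistant is not able to do this task.",
    RefusalCategory.UNKNOWN: "The assistant declined this request.",
}


@dataclass
class RefusalView:
    """Everything the UI needs to render one refused turn."""
    category: RefusalCategory
    message: str
    actions: List[str]


@dataclass
class RefusalTracker:
    # The report suggests escalating after 2-3 consecutive refusals;
    # 2 is an assumed default.
    threshold: int = 2
    consecutive: int = 0

    def record_success(self) -> None:
        """Any answered turn breaks the refusal streak."""
        self.consecutive = 0

    def record_refusal(
        self,
        category: Optional[RefusalCategory] = None,
        possible_false_positive: bool = False,
    ) -> RefusalView:
        """Count the refusal and decide which recovery actions to offer."""
        self.consecutive += 1
        cat = category or RefusalCategory.UNKNOWN
        # Always offer alternatives and a pre-filled prompt editor.
        actions = ["suggest_alternatives", "modify_prompt"]
        if possible_false_positive:
            actions.append("rephrase_with_context")
        if self.consecutive >= self.threshold:
            actions += ["reset_conversation", "escalate_to_human"]
        return RefusalView(cat, MESSAGES[cat], actions)
```

A caller would invoke `record_refusal(...)` on each refused turn and `record_success()` on each answered one, then render the returned `actions` as buttons. Keeping the streak counter separate from the rendering logic makes the escalation threshold easy to tune.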
Journey Context:
When an AI refuses a request, the user's instinct is to retry, perhaps with a slight rephrasing. But if the underlying trigger is still present, the rephrased version is refused too, and each refusal increases frustration. The user then starts prompt hacking, trying increasingly convoluted phrasings to bypass the filter, which degrades the experience further. The core UX failure is treating refusals as terminal events rather than navigable ones. Good refusal UX is like good error UX: it should be specific, actionable, and recoverable. Anthropic's documentation specifically recommends handling refusals gracefully rather than just surfacing the raw refusal text, and distinguishing between different refusal categories so users can course-correct.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T08:35:53.266422+00:00— report_created — created