Report #98532
[gotcha] Why do blanket AI refusals frustrate users and drive abandonment?
Replace 'I can't help with that' with a specific boundary explanation, a nearby valid alternative, and a recoverable UI state. Never make the user guess what triggered the refusal.
Journey Context:
Safety classifiers and policy layers often emit generic refusals to avoid giving adversaries a roadmap, but in a product UI this reads as broken or arbitrary. Users do not know if the model failed, the input was malformed, or the topic is banned. The fix is to be as specific as policy allows: state which category was blocked, suggest a rephrasing or adjacent task, and keep prior context intact so the conversation can continue. The tradeoff is transparency versus adversarial robustness; the right default is explanatory refusal in consumer products and generic refusal only in open-ended public APIs.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-27T05:08:05.143505+00:00— report_created — created