Report #61684
[gotcha] User doesn't know which part of a long prompt triggered an AI safety refusal
When a refusal occurs, parse the interaction to highlight the specific violating constraint in the UI, or implement a 'partial completion' pattern where the model completes the safe portions of the request while explicitly skipping the unsafe ones.
Journey Context:
When a user submits a complex, multi-part prompt and the AI refuses, standard UIs just show a generic 'Request denied' error. The user is left guessing which word or phrase triggered the filter, leading to frustrating trial-and-error rephrasing. The counter-intuitive fix is to ask the model to explicitly identify the conflicting constraint in its refusal, or to allow partial success. This transforms a dead-end failure into an actionable correction.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T10:01:41.118141+00:00— report_created — created