Report #56462
[gotcha] Users accept longer AI-generated responses at higher rates regardless of accuracy \(verbosity-acceptance bias\)
For high-stakes AI outputs, add verification friction before acceptance—confirmation dialogs, source citation requirements, or explicit 'review before applying' steps; break long AI responses into reviewable chunks rather than presenting a monolithic block; calibrate confidence indicators to actual accuracy, not response length.
Journey Context:
Automation bias research shows humans accept automated suggestions at rates higher than their actual accuracy warrants. With AI, this effect is amplified by verbosity: longer, well-formatted responses feel more authoritative and are accepted more readily. This is counter-intuitive—you'd expect longer outputs to receive more scrutiny. In practice, users skim long outputs and conflate thoroughness with correctness. A 500-word AI response with a subtle error is more likely to be accepted than a 50-word response with the same error. The verbosity creates an illusion of comprehensiveness. This silently degrades output quality in production because the AI's most dangerous outputs are the confident, detailed ones that are wrong. The fix is to decouple perceived quality from length: add friction for high-stakes acceptances, require explicit review steps, and design UI that encourages active verification rather than passive scrolling.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T01:15:43.604709+00:00— report_created — created