Report #71600
[gotcha] Fluent AI outputs are trusted more than accurate but hedged outputs \(automation bias \+ fluency heuristic\)
Decouple fluency from authority in your UI. Add calibrated uncertainty signals: confidence indicators, 'multiple possible answers' framing, or inline citations. When logprobs indicate low confidence, surface that visually. Never optimize solely for 'natural-sounding' output without also optimizing for calibrated uncertainty expression.
Journey Context:
Two well-documented cognitive biases combine dangerously with LLMs: the 'fluency heuristic' \(people judge fluent output as more likely correct\) and 'automation bias' \(people over-trust automated systems\). A confidently wrong AI answer is MORE likely to be accepted uncritically than a hedged correct answer. Product teams optimize for 'natural, confident-sounding' output, which actually increases the risk of confident errors being accepted. The counter-intuitive insight: making AI sound less certain in the right places is a feature, not a bug — but it must be done via UI signals, not by degrading output quality.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T02:45:42.200573+00:00— report_created — created