Report #71268
[gotcha] Streaming token-by-token creates a fluency bias that makes users more likely to accept incorrect AI outputs
For high-stakes or fact-critical outputs, consider delivering the complete response at once rather than streaming. If streaming is required, show uncertainty signals or confidence indicators before the content begins streaming.
Journey Context:
Cognitive psychology research describes the fluency bias (closely related to the illusory truth effect): information that is processed fluently, meaning it is easy to read and smoothly presented, tends to be judged as more truthful and credible. Streaming AI responses create a highly fluent experience: text appears smoothly, word by word, mimicking thoughtful real-time composition. This can make users more likely to accept hallucinated or incorrect outputs than when they receive the same text all at once and can evaluate the complete claim as a whole. The tradeoff is that streaming improves perceived responsiveness and user engagement metrics. The fix is to be selective: stream casual and creative content where fluency bias is largely harmless, but deliver fact-critical content (medical advice, legal analysis, financial recommendations) as complete responses, or at minimum prepend confidence or calibration signals before the streamed content begins.
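The selective approach described above could be sketched roughly as follows. This is a minimal illustration, not a reference implementation: the `deliver` function, the `FACT_CRITICAL` category labels, the `confidence_note` text, and the `force_stream` flag are all hypothetical names introduced here, and the sketch assumes the caller already knows which category a response belongs to.

```python
from typing import Iterable, Iterator

# Hypothetical category labels; a real system would need its own
# way of deciding which responses count as fact-critical.
FACT_CRITICAL = {"medical", "legal", "financial"}


def deliver(
    tokens: Iterable[str],
    category: str,
    confidence_note: str = "",
    force_stream: bool = False,
) -> Iterator[str]:
    """Yield the chunks that should be shown to the user, in order."""
    if category not in FACT_CRITICAL:
        # Casual or creative content: stream token by token as usual.
        yield from tokens
        return

    if force_stream:
        # Streaming is required: show the confidence note first,
        # then stream the content.
        if confidence_note:
            yield confidence_note + "\n"
        yield from tokens
        return

    # Fact-critical and streaming is optional: buffer everything and
    # deliver one complete response, with the note (if any) on top.
    body = "".join(tokens)
    prefix = confidence_note + "\n" if confidence_note else ""
    yield prefix + body
```

For example, `list(deliver(["a", "b"], "medical", "Unverified."))` yields a single chunk containing the note followed by the full text, while the same tokens with category `"creative"` are passed through one at a time. Buffering trades perceived responsiveness for the chance to read the whole claim at once, so a loading indicator may be worth pairing with it.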
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T02:12:18.619349+00:00— report_created — created