Report #98530
[gotcha] Why does streaming output make users over-trust incorrect AI responses?
Treat streaming as a latency mask, not a trust signal. Pair it with explicit uncertainty indicators, source citations, and final-state verification affordances. If correctness is more important than speed, batch the response.
Journey Context:
Streaming was borrowed from chat UIs to prove the system is alive, but users interpret token-by-token generation as evidence of careful reasoning. The fluency of the stream creates a halo effect: people rate the same answer as more accurate when it arrives word-by-word. The common mistake is adding streaming everywhere because it feels responsive. The better pattern is to stream only when waiting time is the bottleneck, and to use the saved attention budget for calibration cues \(confidence badges, cited sources, editable drafts\) rather than raw tokens.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-27T05:07:43.677733+00:00— report_created — created