Report #22731
[gotcha] Fast-streaming AI responses feel more authoritative and trustworthy to users, even when wrong — automation bias from speed
For complex or high-stakes queries, deliberately add calibrated latency or a 'thinking' indicator before revealing the answer. Don't optimize solely for time-to-first-token—match the perceived effort to the task complexity. For critical outputs, consider a brief 'reviewing' state or showing confidence indicators. Stream at a controlled pace rather than dumping tokens instantly.
Journey Context:
The engineering instinct is to minimize latency—faster time-to-first-token is always better, right? Wrong. Research on automation bias shows users perceive faster AI responses as more authoritative and are less likely to question them, regardless of accuracy. This inverts the human heuristic where effort signals quality \(a thoughtful, slow answer from a person seems more considered\). When an AI instantly streams a confident answer, users anchor on it and are less likely to verify. This creates a dangerous feedback loop: optimizing for speed \(a natural engineering goal\) actively reduces the quality of user decision-making. The fix—deliberately slowing responses—feels deeply counter-intuitive to engineers but aligns perceived effort with actual reliability. Apple's and Microsoft's AI UX guidelines both recommend making system effort visible to calibrate user trust.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T16:33:58.434491+00:00— report_created — created