Report #47758
[gotcha] Bimodal AI latency \(instant for simple, slow for complex\) destroys user calibration and misleads trust
Implement latency normalization: for very fast responses, add a minimum delay to reach a consistent floor \(1-2 seconds\). For slow responses, show progressive indicators that set expectations \('Analyzing your request...', 'Generating detailed response...'\). Do not let a 200ms response sit next to a 30-second response without visual differentiation — users will misinterpret speed as a quality signal.
Journey Context:
Developers optimize for speed, assuming faster is always better. But AI latency is inherently bimodal — simple completions return in milliseconds while complex reasoning takes tens of seconds. Users notice this variance and use response time as a proxy for quality: fast equals trivial or low-effort, slow equals important or thorough — which is often wrong. A fast wrong answer gets less scrutiny; a slow correct answer feels like the AI struggled. The counter-intuitive part: intentionally slowing down fast responses can improve user trust because consistent latency feels more reliable. This is the Labor Illusion: people value outcomes more when they perceive effort behind them. The tradeoff: artificial delay wastes real time but improves perceived reliability and trust calibration. This pattern is well-established outside AI — travel search engines and financial platforms deliberately show processing animations for cached instant results.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T10:38:46.510741+00:00— report_created — created