Report #43954
[gotcha] AI response latency variance creates more user anxiety than consistent slowness
Implement response time smoothing: if the AI response arrives in under 1 second, hold it briefly before displaying (add a minimum delay floor). If the response takes over 5 seconds, show progressive loading states with specific status messages ('Analyzing your request...', 'Generating response...'). Target consistent perceived latency over fast average latency.
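A minimal sketch of the smoothing described above, assuming an asyncio-based client. The names `smooth_response`, `fetch`, `show_status`, and `status_interval` are hypothetical and not part of any library. The 1 second floor and 5 second threshold come from the recommendation, while the 3 second gap between status messages is an assumption.

```python
import asyncio
from typing import Awaitable, Callable, Sequence


async def smooth_response(
    fetch: Callable[[], Awaitable[str]],
    show_status: Callable[[str], None],
    min_delay: float = 1.0,        # delay floor for fast responses
    slow_after: float = 5.0,       # when to start showing status messages
    status_interval: float = 3.0,  # assumed gap between status messages
    slow_statuses: Sequence[str] = (
        "Analyzing your request...",
        "Generating response...",
    ),
) -> str:
    loop = asyncio.get_running_loop()
    start = loop.time()
    task = asyncio.ensure_future(fetch())

    # Give the request up to `slow_after` seconds before showing anything.
    await asyncio.wait({task}, timeout=slow_after)

    # Slow path: step through the status messages until the response lands.
    for message in slow_statuses:
        if task.done():
            break
        show_status(message)
        await asyncio.wait({task}, timeout=status_interval)

    result = await task  # re-raises any exception from fetch()

    # Fast path: hold the response until the delay floor has elapsed.
    remaining = min_delay - (loop.time() - start)
    if remaining > 0:
        await asyncio.sleep(remaining)
    return result
```

A caller would pass its own request coroutine and a UI callback, for example `await smooth_response(call_model, status_label.set_text)`, where both arguments are placeholders for whatever the application provides. Whether a 1 second floor suits a given product is a judgment call to validate with users.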
Journey Context:
Teams measure average response time and optimize for it. But users don't experience averages; they experience individual interactions. A system that responds in 0.5s 80% of the time and 10s 20% of the time feels LESS reliable than one that always responds in 3s, even though its average (2.4s) is lower. The variance creates anxiety: every submission becomes a gamble. 'Will this be fast or will it hang?' This is especially bad with AI because response time variance is inherently high (simple queries are fast, complex ones slow) and unpredictable (the user can't tell which queries will be slow). The counter-intuitive fix: adding artificial delay to fast responses creates a consistent experience and improves perceived reliability. The tradeoff is between raw speed and perceived reliability. Teams almost always pick the wrong side of it because latency dashboards show averages, not variance.
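The gap between what an average shows and what users experience can be checked with the numbers from the example above. This sketch builds 100 simulated interactions for each system and compares mean, standard deviation, and 95th percentile. The sample size and the simple index-based percentile are assumptions made for illustration.

```python
import statistics

# 100 simulated interactions per system, using the percentages from the text.
bimodal = [0.5] * 80 + [10.0] * 20   # 0.5s 80% of the time, 10s 20% of the time
steady = [3.0] * 100                 # always 3s


def summarize(latencies):
    ordered = sorted(latencies)
    # Simple index-based percentile; real dashboards may interpolate differently.
    p95 = ordered[int(0.95 * (len(ordered) - 1))]
    return {
        "mean": statistics.fmean(latencies),
        "stdev": statistics.pstdev(latencies),
        "p95": p95,
    }


print("bimodal:", summarize(bimodal))  # mean 2.4, stdev about 3.8, p95 10.0
print("steady: ", summarize(steady))   # mean 3.0, stdev 0.0, p95 3.0
```

The bimodal system wins on the mean (2.4s against 3.0s) while its spread and tail are far worse. A dashboard that tracks only the mean would rank the two systems in the opposite order from how users perceive them.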
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T04:14:58.733350+00:00— report_created — created