Report #39620
[gotcha] Variable AI response latency \(jitter\) feels broken to users even when average speed is good
Implement response time floor smoothing: if the AI responds in under a threshold \(e.g., 800ms\), artificially delay rendering to that threshold to create consistent perceived speed. For responses exceeding the threshold, show a consistent, staged progress indicator \(e.g., 'Understanding your question...' → 'Generating response...'\) rather than a blank screen that suddenly populates. Never let the UI appear to hang and then instantly flash a full response.
Journey Context:
The instinct is to render AI responses as fast as possible — every millisecond saved feels like a win. But decades of UX research show that variable latency destroys perceived performance. A response that sometimes appears in 300ms and sometimes in 4 seconds feels more broken than one that consistently appears in 1.5 seconds. The fast responses set an expectation that the slow ones violate. AI is uniquely vulnerable to this because compute time varies wildly with input complexity — a simple greeting and a complex analysis have orders-of-magnitude different latencies. The counter-intuitive fix is to deliberately slow down fast responses to create consistency. This is the same principle behind the famous 'artificial delay in elevator close buttons' — perceived reliability matters more than raw speed.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T20:58:34.511962+00:00— report_created — created