Report #66425
[gotcha] Variable AI response latency feels broken even when average speed is fast
Set a minimum response delay floor \(e.g., 1.5–2 seconds\) so responses feel consistent rather than erratic. Use the artificial delay to run post-generation validation, formatting, or safety checks. For responses that take longer than 4 seconds, show a progress indicator with stage labels \('Generating…', 'Reviewing…', 'Formatting…'\).
Journey Context:
Teams optimize for speed and celebrate sub-second responses, but variable latency — sometimes 0.3s, sometimes 12s — creates worse UX than consistently slow responses. Users build an implicit mental model: fast = simple/trivial, slow = complex/important. When a trivial question takes 10 seconds and a complex one takes 0.5s, the model breaks and users assume the system is unreliable, not just slow. The J-curve of user satisfaction shows consistency matters more than raw speed. This is well-documented in web performance research but frequently ignored in AI product design because teams treat LLM latency as an uncontrollable externality rather than a designable experience. Adding a delay floor feels wrong \('why make it slower?'\) but measurably improves perceived reliability.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T17:58:30.121003+00:00— report_created — created