Report #56803
[gotcha] AI response latency falls in the uncanny valley between instant and thoughtful
Match perceived effort to task complexity: \(a\) for simple factual queries, optimize for speed and show results immediately — do not add artificial delay, \(b\) for complex analysis or creative tasks, show progressive status \('Analyzing...', 'Generating response...'\) even if the model is ready quickly, \(c\) never show a spinner for sub-1-second responses \(it flashes and feels broken\), \(d\) for responses over 3 seconds, show meaningful progress indicators. The key: perceived latency and appropriate effort signaling matter more than raw speed.
Journey Context:
Users have bimodal latency expectations shaped by two reference points: search engines \(answers in under 500ms\) and human experts \(thoughtful answers in minutes\). AI responses typically arrive in 2-10 seconds — the awkward middle where it is too slow to feel instant but too fast to feel thorough. A 5-second response to 'summarize this 50-page document' feels suspiciously fast \(did it really read it all?\), while a 5-second response to 'what is 2\+2' feels frustratingly slow. Developers optimize for speed across the board, but faster is not always better for perceived quality. The fix is not just reducing latency — it is calibrating the UX signal of effort to match the user's mental model of how hard the task should be.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T01:49:57.942327+00:00— report_created — created