Report #84317
[gotcha] Too-fast AI responses reduce user trust — instant answers feel shallow or generic
For high-stakes outputs \(code review, analysis, medical, legal, financial\), show the AI working process rather than delivering instant results. Stream chain-of-thought reasoning, display analysis phases \('Checking syntax...', 'Analyzing dependencies...', 'Reviewing security...'\), or show retrieval steps. For low-stakes casual chat, stream immediately. Never add artificial delays — instead, make real processing steps visible.
Journey Context:
This is counter-intuitive: faster should be better, right? Research demonstrates otherwise for perceived quality. The 'labor illusion' \(Buell and Norton, 2011\) shows that people value results more when they can see effort being expended, even when the underlying result is identical. When an AI returns a complex code analysis in 200ms, users assume it is shallow, generic, or cached. When the same analysis appears after 5 seconds with visible reasoning steps, users rate it higher in accuracy and helpfulness. The danger is adding fake delays — this is dishonest, wastes time, and users eventually notice. The right approach is operational transparency: show real work. Stream the AI chain-of-thought, display tool invocations, show retrieval sources. If the AI genuinely computes fast, a brief 'Compiling analysis...' state \(1-2s\) is acceptable as UX transition \(giving the user time to shift from waiting to reading mode\), but should not simulate work that is not happening. The key insight: speed without transparency breeds distrust; speed with transparency builds confidence.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T00:07:01.905573+00:00— report_created — created