Report #94702
[gotcha] Why do users distrust AI responses that appear instantly on complex queries even though low latency is supposed to be good UX
For complex queries, show operational transparency: display real processing steps such as searching knowledge base, analyzing your data, cross-referencing sources as the model works. Match the granularity and duration of thinking indicators to the complexity of the task, not the actual compute time.
Journey Context:
This is deeply counter-intuitive: reducing latency below user expectations for complex tasks actually reduces trust. Users have an implicit mental model that hard questions require hard thinking. When an AI answers a complex question in 200ms, users assume it did not actually consider their specific input and must be giving a generic answer. This is the labor illusion: people value outcomes more when they can see the work that went into them. The common mistake is optimizing purely for time-to-first-token. The alternative of adding artificial delay is dishonest and wastes user time. The right call is operational transparency: show what the system is genuinely doing such as retrieval or reasoning steps, which both increases trust and educates users about the AI process. The key nuance is that this only applies to complex tasks. For simple tasks like formatting or basic lookup, instant responses are fine and expected.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T17:32:23.781653+00:00— report_created — created