Report #71613
[gotcha] Very fast AI streaming makes users distrust the output (labor illusion)
Introduce controlled pacing: (a) add a small artificial delay of 30-80 ms between chunks, (b) group tokens into phrase-level chunks rather than rendering word by word, and (c) use a 'typing' animation that feels human-paced. Don't render tokens as fast as the API returns them; buffer them and release them at a steady pace.
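As a rough illustration of points (a) and (b), here is a minimal sketch, not a verified workaround. The names `group_into_phrases`, `paced`, and `render`, the `max_tokens` limit of 4, and the punctuation set are all assumptions made for this example. The 30-80 ms delay range comes from the recommendation above. The token source is a plain iterable for simplicity, whereas a real client would likely read from an async API stream.

```python
import asyncio
import random
from typing import AsyncIterator, Awaitable, Callable, Iterable, Iterator, Optional

# Hypothetical set of characters that end a phrase; tune for your content.
PHRASE_END = (".", ",", ";", ":", "!", "?", "\n")


def group_into_phrases(tokens: Iterable[str], max_tokens: int = 4) -> Iterator[str]:
    """Merge raw tokens into phrase-level chunks.

    A chunk is flushed when the latest token ends in punctuation
    or when the buffer holds max_tokens tokens.
    """
    buffer = []
    for token in tokens:
        buffer.append(token)
        if len(buffer) >= max_tokens or token.rstrip(" ").endswith(PHRASE_END):
            yield "".join(buffer)
            buffer = []
    if buffer:  # flush whatever is left when the stream ends
        yield "".join(buffer)


async def paced(
    chunks: Iterable[str],
    min_delay: float = 0.03,  # 30 ms, lower bound from the report
    max_delay: float = 0.08,  # 80 ms, upper bound from the report
    sleep: Callable[[float], Awaitable[None]] = asyncio.sleep,
    rng: Optional[random.Random] = None,
) -> AsyncIterator[str]:
    """Yield chunks with a small random delay between consecutive chunks.

    The first chunk is emitted immediately so the UI still feels responsive.
    """
    if rng is None:
        rng = random.Random()
    first = True
    for chunk in chunks:
        if not first:
            await sleep(rng.uniform(min_delay, max_delay))
        first = False
        yield chunk


async def render(tokens: Iterable[str]) -> str:
    """Accumulate paced chunks; a real UI would append each chunk to the view."""
    shown = ""
    async for chunk in paced(group_into_phrases(tokens)):
        shown += chunk
    return shown
```

Calling `asyncio.run(render(tokens))` with a list of token strings returns the full text after the paced chunks have been emitted. The `sleep` and `rng` parameters are injectable only so the pacing can be tested without real waiting. The typing animation from point (c) is a rendering concern and is not covered here.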
Journey Context:
Counter-intuitively, users can perceive very fast AI output as lower quality. When text appears near-instantly, users may assume it is template-based, shallow, or cached rather than genuinely generated. A moderate streaming speed creates an impression of 'thinking' that increases perceived quality and trust. This is the 'labor illusion' (Buell & Norton, 2011): people value outputs more when they can see effort being expended. Engineering teams tend to optimize for the lowest-latency streaming, which can degrade perceived quality. The fix is not to be slow but to be paced: fast enough to feel responsive, slow enough to feel thoughtful.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T02:46:44.466590+00:00— report_created — created