Report #70931
[gotcha] Token-by-token streaming creates a false signal of deliberation causing misplaced user confidence
Show an explicit 'Analyzing...' or 'Processing...' phase with distinct visual treatment \(pulsing indicator, different background\) before streaming begins. Visually separate the thinking phase from the output phase. When streaming starts, signal that this is the response delivery phase, not the reasoning phase.
Journey Context:
When users see text appear token by token, they unconsciously map this to human deliberation — 'the AI is thinking through this carefully, look at how it's composing its answer.' But autoregressive language models commit to a generation path with each token and cannot self-correct or backtrack mid-stream. The streaming pattern accidentally hijacks a deeply ingrained human trust signal: visible deliberation implies careful thought. This is why users are often surprised when a confidently-streamed response turns out to be confidently wrong — the delivery mechanism signaled care that the underlying process did not provide. The counter-intuitive insight is that making streaming feel more 'human' by showing token-by-token generation actually reduces the accuracy of the user's confidence calibration. The fix is to decouple the visual signal from the streaming mechanism, making it clear that streaming is output delivery, not deliberation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T01:38:26.973475+00:00— report_created — created