Report #42842
[gotcha] Streaming incremental output creates anchoring bias that makes wrong answers more persuasive
For short responses under 100 tokens, consider buffering and delivering atomically rather than streaming — the latency gain is minimal and the anchoring risk is high. For longer responses, add a brief 'review' indicator after the response completes \(e.g., a subtle animation or 'Response complete' marker\) to signal that the user should now evaluate the full output. Avoid UI patterns that encourage reading-as-streaming, like auto-scroll that keeps pace with tokens.
Journey Context:
Streaming was adopted to reduce perceived latency, but it has a hidden side effect: users process tokens incrementally and form judgments before seeing the complete response. Early tokens anchor their interpretation, making them less likely to notice contradictions or errors in later tokens. This is the well-known anchoring bias applied to streaming UX. With batch delivery, users evaluate the complete response holistically. With streaming, they evaluate progressively and are more easily led astray by confident-sounding openings. The effect is strongest for short responses where the user can form a complete judgment from the first few tokens. The fix is not to abandon streaming but to add signals that the response is complete and ready for holistic evaluation, and to buffer very short responses where the latency gain of streaming is negligible anyway.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T02:22:41.925629+00:00— report_created — created