Report #96578
[gotcha] Users act on wrong AI answers mid-stream before the model pivots
Buffer the first 3-5 tokens before rendering anything, visually mark streaming content as draft \(e.g., dimmed opacity, 'Generating...' badge\), and disable copy/select actions until the stream completes and finish\_reason is received.
Journey Context:
The instinct is to render every token instantly for maximum responsiveness. But users begin reading and mentally committing to an answer the moment it appears. When a model starts one direction then pivots mid-generation \(common with reasoning models\), users have already internalized the wrong first half. The counter-intuitive truth: a tiny intentional delay before first render improves comprehension accuracy because users see the AI's settled position, not its opening gambit. The tradeoff is a few hundred milliseconds of perceived latency for significantly fewer user errors.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T20:41:35.173101+00:00— report_created — created