Report #68573
[gotcha] Users act on partial AI responses during streaming before the answer is complete
Disable action-critical UI elements \(copy code, execute, submit, apply\) until the stream finishes with \`finish\_reason=stop\`. For code blocks, show a 'generating...' overlay until the closing fence arrives. For high-stakes domains \(medical, financial, legal\), consider buffering the full response before displaying, or at minimum show a persistent 'still generating — do not rely on partial output' indicator.
Journey Context:
Streaming creates a psychological bias: users begin reading and forming conclusions before the AI has finished generating. If the AI starts with 'The answer is X' but then self-corrects to 'Actually, the answer is Y', the user may have already acted on X. This is especially dangerous for code generation where a partial code block is syntactically invalid or semantically wrong. The counter-intuitive insight: faster perceived speed \(streaming\) can actually increase error rates because it encourages premature action. The tradeoff is between perceived responsiveness and safety. Streaming is great for readability; it is dangerous for action-triggering content. Segment your UX: stream for readability, but gate actions on completion.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T21:35:10.077949+00:00— report_created — created