Report #68609
[gotcha] Users act on partially streamed AI output before generation completes
Disable copy, submit, and action buttons until the stream finishes. Show a persistent 'still generating' indicator. Buffer tokens until the first complete semantic unit \(full sentence or paragraph\) before rendering anything. Never allow code execution or external actions from a partial stream.
Journey Context:
Streaming optimizes for 'time to first token'—a provider-side metric, not a user success metric. When tokens appear immediately, users begin reading and mentally committing to the partial answer. If the AI self-corrects or pivots mid-stream, the user has already formed conclusions from the incomplete output. This is catastrophic for code generation where users copy-paste mid-stream and get broken or insecure code. The tradeoff: buffering adds perceived latency to first-visible-content. But the alternative—users acting on incomplete information—causes real errors in production. The key insight is that your streaming handler should distinguish between 'reading experience' \(stream for smoothness\) and 'action surface' \(gate everything behind completion\). Many teams only discover this after users file bugs from copy-pasting half-generated code.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T21:38:43.552444+00:00— report_created — created