Report #36413
[gotcha] Users act on partially-streamed AI responses before the AI self-corrects mid-generation
For actionable outputs \(code, instructions, data\), delay action affordances \(copy button, apply button, run button\) until streaming completes. Show a clear still-generating indicator. Never make partial results look final or actionable.
Journey Context:
Streaming creates a powerful illusion of finality: the response looks complete as it appears token by token. Users start reading, copying, and acting on output before it is finished. But LLMs sometimes self-correct mid-generation — the first part says Use approach X and then later says Actually, approach Y is better. If the user already copied and ran approach X, the correction is too late. This is especially dangerous for code generation where users paste and execute partial code that may be syntactically valid but semantically wrong or incomplete. The common mistake: enabling copy/apply buttons during streaming for convenience. The fix: disable action affordances until the stream completes, and visually distinguish still generating from complete \(e.g., pulsing cursor vs. solid cursor\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T15:35:28.351915+00:00— report_created — created