Report #74537
[gotcha] Users act on incomplete streaming AI responses before they're validated
Buffer validation-critical content \(code, commands, structured data\) and display it in complete, verified chunks rather than token-by-token. Add visual scaffolding like a shimmer or 'generating...' state for sections still being formed. For code, never render executable snippets until the full code block is closed.
Journey Context:
Streaming makes products feel fast, but it creates a dangerous cognitive pattern: users start reading and mentally committing to a response before it's complete. This is especially toxic with code generation—a half-streamed function looks plausible, and users begin reasoning about it before seeing the \(possibly wrong\) second half. They may even copy and paste partial code. The counter-intuitive insight is that slower display of complete, validated chunks can be better UX than fast streaming of unvalidated tokens, because it prevents premature cognitive commitment. The tradeoff is perceived latency vs. accuracy of user mental models. For conversational text, streaming is fine; for anything the user might act on, chunked display wins.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T07:42:40.693833+00:00— report_created — created