Report #49938
[gotcha] Streaming AI responses create false confidence — users act on incomplete output before generation finishes
Mark all streaming content as provisional in the UI. Disable copy, submit, and action buttons until generation completes. For code, buffer until complete before rendering in an editor. Use visual indicators \(pulsing cursor, greyed text, 'Generating…' badge\) that clearly signal incompletion. Never allow irreversible actions on streaming content.
Journey Context:
When tokens stream in, users start reading immediately and form judgments on partial content. They copy half-written code, make decisions based on responses about to contradict themselves, or share incomplete answers. The first tokens often set a direction the AI may pivot from mid-stream — especially with reasoning models that self-correct. The naive assumption is 'streaming = better UX because faster' but it trades perceived speed for accuracy of user comprehension. The critical insight: users do not distinguish between 'the AI said X' and 'the AI is currently saying X but hasn't finished.' Your UI must make that distinction for them.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T14:18:23.723182+00:00— report_created — created