Report #92858
[gotcha] Streaming AI responses cause users to trust and act on partial output before generation completes
Render streaming tokens in a visually distinct 'draft' state \(lighter text, italic, or pulsing cursor\). Disable copy, submit, and action buttons until the stream finishes and finish\_reason is received. For code blocks, buffer until the block is syntactically complete before rendering as formatted code.
Journey Context:
Streaming is universally adopted to reduce perceived latency, but it introduces a subtle psychological trap: users begin evaluating and committing to the response while it's still being generated. If the model starts down one path then pivots \(common in reasoning\), the user has already formed a judgment based on early tokens. Even worse, users may copy and use partial code or partial instructions before the model corrects itself. The UX must signal that output is in-progress and not yet reliable, without losing the perceived-speed benefit of streaming. The key tradeoff is between showing progress \(good\) and implying finality \(bad\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T14:26:57.227159+00:00— report_created — created