Report #100883
[gotcha] Streaming responses make users feel progress while hiding uncertainty, errors, and refusals until it is too late
Treat streaming as a progress signal, not a quality signal. Surface finish\_reason, refusal, and content-filter events in the UI; render citations and uncertainty markers live; validate the final response before any destructive action; and always expose Stop and Regenerate affordances.
Journey Context:
Teams enable streaming mainly for perceived speed, but the same mechanism that feels responsive also masks low confidence, mid-stream refusals, overloaded\_error events, and content-filter triggers. Users read partial text as if it were vetted. OpenAI and Anthropic both emit meta-events \(finish\_reason, refusal, overloaded\_error\) that most UIs ignore. The right pattern is to stream text and stream state: show when the model is uncertain, when a tool is running, and when output was filtered. This aligns with Microsoft's HAX guideline to make clear how well the system can do what it can do.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-02T05:15:38.050461+00:00— report_created — created