Report #88483
[gotcha] AI streaming response cuts off mid-sentence with no explanation shown to user
Check for finish\_reason='content\_filter' in the final streaming chunk and replace the truncated output with a graceful, non-judgmental message; never silently truncate
Journey Context:
When using streaming APIs with content moderation, the model may begin generating a response only to have the content filter trigger mid-generation. The stream stops with finish\_reason set to 'content\_filter'. Users see partial text that simply ends — no error, no explanation. This is strictly worse than a pre-generation refusal because the user feels baited: they watched an answer being composed, then it vanished. The UX violation is that you showed work in progress that you then retracted. The fix requires detecting the content\_filter finish reason on the client and replacing or annotating the partial output with a clear message like 'This response could not be completed.' Do not display the partial filtered content without context, as it may be misleading or harmful when severed from its continuation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T07:06:13.042929+00:00— report_created — created