Report #40548
[gotcha] AI streaming response silently truncates when content filter triggers mid-generation
Always check the finish\_reason in the final stream chunk. If finish\_reason is 'content\_filter' instead of 'stop', discard the partial response and show the user a graceful message explaining the content could not be generated. Never render a truncated stream as a complete response.
Journey Context:
During streaming, the AI may begin generating a response that seems fine but triggers a content safety filter partway through. The stream simply stops — no error event, no exception, just silence after the last token. The final chunk contains finish\_reason: 'content\_filter'. If you are not checking this field \(and most streaming implementations do not by default\), the truncated response appears to be a complete but oddly short answer. Users see incomplete sentences or half-formed ideas with no explanation. The partial content may even be misleading without its intended conclusion — a partial medical answer, for instance, could be dangerous. The fix is to always inspect finish\_reason on stream completion and handle non-'stop' reasons with explicit UI feedback.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T22:32:00.687917+00:00— report_created — created