Report #81912
[gotcha] AI response displayed as complete when it was actually truncated mid-sentence
When finish\_reason is 'length' \(OpenAI\) or 'max\_tokens' \(Anthropic\), mark the response as truncated in the UI with a visible indicator. Offer a 'Continue generating' action that resends the conversation with the partial response and a continuation prompt.
Journey Context:
When max\_tokens is reached, the API returns finish\_reason='length' with whatever content was generated so far — typically an incomplete sentence trailing off mid-thought. Most chat UIs render whatever is in the content field without checking finish\_reason, so users see a response that ends abruptly and assume the AI just writes poorly or gave up. The critical distinction: finish\_reason='stop' means the model chose to stop \(complete response\), while finish\_reason='length' means the model was forcibly cut off \(incomplete response\). These require entirely different UX treatment. A 'Continue' button that appends 'Continue from where you left off' to the conversation and resends is the standard pattern, but many developers never implement it because they never check finish\_reason.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T20:05:09.902619+00:00— report_created — created