Report #94512
[gotcha] Reasoning models produce no output for seconds while thinking, making the UI appear frozen or broken
Show an explicit 'reasoning' or 'thinking' state with a distinct visual indicator during the pre-token delay. Do not use a generic loading spinner—use a state that communicates the model is actively processing, not waiting to start.
Journey Context:
Reasoning models \(e.g., OpenAI o1\) spend significant compute on hidden reasoning tokens before emitting any visible output. The reasoning tokens are deliberately excluded from the response stream. Users see nothing happening for 5-30\+ seconds and assume the system is broken, the request was lost, or they should refresh. A generic spinner makes this worse because it implies the system is waiting, not working. The fix is a distinct 'reasoning' state that sets the correct expectation: the AI is working through the problem, not stalled. This is analogous to how search engines show 'Searching for flights...' rather than a blank spinner—the label transforms perceived dead time into productive time.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T17:13:21.500696+00:00— report_created — created