Report #50295
[gotcha] Reasoning model thinking delay appears as a frozen or broken app
During the pre-token delay, show an active, contextual 'reasoning' indicator (e.g., animated steps such as 'Analyzing...', 'Working through the problem...') rather than a generic loading spinner. If the API exposes reasoning tokens, stream them into a collapsible section so users can see progress.
Journey Context:
Reasoning models like OpenAI o1 can spend 10–60+ seconds 'thinking' before emitting any output tokens. During this time, a generic loading spinner (or no indicator at all) leads users to assume the app has frozen, their request was lost, or the server died. They refresh, re-submit, or abandon. The fix is operational transparency: show that work is actively happening. This mirrors the well-established UX finding that showing progress steps (even approximate ones) reduces perceived wait time and abandonment. If the API exposes reasoning/thinking tokens, streaming them provides both a progress signal and trust-building transparency. A static spinner during a 30-second silence destroys trust; an animated reasoning indicator builds it.
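A minimal sketch of the indicator logic, assuming the model output arrives as an async iterator of text chunks. The names `STATUS_STEPS`, `status_for`, and `with_reasoning_indicator` are hypothetical and not part of any vendor SDK, and the message wording and timings are placeholders. The wrapper emits rotating status events until the first token arrives, then passes tokens through unchanged:

```python
import asyncio
from typing import AsyncIterator, Optional, Tuple

# Hypothetical status messages shown while no output token has arrived yet.
STATUS_STEPS = [
    "Analyzing the request...",
    "Working through the problem...",
    "Drafting a response...",
]


def status_for(elapsed_s: float, step_s: float = 5.0) -> str:
    """Pick a status message from elapsed time; stays on the last step."""
    index = min(int(elapsed_s // step_s), len(STATUS_STEPS) - 1)
    return STATUS_STEPS[index]


async def with_reasoning_indicator(
    tokens: AsyncIterator[str],
    tick_s: float = 1.0,
    step_s: float = 5.0,
) -> AsyncIterator[Tuple[str, str]]:
    """Yield ("status", msg) events until the first token, then ("token", text)."""
    loop = asyncio.get_running_loop()
    started = loop.time()
    iterator = tokens.__aiter__()
    # Request the first token in the background so we can keep ticking.
    first_token = asyncio.ensure_future(iterator.__anext__())
    last_status: Optional[str] = None

    # Phase 1: pre-token delay. Emit a status whenever the message changes.
    while not first_token.done():
        status = status_for(loop.time() - started, step_s)
        if status != last_status:
            last_status = status
            yield ("status", status)
        await asyncio.wait({first_token}, timeout=tick_s)

    # Phase 2: tokens are flowing; the UI should hide the indicator now.
    try:
        yield ("token", first_token.result())
    except StopAsyncIteration:
        return  # The stream ended without producing any token.
    async for token in iterator:
        yield ("token", token)
```

A UI layer would render `status` events in the indicator and append `token` events to the response. If the API also exposes reasoning tokens, a third event kind could feed the collapsible section. This sketch does not cancel the pending request when the consumer stops early; real code would need that.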
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T14:54:25.508642+00:00— report_created — created