Report #30207
[gotcha] AI extended thinking phase shows generic loading spinner, users think app is frozen
When using models with extended thinking, show a distinct 'Thinking...' or 'Reasoning...' indicator with a subtle animation during the thinking phase, visually separate from the generic loading spinner. Once thinking completes and output tokens begin, switch to the streaming text display. If thinking tokens are available, consider streaming them into a collapsible section so users can see the model is actively working.
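One way to sketch the phase handling described above is a small reducer that maps stream events to a UI phase. The event names (`request_sent`, `thinking_delta`, `text_delta`, `done`) and the `UiState` shape are hypothetical and chosen for illustration; real provider SDKs name and structure their events differently, so treat this as a sketch under those assumptions rather than a reference implementation.

```typescript
// Hypothetical event shape (an assumption); adapt to whatever your SDK emits.
type StreamEvent =
  | { kind: "request_sent" }
  | { kind: "thinking_delta"; text: string }
  | { kind: "text_delta"; text: string }
  | { kind: "done" };

type Phase = "idle" | "thinking" | "streaming" | "done";

interface UiState {
  phase: Phase;
  thinking: string; // accumulated thinking text for the collapsible section
  output: string;   // accumulated visible answer text
}

const initialState: UiState = { phase: "idle", thinking: "", output: "" };

// Pure reducer: each event yields the next UI state.
function reduce(state: UiState, event: StreamEvent): UiState {
  switch (event.kind) {
    case "request_sent":
      // Enter the thinking phase immediately, before any tokens arrive.
      return { phase: "thinking", thinking: "", output: "" };
    case "thinking_delta":
      return { ...state, phase: "thinking", thinking: state.thinking + event.text };
    case "text_delta":
      // The first output token moves the UI to the streaming text display.
      return { ...state, phase: "streaming", output: state.output + event.text };
    case "done":
      return { ...state, phase: "done" };
  }
}

// Label for the dedicated indicator; null means no indicator is shown.
function indicatorLabel(state: UiState): string | null {
  return state.phase === "thinking" ? "Thinking..." : null;
}

// The collapsible section only makes sense when thinking text was received.
function hasThinkingToShow(state: UiState): boolean {
  return state.thinking.length > 0;
}
```

Because the reducer enters `thinking` as soon as the request is sent, the indicator appears even when the provider streams no thinking tokens at all; in that case `hasThinkingToShow` stays false and the collapsible section can simply be omitted.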
Journey Context:
Models with extended thinking can spend 10-30+ seconds in a thinking phase before generating any output tokens. During this time, no streaming content is available. A traditional loading spinner signals 'the app is fetching data,' which makes 15 seconds of silence feel broken — users refresh, abandon, or lose trust. The fix is a distinct visual state that communicates 'the AI is actively reasoning.' The psychological difference between 'waiting for a server' and 'watching an AI think' is significant: the former feels like a bug, the latter feels like a feature. Streaming thinking tokens (when available) in a collapsible section is the gold standard because it provides proof of work.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T05:05:16.545941+00:00— report_created — created