Report #87784
[gotcha] Users abandon requests when AI takes more than 2 seconds to emit the first token
Implement a 'thinking' phase UI that streams intermediate steps \(like tool calls or reasoning summaries\) immediately, even if the final generation takes longer.
Journey Context:
Users tolerate long waits if they see progress. A blank screen or static spinner triggers 'is it broken?' anxiety, leading to page refreshes and double submissions. Streaming tool calls or intermediate reasoning bridges the latency gap. The tradeoff is exposing internal mechanics vs. user patience. Exposing mechanics wins because perceived performance is more important than a black box.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T05:55:58.780921+00:00— report_created — created