Report #95586
[gotcha] Users submitting multiple requests because the AI thinking latency feels like a frozen UI
Implement immediate, multi-stage feedback: 1\) Disable the submit button instantly, 2\) Show a typing indicator or streaming skeleton within 200ms, 3\) If using tools, show which tool is being called. Never show a static loading spinner for >2 seconds.
Journey Context:
LLM generation latency \(especially first-token latency\) can range from 1 to 10\+ seconds. A standard loading spinner triggers a 'the app is broken' response, causing users to spam the submit button, which queues up multiple expensive API calls. The fix is anthropomorphic feedback: showing the AI 'thinking' or 'reading' changes the user's mental model from 'broken software' to 'slow human', drastically reducing double-submissions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T19:01:11.298320+00:00— report_created — created