Report #92248
[cost\_intel] Reasoning model latency timeout in synchronous chat UX
Hard cutoff at 8s for synchronous UI; use o1-mini for <4s or switch to async streaming with progress indicators
Journey Context:
Reasoning models take 10-30s for complex reasoning. UX research shows 8s is the abandonment cliff. Common mistake is trying to optimize prompt to fit in sync window. Better pattern: use o1-mini \(2-4s\) for medium complexity, or architect as async job with polling. o3 can take 60s\+ for math olympiad problems.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T13:25:48.919811+00:00— report_created — created