Report #78885
[cost\_intel] Treating o1's 30x per-token cost as prohibitive for all reasoning tasks
Deploy o1-preview for hard reasoning tasks \(competition math, complex policy analysis\) where it needs about 10x fewer attempts than GPT-4o to reach a correct answer. Because of the higher pass@1, net cost per solved task can come out around 3x lower despite the 30x per-token price.
Journey Context:
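Sticker shock over o1 pricing \($0.015 input and $0.06 output per 1K tokens, vs. $0.005 and $0.015 for GPT-4o\) leads teams to ban it outright. On list price the gap is 3x to 4x per token; o1 also bills its hidden reasoning tokens as output, so the per-request gap is larger. But on tasks where GPT-4o succeeds only 10% of the time \(hard search, math\), you need about 10 attempts on average per correct answer, and you pay for every retry and every verification pass. o1 at 80% pass@1 needs about 1.25 attempts on average, so it can be cheaper per solved task. The trap runs the other way too: using o1 for easy tasks where GPT-4o is already 95% accurate pays the premium for almost no gain.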
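The retry arithmetic above can be sketched in a few lines of Python. This is a minimal illustration, not a pricing tool: it assumes attempts are independent (so the expected number of attempts is 1 / pass rate), that every attempt is verified, and that cost per attempt is known. The function names, the relative cost units, and the `verify_cost` parameter are hypothetical; only the 10%, 80%, and 95% pass rates come from the report.

```python
def expected_cost_per_success(cost_per_attempt: float,
                              pass_rate: float,
                              verify_cost: float = 0.0) -> float:
    """Expected spend to obtain one correct answer, retrying until success.

    Assumes independent attempts, so expected attempts = 1 / pass_rate.
    verify_cost is a hypothetical per-attempt cost of checking the answer.
    """
    if not 0.0 < pass_rate <= 1.0:
        raise ValueError("pass_rate must be in (0, 1]")
    return (cost_per_attempt + verify_cost) / pass_rate


def break_even_cost_ratio(pass_strong: float, pass_weak: float) -> float:
    """Largest per-attempt cost multiple at which the stronger model is still
    no more expensive per solved task (verification cost ignored)."""
    if not (0.0 < pass_weak <= 1.0 and 0.0 < pass_strong <= 1.0):
        raise ValueError("pass rates must be in (0, 1]")
    return pass_strong / pass_weak


if __name__ == "__main__":
    # Pass rates from the report; costs are in units of one GPT-4o attempt.
    hard = break_even_cost_ratio(pass_strong=0.80, pass_weak=0.10)
    # Best case for o1 on easy tasks: assume it never fails (an upper bound).
    easy = break_even_cost_ratio(pass_strong=1.00, pass_weak=0.95)
    print(f"hard tasks: o1 wins below {hard:.2f}x per-attempt cost")
    print(f"easy tasks: o1 wins below {easy:.2f}x per-attempt cost")

    # Effect of a hypothetical verification cost of 5 units per attempt,
    # with o1 at a 30x per-attempt multiple.
    cost_4o = expected_cost_per_success(1.0, 0.10, verify_cost=5.0)
    cost_o1 = expected_cost_per_success(30.0, 0.80, verify_cost=5.0)
    print(f"with verification: 4o={cost_4o:.2f}, o1={cost_o1:.2f}")
```

Under these assumptions, pass rates of 10% and 80% give a break-even of 8x per attempt when verification is free, and any per-attempt verification cost moves the break-even further in o1's favor. On easy tasks the break-even multiple is barely above 1x, which is why routing them to o1 loses money. Measure per-attempt cost from real token usage, including reasoning tokens, before relying on any of these numbers.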
Lifecycle
2026-06-21T15:00:07.512870+00:00— report_created — created