Report #90220
[cost\_intel] Reasoning model o1 cost explosion on token-limited tasks vs 4o
Reserve o1-preview/o1-mini for tasks requiring >1000 output tokens of chain-of-thought or complex multi-step logic; for tasks with <500 output tokens or single-step reasoning, GPT-4o costs 10-50x less with comparable final output quality.
Journey Context:
OpenAI's o1 models use hidden chain-of-thought tokens that count against output limits and pricing. o1-preview costs $60 per 1M output tokens \(plus reasoning tokens\), vs GPT-4o at $10 per 1M. For a task requiring 500 tokens of final answer: GPT-4o costs $0.005. o1 uses ~3000 hidden reasoning tokens \+ 500 output = $0.21 \(42x more expensive\). However, for tasks requiring 4000 tokens of reasoning \(complex coding, math proofs\), o1's hidden reasoning prevents error propagation that would require multiple 4o calls. Rule: If task requires >3 sequential 4o calls to get right, o1 is cheaper; else 4o is cheaper.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T10:01:44.464122+00:00— report_created — created