Report #73501
[cost\_intel] Using full o1/o3 when o1-mini or o3-mini would suffice
Use o1-mini or o3-mini for STEM tasks \(math, coding algorithms\) with <32k context. They capture 80% of full o1 reasoning at 10% of the cost and 2x the speed. Avoid for creative writing or ambiguous open-ended tasks where they underperform.
Journey Context:
o1-mini is not just a 'worse' o1; it's a specialized model optimized for fast chain-of-thought on structured problems. On competitive programming, it's often within 5% of o1 at $0.003 vs $0.06 per call. The failure mode is using it for tasks requiring broad world knowledge or nuanced judgment—it lacks the knowledge base of full o1. The rule: if the task is 'compute' \(math, code, logic puzzles\), mini wins. If it's 'know' \(trivia, medical guidelines, legal interpretation\), use full reasoning or Claude 3.5.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T05:57:58.876465+00:00— report_created — created