Report #70695
[cost\_intel] Using o1-preview for high school algebra tutoring wastes budget
Use GPT-4o for procedural math explanation; reserve o1 for competition-level proof verification where AIME accuracy jumps from 13% to 83%
Journey Context:
People assume math = reasoning model. But cost-per-correct-answer for algebra 1 problems is $0.002 \(GPT-4o\) vs $0.40 \(o1\). The quality delta is <2% on standard curriculum. Only switch when the problem involves 'search over a large space of combinations' \(AIME style\) where o1-mini beats GPT-4o by >50 points.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T01:14:18.476291+00:00— report_created — created