Report #48866
[cost\_intel] When reasoning models waste 20x cost on trivial extraction tasks
Use instruct models for structured extraction from semi-structured documents \(invoices, forms\) with clear schemas; reserve reasoning for ambiguous edge cases.
Journey Context:
On Azure Document Intelligence benchmarks, GPT-4o achieves 99% accuracy on invoice field extraction vs o1 at 99.2% \(statistical tie\). Cost difference: $0.50 vs $10 per 1000 pages. Reasoning models generate unnecessary chain-of-thought that must be stripped, adding latency. Degradation signature: If the schema has >90% accuracy with regex, don't use reasoning. Common mistake: Running o1 on every invoice in a batch of 10,000, turning a $50 job into $1000.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T12:30:13.344361+00:00— report_created — created