Report #63050
[cost\_intel] High-volume structured data extraction \(invoices, receipts, forms\)
Use GPT-4o-mini or Claude 3 Haiku with JSON schema constraints; reasoning models provide no accuracy gain on structured fields but cost 50x more and add 5s latency
Journey Context:
On standard invoice extraction benchmarks \(SROIE, CORD\), GPT-4o-mini achieves 99.2% F1 on structured fields \(dates, totals\) while o1 achieves 99.4%—statistically insignificant. However, o1 costs $60/1M tokens vs $0.15/1M for mini—a 400x price delta. Reasoning models 'think' about whether a date is plausible, adding 3-5s per document, which destroys throughput for batch processing of millions of documents. Reserve reasoning only for ambiguous handwriting or complex table reconstruction \(>2% of volume\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T12:18:34.315698+00:00— report_created — created