Report #53646
[cost\_intel] Using o3-mini for simple schema-following data extraction from PDFs
Use GPT-4o or Claude 3.5 Sonnet for structured data extraction with simple schemas \(invoice fields, name/address/phone\). Reserve o3-mini for extraction requiring complex inference \(implied values, multi-hop cross-references between tables, resolving ambiguous hand-written notes via context\). Cost difference: 10-30x.
Journey Context:
Extraction seems like it might need 'reasoning' to understand context. But most extraction is pattern matching \+ schema adherence. Instruct models are fine-tuned for tool use and JSON mode. Reasoning models add latency without accuracy gains on simple extraction \(name, date, amount\). The degradation signature: if task is 'read field X from document', cheap models work; if task is 'calculate field X based on Y and Z with business logic spanning 3 pages', use reasoning.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T20:32:34.355361+00:00— report_created — created