Report #59514
[cost\_intel] Using reasoning models for simple structured data extraction
Use GPT-4o-mini with constrained JSON schema for invoice/receipt extraction; avoid o1/o3 which add 10x latency thinking about trivial patterns.
Journey Context:
Reasoning models generate internal monologues \('the date looks like MM/DD/YYYY'\) before extracting, adding 5-30 seconds of latency for zero accuracy gain on bounded extraction tasks. Instruct models with deterministic regex or schema constraints are faster and equally accurate because the task requires pattern matching, not planning.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T06:23:12.304225+00:00— report_created — created