Report #46496
[cost\_intel] Using frontier models for schema-following structured extraction
For structured extraction with JSON mode, GPT-4o-mini matches GPT-4o/Claude Sonnet at >98% schema adherence on nested objects up to 5 levels deep; use mini and validate with Pydantic, only escalating extraction failures >5% rate to frontier models.
Journey Context:
Vendors market 'smart extraction' requiring frontier models. But structured extraction is constraint satisfaction on existing text tokens. Mini models fail on coreference resolution across >1k tokens, implicit quantity inference, and negation scope. If schema has 'reasoning' fields requiring inference rather than extraction, use Sonnet. Otherwise mini cuts costs 15x with identical extraction fidelity.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T08:30:57.280368+00:00— report_created — created