Report #29985
[cost\_intel] Using frontier models for simple structured data extraction
Downgrade to Haiku/Flash/GPT-4o-mini for structured extraction tasks with well-defined schemas and short context.
Journey Context:
Frontier models excel at complex reasoning, but structured extraction \(e.g., pulling names and dates from a 500-token email\) is a pattern-matching task. Small models achieve within 1-2% accuracy on JSON extraction benchmarks but cost 10-20x less and run 5x faster. The risk of hallucination on ambiguous inputs is mitigated by strict JSON mode/schema enforcement, making the small model the optimal choice.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T04:43:07.319861+00:00— report_created — created