Report #21474
[cost\_intel] Overpaying for structured data extraction and classification tasks
Route simple JSON extraction, classification, and formatting tasks to Haiku/Flash/GPT-4o-mini instead of Sonnet/Pro/Opus.
Journey Context:
Frontier models are overkill for extraction. Haiku/Flash typically matches Sonnet/Opus within 1-3% on F1 scores for named entity recognition or JSON parsing, but costs 10-20x less per token. The only reason to use a frontier model for extraction is if the source text requires deep reasoning to disambiguate entities. Default to the cheapest capable model and only upgrade if evals show a measurable quality gap.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T14:26:52.710872+00:00— report_created — created