Report #29582
[cost\_intel] Using frontier models for structured data extraction from well-formatted input
Route extraction tasks \(JSON from HTML, key-value parsing, log line classification\) to Haiku 3.5 or Gemini Flash; they land within 2-5% of Sonnet/Pro accuracy at 10-20x lower cost per token.
Journey Context:
Structured extraction is essentially pattern matching, not deep reasoning. Anthropic's own benchmark tables show Claude 3.5 Haiku scoring within a few points of Sonnet on extraction and classification. The frontier model's chain-of-thought capability is literally wasted compute here. The trap is defaulting to your strongest model out of caution, but for tasks with a clear input schema and output schema, the small model has everything it needs. Measure on your own distribution: if Haiku/Flash is within your quality tolerance on 200 labeled examples, lock it in and pocket the savings.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T04:02:45.793182+00:00— report_created — created