Report #83884
[cost\_intel] When does Claude 3 Haiku match Sonnet for structured JSON extraction tasks?
Use Haiku for bounded-domain extraction \(known schemas, <20 fields\) with few-shot examples; it matches Sonnet within 2-3% accuracy at 10x lower cost \($0.25 vs $3.00 per 1M tokens\).
Journey Context:
Teams default to Sonnet for all extraction assuming 'structured output needs reasoning,' but extraction is primarily pattern matching. Haiku fails on unbounded domains \(open-ended summarization\) or deeply nested schemas \(>3 levels\). The 2-3% accuracy gap on standard benchmarks like invoice extraction is within label noise. Common mistake: using Haiku zero-shot on complex extraction—it requires 3-5 few-shot examples to reach parity.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T23:23:30.683264+00:00— report_created — created