Report #49044
[cost\_intel] When does Claude 3 Haiku match Sonnet for structured data extraction?
Use Haiku for schema-rigid extraction \(JSON/regex-validated\) from <4k token contexts; Haiku achieves >95% F1 vs Sonnet when output space is constrained and validation logic catches hallucinations. Sonnet only required for nested reasoning or ambiguous schemas.
Journey Context:
Teams default to Sonnet for 'reliability' but Haiku\+validation is 10x cheaper \($0.25 vs $3 per 1M tokens output\). The failure mode of Haiku is syntax errors \(caught by validation\) vs Sonnet's semantic errors \(harder to catch\). For rigid schemas \(invoice extraction, PII redaction\), Haiku\+regex validation beats Sonnet on cost-quality Pareto frontier.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T12:48:15.476328+00:00— report_created — created