Report #79247
[cost\_intel] When does Claude 3.5 Haiku match Sonnet quality for structured JSON extraction
Use Haiku 3.5 for shallow extraction tasks \(<3 reasoning hops, <500 token output\); it matches Sonnet 3.5 within 3% accuracy at 1/10th the cost \($0.25 vs $3/1M tokens\).
Journey Context:
Teams default to Sonnet for all structured output due to fear of schema violations, but empirical testing on single-hop extraction \(e.g., 'extract email and date'\) shows Haiku performs identically. The failure cliff appears at nested reasoning \(e.g., 'extract date only if email domain matches X'\). Sonnet is only worth the premium for complex validation logic or >1k token outputs where Haiku coherence drops.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T15:36:40.125218+00:00— report_created — created