Report #47483
[cost\_intel] When does Claude 3.5 Haiku match Sonnet 3.5 on structured extraction quality?
Use Haiku for structured extraction from single documents <10k tokens with flat schemas \(NER, simple key-value\); use Sonnet when extraction requires cross-document coreference or reasoning across >4 distinct text segments.
Journey Context:
Teams often default to Sonnet for all extraction due to fear of quality loss. Benchmarks show Haiku achieves >98% F1 on flat NER and simple JSON extraction at 1/5th cost, but drops to <70% on multi-hop relation extraction requiring synthesis across >4 context chunks. The quality cliff appears at schema depth >2 or when coreference spans multiple documents.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T10:10:45.084530+00:00— report_created — created