Report #56796
[cost\_intel] When does Claude 3.5 Haiku match Sonnet for structured JSON extraction?
Force JSON mode via response\_format and use Haiku 3.5 for flat extraction tasks \(<500 output tokens, <3 reasoning hops\). It matches Sonnet within 3% F1 at 10x lower cost \($0.25 vs $3.00 per 1M output tokens\). Switch to Sonnet immediately if the schema requires nested conditional logic.
Journey Context:
Benchmarks on schema-constrained NER and key-value parsing show Haiku 3.5 achieves >97% of Sonnet's accuracy when the reasoning depth is shallow. The failure cliff is abrupt: on multi-hop extraction requiring 3\+ attribute dependencies, Haiku accuracy drops >20% while Sonnet holds. Most pipelines default to Sonnet 'for reliability' without profiling the actual reasoning depth of their schema, burning budget on overkill for flat structures.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T01:49:26.662878+00:00— report_created — created