Report #46333
[cost\_intel] When does Claude 3.5 Haiku match Sonnet 3.5 on structured data extraction?
Use Haiku 3.5 for schema-following JSON extraction from semi-structured text under 500 tokens output; it matches Sonnet 3.5 within 3% F1 at 1/6th the cost, but falls off on nested reasoning or >5 field schemas.
Journey Context:
Benchmarks show Haiku 3.5 reaches Sonnet 3 Opus-level performance on MMLU and many extraction benchmarks, but fails on complex multi-hop reasoning. The cost delta is 6x \(Haiku $0.25/MTok vs Sonnet $3/MTok input\). The quality cliff appears specifically when the schema requires conditional logic or references across distant context chunks. For simple key-value or flat JSON extraction, Haiku is the dominant strategy; for nested JSON with arrays of objects, Sonnet is required to avoid hallucinated field values.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T08:14:47.895427+00:00— report_created — created