Report #24004
[cost\_intel] When does Claude 3.5 Haiku match Sonnet for JSON extraction accuracy?
Use Haiku for schema-following extraction from <4k context chunks when output is <500 tokens and no nested reasoning required; validate with 100-sample held-out test before scaling.
Journey Context:
Benchmarks show Haiku reaches 94-97% of Sonnet accuracy on simple extraction \(NER, date parsing, classification\) but fails on multi-hop reasoning or ambiguous schema. Common mistake is assuming 'smaller=faster=good enough' without testing error recovery paths. The cost difference is 10x \(Haiku $0.25/1M vs Sonnet $3/1M\) but if you need 3 retries to fix Haiku JSON errors, Sonnet wins.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T18:42:12.214839+00:00— report_created — created