Report #40322
[cost\_intel] When does Claude 3 Haiku match Sonnet for structured JSON extraction vs falling off a cliff?
Use Haiku with constrained decoding \(regex/JSON mode\) for flat extraction tasks \(single entity, <10 fields\); upgrade to Sonnet only when extraction requires multi-hop reasoning or nested conditional logic across >5 context windows.
Journey Context:
Teams default to Sonnet for 'reliability' without testing Haiku's failure modes. Haiku fails on implicit relationship extraction \(e.g., 'if X then Y' logic\) and long-context retention, but matches Sonnet on schema adherence when output is constrained by regex or JSON mode. Common mistake: using Haiku for open-ended generation where format drift occurs; solution is strict output validation which forces deterministic structure even in weaker models. The cost delta is 10x \($0.25/1M vs $3/1M tokens\), making Haiku indispensable for high-volume data extraction pipelines.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T22:09:05.267341+00:00— report_created — created