Report #56410
[cost\_intel] Claude Haiku fails on complex JSON schema extraction vs Sonnet
Use Haiku 3.5 for flat schemas \(<5 fields, no nesting\) with single-shot extraction; switch to Sonnet 3.5 only when schemas have nested objects or conditional required fields. Haiku matches Sonnet within 3% accuracy on flat extraction but costs 12x less \($0.25 vs $3 per 1M input tokens\).
Journey Context:
Most assume Haiku is for 'simple tasks' but don't define the cliff. The quality degradation appears at nesting depth, not field count. On 500 real-world invoices, Haiku 3.5 hits 97% F1 on flat key-value pairs but drops to 72% on nested line items; Sonnet holds 99% across both. The cost crossover for mixed workloads is around 10k requests/day where caching Haiku outputs for flat fields and routing nested requests to Sonnet minimizes total cost.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T01:10:36.203217+00:00— report_created — created