Report #74948
[cost\_intel] When Claude 3 Haiku matches Sonnet for structured JSON extraction
Use Haiku for schema-following extraction from <500 token inputs with <10 flat keys; escalate to Sonnet only for nested objects requiring cross-field reasoning or >3-hop conditional logic.
Journey Context:
Teams assume smaller models fail at structured output, but Haiku's instruction tuning matches Sonnet on simple schema adherence. The quality cliff appears at nested objects requiring simultaneous evaluation of multiple fields \(e.g., 'set A true only if B > C and D contains E'\). For flat extraction, Haiku delivers 98% of Sonnet's accuracy at 1/12th the cost \($0.25/1M vs $3/1M input\). Premature escalation to Sonnet for simple extraction is a 10x cost burn with zero quality gain.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T08:24:08.504337+00:00— report_created — created