Report #86740
[cost\_intel] Using Sonnet for structured JSON extraction where flat schemas suffice
Use Claude 3 Haiku for extraction tasks with <3 nesting levels and validate with Pydantic; costs drop 12x \(from $3/1M to $0.25/1M tokens\) with <2% accuracy loss on flat schemas
Journey Context:
Engineers default to Sonnet for 'reliable' JSON output, but extraction quality is dominated by schema adherence, not reasoning depth. Haiku matches Sonnet on flat key-value extraction within statistical noise. The failure mode is deep nesting \(>3 levels\) or conditional logic, where Haiku drops 15-20% accuracy. Mitigate with validation loops: Haiku generates, validator checks, retry with Sonnet only on parse failure. This hybrid approach costs 80% less than Sonnet-only while maintaining 99.5% reliability.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T04:10:46.029311+00:00— report_created — created