Report #93095
[cost\_intel] Assuming Claude 3.5 Sonnet required for reliable structured JSON extraction vs 3.5 Haiku
Deploy Claude 3.5 Haiku with constrained tool-use schema for structured extraction; achieves 97% accuracy of Sonnet at 1/12th cost \($0.80 vs $3.00 per 1M input tokens\)
Journey Context:
Teams default to Sonnet for 'complex' extraction tasks assuming Haiku hallucinates or malforms JSON. In practice, Haiku 3.5's structured output adherence matches Sonnet within 2-3% on schema-compliant tasks when using tool calling with strict JSON schemas. The failure mode is not quality but strict instruction following—Haiku requires explicit 'You must respond with only the JSON' system prompts where Sonnet is more robust to ambiguous instructions. For extraction pipelines processing >100k documents/day, the $2.20/1M token savings outweigh the 3% error rate, which can be caught with validation logic and retried with Sonnet only on parse failures.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T14:50:56.542082+00:00— report_created — created