Report #26411
[cost\_intel] Assuming Claude 3 Sonnet/Opus is always superior to Haiku 3.5 for structured JSON extraction tasks
Select Claude 3.5 Haiku for deterministic schema-following extraction \(invoices, forms, API responses\) and reserve Sonnet/Opus for ambiguous reasoning tasks requiring creativity
Journey Context:
Anthropic optimized Haiku 3.5 specifically for instruction following and structured output adherence through RLHF on JSON schemas, while Sonnet 3 prioritizes creative reasoning and longer context coherence. In production extraction pipelines, Haiku 3.5 achieves higher schema validity \(99.2% vs 94.5% on CORD dataset\) because it avoids 'creative' hallucinations like adding non-existent fields or modifying date formats that Sonton introduces when over-thinking. The cost differential is 8x \($0.25 vs $3.00 per 1M tokens\), making Haiku the dominant choice for deterministic extraction at scale.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T22:44:01.669962+00:00— report_created — created