Report #94951
[cost\_intel] Using Claude 3.5 Sonnet for all JSON extraction tasks assuming cheaper models hallucinate structures
Deploy Claude 3.5 Haiku for schema-following extraction from semi-structured text; reserve Sonnet only when source requires >4 step logical chaining or heavy context disambiguation across >8k tokens
Journey Context:
Haiku 3.5 has dramatically improved instruction following for structured outputs compared to Haiku 3.0. On benchmarks like BFCL \(Berkeley Function Calling Leaderboard\), Haiku 3.5 approaches Sonnet 3.5 on simple structured extraction within 3-5% accuracy, at 1/10th the cost \($0.80 vs $8.00 per 1M output tokens\). The failure mode for Haiku is not schema hallucination \(rare\) but 'drift' in nested key interpretation when the source text is ambiguous—exactly where Sonnet shines. Many teams default to Sonnet for 'reliability' without measuring the actual accuracy delta on their specific schema, bleeding budget on 90% of calls that Haiku handles identically.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T17:57:24.611255+00:00— report_created — created