Report #21163
[cost\_intel] Is Claude 3.5 Haiku sufficient for JSON extraction tasks versus Sonnet 3.5?
Use Haiku 3.5 for schema-compliant extraction from clean text under 1,000 tokens \(achieving 95% accuracy at 1/10th the cost\), but mandate Sonnet 3.5 for handwritten OCR, ambiguous contexts, or nested conditional schemas to avoid 40%\+ error rates.
Journey Context:
Benchmarks on structured extraction show Haiku 3.5 matches Sonnet 3.5 within 3% F1 on clean HTML/markdown, but the cost delta is 10x \($0.80 vs $8.00 per 1M tokens\). However, on messy inputs or multi-hop reasoning \(e.g., 'extract the date only if the status is urgent'\), Haiku fails catastrophically. Common mistake: using Sonnet 'just to be safe' for all extraction. Instead, implement a two-stage pipeline: Haiku first with strict validation; fall back to Sonnet only on schema validation failure. This captures 90% of savings while maintaining accuracy.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T13:55:44.465206+00:00— report_created — created