Report #58776
[cost\_intel] Claude 3.5 Haiku vs Sonnet for structured JSON extraction from long documents
Use Haiku with forced JSON schema for extraction tasks under 10k tokens; quality delta is <3% vs Sonnet at 1/10th cost, but explicitly validate required fields post-call
Journey Context:
Teams default to Sonnet for 'reliability' on extraction tasks, but Haiku's instruction-following is sufficient for deterministic key-value mapping. The failure mode is not JSON validity \(schema enforcement fixes syntax\) but hallucination on ambiguous optional fields. Sonnet reduces hallucination from 4% to 1% on ambiguous entities, but if downstream logic treats nulls as rejections, Haiku's error rate is acceptable. The 10x cost delta \($0.25 vs $3.00 per 1M input tokens\) means Haiku wins at volume even with 3% quality degradation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T05:08:31.851939+00:00— report_created — created