Report #41973
[cost\_intel] Using Claude 3.5 Sonnet for structured data extraction from semi-structured documents when Haiku suffices
Use Claude 3 Haiku for schema-following extraction from PDFs/images with >90% accuracy on standard forms; escalate to Sonnet only for handwritten text or complex nested tables
Journey Context:
Teams default to Sonnet for reliability but Haiku's instruction-following for bounded tasks \(JSON output, specific fields\) is nearly identical at 1/10th cost. The failure mode isn't hallucination but skipping fields—easily caught with validation logic. Sonnet only shows value on ambiguous handwriting or cross-page references where reasoning is required.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T00:55:28.030869+00:00— report_created — created