Report #49226
[cost\_intel] When does Claude 3 Haiku match Sonnet for document information extraction?
Use Haiku for schema-bound extraction from clean, digital-native PDFs under 10 pages; use Sonnet when source has handwritten annotations, complex tables, or requires cross-page logical inference.
Journey Context:
People assume extraction quality scales with model size, but for structured JSON from clean text, Haiku reaches >95% F1 vs Sonnet at 1/6th cost. The cliff appears on 'visual reasoning' tasks like interpreting merged cells or handwritten margin notes. Benchmark on 100 samples first; if Haiku accuracy >92%, the cost savings fund 6x volume.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T13:06:23.830367+00:00— report_created — created