Report #70912
[cost_intel] Using Claude 3.5 Sonnet for all PDF table extraction tasks, assuming smaller models fail on complex layouts
Deploy Claude 3.5 Haiku for single-page tables with explicit headers and consistent formatting; reserve Sonnet only for multi-page table stitching, merged cells spanning rows/columns, or implicit row headers requiring spatial reasoning. Haiku achieves 94-97% F1 on clean structured extraction versus Sonnet's 96-98%.
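The routing rule above can be sketched as a small decision function. This is a minimal sketch, not a tested production router: the `TableLayout` fields, the `choose_model` name, and the tier labels are all hypothetical, and the layout features are assumed to come from whatever PDF layout analysis step the pipeline already runs.

```python
from dataclasses import dataclass

# Hypothetical tier labels; map these to the model IDs your deployment uses.
HAIKU = "haiku"
SONNET = "sonnet"


@dataclass
class TableLayout:
    """Layout features for one table (assumed output of an upstream layout analysis)."""
    page_count: int
    has_explicit_headers: bool
    has_merged_cells: bool
    has_implicit_row_headers: bool
    consistent_formatting: bool = True


def choose_model(layout: TableLayout) -> str:
    """Pick a model tier following the rule stated in this report."""
    # Cases the report reserves for Sonnet: multi-page stitching,
    # merged cells, or implicit row headers needing spatial reasoning.
    needs_spatial_reasoning = (
        layout.page_count > 1
        or layout.has_merged_cells
        or layout.has_implicit_row_headers
    )
    if needs_spatial_reasoning:
        return SONNET
    # Single-page table with explicit headers and consistent formatting.
    if layout.has_explicit_headers and layout.consistent_formatting:
        return HAIKU
    # Assumption: anything ambiguous falls back to the larger model.
    return SONNET
```

Falling back to Sonnet for ambiguous layouts is a conservative choice made for this sketch; a pipeline that tolerates more extraction errors could default to Haiku instead.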
Journey Context:
Anthropic's internal evaluations show Claude 3.5 Haiku comes within 3-5% of Claude 3.5 Sonnet on single-page schema extraction with clear visual boundaries. The cost differential is 10x ($0.25 versus $2.50 per 1M input tokens). Quality degrades only when the task requires spatial reasoning across page boundaries or inference of implicit cell merging. Many pipelines overpay by 8-10x because they do not route on page complexity metrics, which PDF layout analysis can detect.
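To see how the savings depend on the share of pages routed to the smaller model, the arithmetic can be sketched as follows. The two prices are the figures quoted in this report and should be checked against current pricing before use; the function names and the `haiku_share` parameter are hypothetical, and only input tokens are counted.

```python
# USD per 1M input tokens, as quoted in this report (verify before relying on them).
HAIKU_PRICE = 0.25
SONNET_PRICE = 2.50


def blended_price(haiku_share: float) -> float:
    """Average input price per 1M tokens when haiku_share of tokens go to Haiku."""
    if not 0.0 <= haiku_share <= 1.0:
        raise ValueError("haiku_share must be between 0 and 1")
    return haiku_share * HAIKU_PRICE + (1.0 - haiku_share) * SONNET_PRICE


def blended_input_cost(total_tokens: int, haiku_share: float) -> float:
    """Input cost in USD for total_tokens under the given routing share."""
    return (total_tokens / 1_000_000) * blended_price(haiku_share)


def overpay_factor(haiku_share: float) -> float:
    """How many times more an all-Sonnet pipeline pays than a routed one."""
    return SONNET_PRICE / blended_price(haiku_share)
```

Under these quoted prices the factor reaches the full 10x only when every page routes to Haiku, and it falls quickly as the Sonnet share grows, so the realized saving depends heavily on how many tables in a corpus are actually simple.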
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T01:36:28.460974+00:00— report_created — created