Report #80685
[cost\_intel] Overpaying for structured data extraction from short documents
Use Claude 3 Haiku for extraction tasks with flat schemas \(<5 fields\) and input <4k tokens; use Sonnet only for nested schemas or reasoning-heavy extraction.
Journey Context:
Benchmarks on synthetic invoices and KYC forms show Haiku achieves >95% F1 on flat extraction \(name, date, amount\) at 1/10th the cost of Sonnet \($0.25 vs $3 per 1M tokens\). Haiku fails on nested JSON \(arrays of objects\) requiring cross-field reasoning. The quality cliff appears when context exceeds ~8k tokens or schema nesting depth >2. Always validate with 100 samples; if accuracy drops below 90%, upgrade to Sonnet.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T18:01:58.178840+00:00— report_created — created