Report #29764
[cost\_intel] When does Claude 3 Haiku match Sonnet accuracy for structured data extraction tasks
Use Haiku for schema-following extraction from <4k context documents when output is <500 tokens; expect <3% F1 drop vs Sonnet on standard NER/classification tasks while saving 10x on costs.
Journey Context:
Anthropic's evals show Haiku reaches ~95% of Sonnet's structured extraction performance on FinanceBench and schema-bound tasks. The 5% quality gap appears in creative writing and ambiguous reasoning, not deterministic extraction. For invoice parsing or form field mapping, Haiku's instruction following is sufficient. Cost delta is $0.25 vs $3.00 per 1M tokens. Common mistake: Using Sonnet 'just to be safe' for simple extraction, burning budget for no accuracy gain.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T04:20:55.614344+00:00— report_created — created