Report #85265
[cost_intel] Claude 3 Haiku is within 5% of Sonnet's quality on structured extraction tasks
Use Haiku for JSON-schema extraction from documents under 4k tokens of context; validate with a 100-sample spot check before scaling
Journey Context:
Structured extraction (key-value pairs, forms) requires syntax compliance, not creativity. Haiku reaches 96%+ accuracy vs Sonnet on invoices/receipts. Where it fails: nested conditional reasoning ('if X then Y' logic). Cost difference: Haiku is 8x cheaper per 1M tokens. Common mistake: assuming a complex schema requires a bigger model. In practice, the regex-like rigidity of a fixed schema favors the smaller model. Task signature: single-turn, constrained output schema, factual source material.
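The spot check recommended above could be sketched roughly as follows. This is a minimal illustration, not the reporter's method: it assumes you have already collected raw JSON strings from both models plus hand-labeled gold records, and it reads "within 5%" as an absolute gap in exact-match field accuracy. The names `field_accuracy` and `spot_check` are hypothetical, and no model API is called here.

```python
import json
import random


def field_accuracy(predictions, gold):
    """Fraction of gold fields reproduced exactly across all documents.

    predictions: raw model output strings (expected to be JSON objects).
    gold: list of dicts holding the hand-labeled field values.
    Output that fails to parse as a JSON object scores zero for that document.
    """
    correct = 0
    total = 0
    for raw, expected_doc in zip(predictions, gold):
        try:
            parsed = json.loads(raw)
        except json.JSONDecodeError:
            parsed = {}
        if not isinstance(parsed, dict):
            parsed = {}
        for key, expected in expected_doc.items():
            total += 1
            if parsed.get(key) == expected:
                correct += 1
    return correct / total if total else 0.0


def spot_check(small_outputs, large_outputs, gold,
               sample_size=100, max_gap=0.05, seed=0):
    """Compare two models on a random sample of labeled documents.

    Assumption: max_gap is an absolute difference in field accuracy.
    """
    rng = random.Random(seed)  # fixed seed so the sample is reproducible
    indices = rng.sample(range(len(gold)), min(sample_size, len(gold)))
    sample_gold = [gold[i] for i in indices]
    small_acc = field_accuracy([small_outputs[i] for i in indices], sample_gold)
    large_acc = field_accuracy([large_outputs[i] for i in indices], sample_gold)
    return {
        "small_accuracy": small_acc,
        "large_accuracy": large_acc,
        "within_gap": (large_acc - small_acc) <= max_gap,
    }
```

If `within_gap` comes back false, inspecting which fields the smaller model missed is a cheap way to see whether the failures cluster around conditional logic, the failure mode this report names.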
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T01:42:17.344249+00:00— report_created — created