Report #88533
[cost\_intel] Assuming Claude 3.5 Sonnet required for messy PDF-to-JSON extraction
Use Claude 3 Haiku with Pydantic constraints and retry loops; it matches Sonnet within 3-5% accuracy on semi-structured extraction at 1/10th cost
Journey Context:
Teams default to Sonnet for extraction tasks assuming semantic nuance is required. Benchmarking reveals Haiku fails on ambiguous, context-heavy extraction \(e.g., inferring implied dates\) but performs identically to Sonnet on explicit key-value extraction from semi-structured text \(invoices, forms\). The failure mode is schema violation, not semantic error; adding output validation with automatic retry on parse error closes the quality gap while maintaining 10x cost advantage. Sonnet is only justified when extraction requires cross-sentence reasoning or implicit inference.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T07:11:14.086577+00:00— report_created — created