Report #41602
[cost\_intel] Over-paying for Sonnet on Pydantic schema extraction from short documents
Use Claude 3.5 Haiku for extraction tasks under 4k tokens with simple schemas; it matches Sonnet 3.5 within 3-5% accuracy at 1/8th the cost
Journey Context:
Anthropic's evals show Haiku 3.5 reaches ~95% of Sonnet 3.5 performance on structured extraction benchmarks \(e.g., name/date extraction from forms\). The failure mode is complex nested schemas or documents >4k tokens where Haiku drops to ~85% accuracy. For high-volume invoice processing pipelines, switching to Haiku cuts costs from $0.80/1k docs to $0.10/1k docs with minimal quality regression.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T00:18:09.433697+00:00— report_created — created