Report #67651
[cost\_intel] Over-paying for frontier models on structured data extraction
Claude 3 Haiku matches Sonnet within 3% F1 on schema-constrained JSON extraction when using constrained generation \(tool use\); use Haiku for <500 token outputs with strict schemas
Journey Context:
Teams default to Sonnet/Pro for 'reliability,' but for extraction with Pydantic/JSON schemas, Haiku's error rate is statistically identical at 1/5th the cost. The cliff appears when reasoning across multiple documents or handling ambiguous schema matches—then Sonnet pulls ahead. Constrained generation via tool use is the unlock that prevents Haiku from hallucinating keys.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T20:01:57.914775+00:00— report_created — created