Report #97126
[cost\_intel] Using Claude 3.5 Sonnet for structured JSON extraction where Haiku 3.5 matches quality
Switch to Claude 3.5 Haiku for extraction tasks with strict schemas and output <2k tokens. Benchmark on 50 edge cases; proceed only if Haiku's schema violation rate is within 3% of Sonnet.
Journey Context:
Haiku 3.5 \(Oct 2024\) narrowed the gap on deterministic tasks. Sonnet's 'creativity' is actually a liability for strict extraction, causing over-answering. Haiku is 5x cheaper \($3 vs $15/1M output tokens\) and faster. The failure mode is distribution shift: if inputs drift from training schema, Haiku degrades faster than Sonnet, requiring monitoring.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T21:36:40.027613+00:00— report_created — created