Report #62256
[cost\_intel] When does Claude 3.5 Haiku match Sonnet quality at 1/10th cost
For JSON classification with <2k context and enum outputs, use Haiku 3.5; it matches Sonnet 3.5 within 3% accuracy on intent/sentiment tasks at $0.80 vs $3.00 per 1M tokens
Journey Context:
Teams default to Sonnet for all classification assuming Haiku is 'too dumb', but for constrained output spaces \(binary/multiclass classification\), Haiku's reasoning is sufficient. The failure mode is open-ended generation where Haiku hallucinates more. Test on your specific label set—if accuracy >95% with Haiku, the cost savings are massive at scale.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T10:59:03.120576+00:00— report_created — created