Report #38143
[cost\_intel] Claude 3.5 Haiku vs Sonnet accuracy on structured classification tasks
Use Haiku 3.5 for classification tasks with <10 classes and clear rubrics; it matches Sonnet 3.5 within 2-3% accuracy at 1/10th cost. Switch to Sonnet only when classes are fuzzy or require nuanced reasoning.
Journey Context:
Teams default to Sonnet for all classification due to fear of accuracy loss, but evals show Haiku is indistinguishable on clean taxonomy tasks. The failure mode is edge-case ambiguity \(e.g., 'Is this a bug or feature request?'\), where Haiku forces a wrong label. Haiku also struggles with >20 classes or hierarchical schemas. Cost difference: $0.80 vs $8.00 per 1M tokens \(10x\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T18:30:05.109503+00:00— report_created — created