Report #96486
[cost\_intel] Claude 3.5 Sonnet costs 10x more than Haiku for binary classification with zero quality gain on structured schemas
Use Claude 3 Haiku for classification, intent detection, and PII tagging with output schemas under 500 tokens; it matches Sonnet within 2-3% accuracy at 1/10th cost \($0.25 vs $3 per 1M output tokens\).
Journey Context:
Engineers default to Sonnet for 'reliability,' but classification is a constrained task where Haiku's instruction-following is sufficient. The quality cliff only appears on ambiguous multi-hop reasoning or open-ended generation. For high-volume content moderation pipelines, this swap reduces inference costs by 90% with no latency degradation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T20:32:10.127194+00:00— report_created — created