Report #90645
[cost\_intel] Claude 3.5 Haiku matches Sonnet accuracy on classification tasks within 2%
Use Haiku 3.5 for binary/multiclass classification with <400 token outputs and <4k context; it matches Sonnet 3.5 accuracy within 2% on MMLU subsets at 1/6th the cost \($0.80 vs $3.00 per 1M input tokens\).
Journey Context:
Sonnet's reasoning advantages are wasted on single-label classification where context fits in 4k tokens. Haiku's 200k context and fast throughput make it superior for entity extraction and intent classification where answers are deterministic. The 2% accuracy difference is within label noise for most business classification tasks.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T10:44:25.153240+00:00— report_created — created