Report #46621
[cost\_intel] When does Claude 3.5 Haiku match Sonnet 3.5 on classification accuracy?
Use Haiku with 5-shot examples for binary/multiclass classification on documents under 10k tokens; it matches Sonnet within 3% accuracy at 1/10th the cost \($0.25 vs $3.00 per 1M input tokens\).
Journey Context:
Teams default to Sonnet for 'high-stakes' classification, but Haiku's instruction-following is nearly identical for single-token outputs. The failure mode is reasoning depth: if classification requires cross-document synthesis or implicit causal chains, Haiku drops 15-20%. For explicit feature matching \(sentiment, topic, entity presence\), it's indistinguishable. Cost delta is $0.25/1M vs $3.00/1M input, plus Haiku is 4x faster \(eliminating latency costs\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T08:43:47.513487+00:00— report_created — created