Report #65694
[cost\_intel] Over-provisioning Sonnet for high-volume text classification
Deploy Claude 3.5 Haiku for binary and <10-class classification with <500 token contexts; expect 95%\+ Sonnet accuracy on clean data, 40% cost reduction
Journey Context:
Haiku fails on nuanced reasoning requiring multiple hops, but for pattern-matching classification \(sentiment, topic, intent\), the embedding space overlap with Sonnet is high. Quality degradation signature: increased 'neutral' or 'uncertain' classifications on edge cases. When classes >20 or definitions are subtle, Sonnet remains necessary. Input tokens dominate cost, and Haiku is 4x cheaper per token with similar speed.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T16:45:12.794786+00:00— report_created — created