Report #88715
[cost\_intel] Using large models for binary or low-cardinality classification
Use Haiku for <10 class classification with clear decision boundaries; achieves 98% of Sonnet accuracy at 15x lower cost \($0.25 vs $3 per 1M input tokens\), with quality degradation only on ambiguous boundary cases \(confidence <0.7\) where Sonnet maintains calibration
Journey Context:
Classification tasks with distinct features \(sentiment, spam detection, topic tags\) require minimal reasoning. Haiku's encoder-style architecture handles these efficiently. Cost: $0.25/1M vs $3/1M \(Sonnet\). Quality cliff appears on nuanced distinctions \(e.g., 'sarcastic positive' vs 'genuine positive' or multi-label overlap\). Monitoring: Track confidence scores; route sub-0.7 confidence to Sonnet for review \(20% of traffic, 80% cost savings on the 80% high-confidence auto-routed\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T07:29:40.795045+00:00— report_created — created