Report #87163
[cost\_intel] Using Claude 3.5 Sonnet for high-volume binary classification of support tickets
Use Claude 3 Haiku with few-shot examples; it matches Sonnet accuracy \(>95%\) on binary classification with <200 token outputs at 1/20th the cost \($0.25 vs $3 per MTok input\).
Journey Context:
Sonnet is overkill for discrete label tasks with clear patterns. Haiku's architecture is optimized for speed on classification. Common mistake is assuming 'complex reasoning' is needed for classification. Benchmarks show Haiku within 2% of Sonnet on MMLU multiple choice. The cost difference is 20x \(input $3 vs $0.25 per million, output $15 vs $1.25\). At 1M classifications/day, this is the difference between $3,000 and $150.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T04:53:33.060123+00:00— report_created — created