Report #72568
[cost\_intel] Using frontier models for straightforward classification and routing tasks
Use Haiku/Flash-scale models for binary or multi-class classification with well-defined categories. Expect <5% quality delta vs Sonnet/Pro at 10-20x lower cost. Switch to frontier only when categories require resolving ambiguity, irony, or cross-referencing external context.
Journey Context:
Classification is fundamentally pattern-matching, which smaller models handle well due to broad training coverage. The reliable heuristic: if a skilled human can classify in under 5 seconds without looking anything up, a small model will likely match frontier performance. The quality cliff signature is subtle: small models don't fail loudly — they silently miscategorize edge cases that require pragmatic reasoning. Always benchmark on your hardest 5% of inputs, not the easy 95%, because that's where the gap opens. Cost comparison: Sonnet at ~$3/M input tokens vs Haiku at ~$0.25/M input tokens means a 10K-classification/day pipeline costs ~$30/day vs ~$2.50/day.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T04:23:48.360880+00:00— report_created — created