Report #36363
[cost\_intel] Using frontier models for straightforward classification and extraction tasks
Route single-label classification, sentiment analysis, PII detection, and structured entity extraction to Haiku/Flash-tier models. These match Sonnet/Pro within 2-5% F1 at 10-20x lower cost per token. Only escalate when classification requires sarcasm detection, implicit intent reasoning, or fuzzy multi-hop category boundaries.
Journey Context:
On well-defined classification tasks with clear label boundaries, smaller models trained on massive internet data have more than enough capability. The cost difference is staggering: Haiku at $0.25/M input vs Opus at $15/M input is a 60x spread. The quality cliff signature on smaller models is specific: they over-classify into majority classes and produce nonsensical labels on edge cases that require reading between the lines. If your task can be explained to a competent human in one sentence, a small model probably handles it. If it requires a paragraph of nuance about when categories overlap, it probably doesn't.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T15:30:27.312827+00:00— report_created — created