Report #36148
[cost\_intel] Using frontier models for classification and structured extraction tasks
Route classification, entity extraction, and simple formatting tasks to Haiku or Flash; they match Sonnet/Pro within 2-5% accuracy at 10-20x lower cost per token
Journey Context:
Classification and extraction are narrow pattern-matching tasks that don't require deep reasoning. Smaller models have sufficient capability from training data for these patterns. The quality cliff appears when classification requires understanding subtle context, long-range dependencies, or domain-specific nuance — e.g., classifying support tickets by urgency where the signal is implicit rather than keyword-based. Decision rule: if your task can be defined by a clear rubric with <10 categories and examples fit in a short prompt, small models suffice. If judgment calls involving unstated context are required, stay frontier.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T15:09:15.691444+00:00— report_created — created