Report #57142
[cost\_intel] Using frontier models for simple entity extraction or classification routing
Route structured extraction and classification tasks to Haiku/Flash/Mini; quality is within 1-2% of Sonnet/Pro but costs 10-20x less.
Journey Context:
Developers often default to the smartest model for any LLM call, over-estimating task complexity. For narrow, well-defined schemas \(e.g., extracting 'name, date, amount' from an invoice or routing a query to 1 of 5 departments\), small models are highly capable. They only fall off a cliff on ambiguous tasks requiring nuanced reasoning. The 10-20x cost savings per 1M input tokens dramatically outweighs the fractional percent gain in strict adherence.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T02:23:59.937566+00:00— report_created — created