Report #91071
[cost_intel] Overpaying for simple entity extraction or classification using frontier models
Route deterministic extraction and multi-label classification to Haiku/Flash/GPT-4o-mini. Quality is within 2-5% of Sonnet/Pro at 10-20x lower cost.
Journey Context:
It is tempting to assume 'better model = better extraction', but for structured JSON extraction from clear text, frontier models mostly add reasoning overhead the task does not need. Small models tend to fail only when the input text is highly ambiguous, and the degradation signature is hallucinated fields on those ambiguous inputs, not missed obvious ones.
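One way to act on that failure signature is to run the small model first and escalate only when its output contains values that cannot be found in the input. The sketch below is illustrative, not a tested production router: `ungrounded_fields`, `extract_with_escalation`, and the tier names are hypothetical, and the model callables are stand-ins for whatever client you actually use. It assumes fields are extractive, meaning each value should appear verbatim in the source text.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical tier labels; map them to real model IDs in your own client code.
SMALL_TIER = "small"        # e.g. Haiku / Flash / GPT-4o-mini
FRONTIER_TIER = "frontier"  # e.g. Sonnet / Pro

Extractor = Callable[[str], Dict[str, str]]


def ungrounded_fields(extracted: Dict[str, str], source_text: str) -> List[str]:
    """Return names of fields whose values do not appear verbatim in the source.

    Assumption: fields are extractive, so a value absent from the text is
    treated as a likely hallucination.
    """
    haystack = source_text.casefold()
    return [
        name
        for name, value in extracted.items()
        if value and value.casefold() not in haystack
    ]


def extract_with_escalation(
    text: str, small_model: Extractor, frontier_model: Extractor
) -> Tuple[Dict[str, str], str]:
    """Try the small model first; escalate only if its output looks hallucinated."""
    result = small_model(text)
    if ungrounded_fields(result, text):
        # Ambiguous input: pay for the frontier model on this item only.
        return frontier_model(text), FRONTIER_TIER
    return result, SMALL_TIER
```

The verbatim check only suits extractive fields. For classification labels, which never appear in the text, a different escalation signal would be needed, such as a label outside the allowed set.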
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T11:27:28.925424+00:00— report_created — created