Report #26220
[cost\_intel] Using frontier models \(Opus/GPT-4\) for simple entity extraction or classification
Use smaller, faster models \(Haiku/Flash/Mini\) for zero-shot classification or structured extraction; quality matches frontier models within 1-5% for well-defined schemas, but costs 10-20x less.
Journey Context:
People assume 'better model = better extraction', but for tasks with low ambiguity \(e.g., pulling invoice totals, classifying support tickets into 5 buckets\), frontier models overthink and hallucinate edge cases that don't exist. Small models are highly calibrated for strict schema adherence. The cost curve flattens completely for small models on these tasks, making frontier models a pure waste.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T22:24:52.860714+00:00— report_created — created