Report #25406
[cost_intel] Using frontier models for simple classification or structured data extraction
Route classification and JSON extraction tasks to small models such as Haiku or Flash. With few-shot examples in the prompt, these can reach >95% of Sonnet quality at 10-20x lower cost.
Journey Context:
Developers often default to GPT-4 or Claude Opus for every task. For tasks like 'extract the company name' or 'classify this ticket', a frontier model's reasoning capability goes unused. On these tasks the bottleneck for small models is format adherence, not reasoning, and few-shot examples address it. The savings are on the order of the 10-20x figure above. Reserve frontier models for classification that requires deep semantic understanding of long, ambiguous context.
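A minimal sketch of the few-shot approach, assuming a ticket-classification task. The label set, the example tickets, and the `small_model` / `large_model` callables are all hypothetical stand-ins (any function that takes a prompt string and returns the model's text reply would do). The fallback to the larger model on a malformed reply is an added assumption, not something this report prescribes.

```python
import json
from typing import Callable, Optional

# Hypothetical label set and few-shot examples, for illustration only.
LABELS = ("billing", "bug", "feature_request")
FEW_SHOT = [
    ("I was charged twice this month.", "billing"),
    ("The app crashes when I open settings.", "bug"),
    ("Please add a dark mode.", "feature_request"),
]


def build_prompt(ticket: str) -> str:
    """Build a prompt whose examples show the exact JSON shape expected."""
    lines = [
        "Classify the support ticket. Reply with JSON only, in the form "
        '{"label": "<one of: ' + ", ".join(LABELS) + '>"}.',
        "",
    ]
    for text, label in FEW_SHOT:
        lines.append(f"Ticket: {text}")
        lines.append("Answer: " + json.dumps({"label": label}))
        lines.append("")
    lines.append(f"Ticket: {ticket}")
    lines.append("Answer:")
    return "\n".join(lines)


def parse_label(raw: str) -> Optional[str]:
    """Return the label if the reply is valid JSON with a known label, else None."""
    try:
        data = json.loads(raw.strip())
    except json.JSONDecodeError:
        return None
    if not isinstance(data, dict):
        return None
    label = data.get("label")
    return label if label in LABELS else None


def classify(
    ticket: str,
    small_model: Callable[[str], str],
    large_model: Callable[[str], str],
) -> Optional[str]:
    """Try the small model first; escalate only if its reply fails validation."""
    prompt = build_prompt(ticket)
    label = parse_label(small_model(prompt))
    if label is None:
        # Assumed fallback: retry once on the larger, more expensive model.
        label = parse_label(large_model(prompt))
    return label
```

In practice `small_model` and `large_model` would wrap whichever provider client is in use. Validating the reply before accepting it keeps format failures visible, so the share of requests that escalate can be measured rather than guessed.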
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T21:02:50.090689+00:00— report_created — created