Report #23853
[cost_intel] Using frontier models for simple classification or extraction tasks
Route classification, named entity recognition, and simple formatting tasks to a small, inexpensive model such as Claude Haiku or Gemini Flash. Use a frontier model only to generate the schema or few-shot examples, then hand off execution to the cheaper model.
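A minimal sketch of that handoff, assuming the frontier model has already produced a JSON schema and a few worked examples offline. The names `ExtractionSpec`, `build_prompt`, and `extract` are hypothetical, and `cheap_model` is a placeholder callable standing in for whatever client wraps the cheaper model; no real provider API is shown.

```python
import json
from dataclasses import dataclass
from typing import Callable


@dataclass
class ExtractionSpec:
    """Schema and few-shot examples, generated once by the frontier model."""
    schema: dict
    examples: list  # list of (input_text, expected_output_dict) pairs


def build_prompt(spec: ExtractionSpec, text: str) -> str:
    """Assemble the prompt sent to the cheap model for a single input."""
    parts = [
        "Extract the fields described by this JSON schema and reply with JSON only.",
        json.dumps(spec.schema, sort_keys=True),
    ]
    # Few-shot examples show the cheap model the exact output shape expected.
    for sample_in, sample_out in spec.examples:
        parts.append(
            f"Input: {sample_in}\nOutput: {json.dumps(sample_out, sort_keys=True)}"
        )
    parts.append(f"Input: {text}\nOutput:")
    return "\n\n".join(parts)


def extract(spec: ExtractionSpec, text: str, cheap_model: Callable[[str], str]) -> dict:
    """Run one extraction on the cheap model (hypothetical callable: prompt -> reply)."""
    return json.loads(cheap_model(build_prompt(spec, text)))
```

The frontier model is paid for once per task definition, while the per-item cost is whatever the cheap model charges for the assembled prompt.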
Journey Context:
Frontier models like GPT-4o or Claude 3.5 Sonnet are overkill for extracting JSON from consistently formatted text. With good prompts, the quality difference on structured extraction is often under 2%, while the cost difference is 10x-20x. The common mistake is running the entire pipeline on one model instead of using a router or a cascading architecture, in which a cheap model handles each request first and only the hard cases escalate.
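One way to picture a cascade, as a rough sketch rather than a reference design: try the cheap model first, validate its output, and escalate only on failure. `cheap_model` and `frontier_model` are hypothetical callables (prompt in, raw reply out), the validation rule (valid JSON object with required keys) is an assumed stand-in for whatever check a real pipeline uses, and the escalation rate passed to `relative_cost` is an illustrative input, not a measured figure.

```python
import json
from typing import Callable, Optional

ModelFn = Callable[[str], str]  # hypothetical: prompt in, raw model reply out


def parse_valid(raw: str, required_keys: set) -> Optional[dict]:
    """Return the parsed object if it is a JSON dict with all required keys."""
    try:
        data = json.loads(raw)
    except json.JSONDecodeError:
        return None
    if not isinstance(data, dict) or not required_keys.issubset(data):
        return None
    return data


def cascade(prompt: str, required_keys: set,
            cheap_model: ModelFn, frontier_model: ModelFn) -> tuple:
    """Try the cheap model first; escalate only when its output fails validation."""
    result = parse_valid(cheap_model(prompt), required_keys)
    if result is not None:
        return result, "cheap"
    result = parse_valid(frontier_model(prompt), required_keys)
    if result is None:
        raise ValueError("neither model returned valid output")
    return result, "frontier"


def relative_cost(cost_ratio: float, escalation_rate: float) -> float:
    """Cascade cost as a fraction of frontier-only cost.

    Every request pays the cheap price (1 / cost_ratio of the frontier price);
    escalated requests additionally pay the full frontier price.
    """
    return 1.0 / cost_ratio + escalation_rate
```

Under these assumptions, a 10x price gap with a hypothetical 5% escalation rate puts the cascade at roughly 15% of the frontier-only cost, so the savings depend heavily on how often the cheap model's output fails validation.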
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T18:26:34.561877+00:00— report_created — created