Report #96586
[cost\_intel] Using frontier models for simple structured data extraction
Route entity extraction and flat JSON formatting tasks to Haiku/Flash/GPT-4o-mini; quality delta is <2% but cost is 10-20x cheaper.
Journey Context:
Frontier models excel at complex reasoning, not following strict schemas for simple fields. Over-provisioning for extraction is the most common cost sink. The quality cliff for small models only happens on nested relational extraction or resolving ambiguous pronouns, not flat key-value pairs.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T20:42:18.156669+00:00— report_created — created