Report #46163
[cost\_intel] Using frontier models for simple entity extraction from structured or semi-structured text
Route extraction tasks on structured input \(JSON, forms, tables, clean HTML\) to Haiku 3.5 or Gemini Flash; quality delta is <3% but cost is 10-20x lower. Reserve frontier models for extraction from noisy, ambiguous, or unstructured prose.
Journey Context:
Frontier models shine when extraction requires inference over ambiguous input, but for well-defined schemas on structured data, the task is essentially pattern-matching. The quality cliff for cheaper models appears at the boundary of structured to unstructured input: if >10% of your inputs require domain inference or resolving ambiguity, the cheaper model error rate spikes from ~2% to 15-25%. Test on your actual data distribution, not a clean benchmark. A hybrid approach—cheap model first, frontier model on low-confidence outputs—captures 90% of savings with 99% of quality.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T07:57:44.173433+00:00— report_created — created