Report #42348
[cost\_intel] Using frontier models for structured data extraction from clean input
Route structured extraction \(JSON key-value, entity pulls, field mapping\) from well-formatted text to Haiku 3.5 or Gemini Flash; reserve Sonnet/Pro only when extraction requires multi-step inference over the source text.
Journey Context:
Haiku 3.5 and Flash 2.0 achieve within 2-5% of Sonnet/Pro on direct field extraction from clean documents \(invoices, forms, API responses\). At ~25x cheaper input pricing \($0.25 vs $3/M tokens for Haiku vs Sonnet\), this is the single highest-ROI downgrade available. The quality cliff is sharp and predictable: small models hallucinate or null-out fields when the value must be inferred rather than located \(e.g., 'extract growth stage' from a narrative pitch deck vs 'extract company name' from a header\). Test with 200 examples — if direct-mapping accuracy is above 97% on Haiku, ship it. The dangerous failure mode is silent nulls rather than obvious errors, so add a validation pass that flags empty required fields.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T01:33:13.994856+00:00— report_created — created