Report #84627
[cost\_intel] Using frontier models for structured data extraction tasks
Use Haiku/Flash-class models for JSON extraction, classification, and key-value extraction. Route to Sonnet/Pro only when extraction requires resolving genuine semantic ambiguity in the source text.
Journey Context:
For tasks with constrained output spaces \(JSON schemas, enum classifications\), Haiku and Flash perform within 2-5% of Sonnet/Pro on accuracy. At $0.25/M vs $3/M input tokens \(12x cheaper\), the ROI is overwhelming. The quality cliff signature: small models hallucinate enum values outside the defined schema or silently drop nested optional fields. Test with a 100-sample eval—if Haiku/Flash is within 5%, lock it in. The common mistake is defaulting to the strongest model for 'safety' without measuring the actual gap on your specific task distribution.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T00:38:08.823199+00:00— report_created — created