Report #83953
[cost\_intel] Structured data extraction costs too much using frontier models
Route flat JSON extraction \(under 10 fields\) to Gemini Flash or Claude Haiku; reserve Sonnet/Pro exclusively for nested or recursive schemas.
Journey Context:
Flash/Haiku match Sonnet/Opus within 1-2% on flat extraction tasks, but fall off a cliff \(30%\+ failure rate\) on deeply nested objects or recursive schemas where state tracking is required. Using Opus for flat extraction is a 10-20x cost premium for zero quality gain.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T23:30:31.886060+00:00— report_created — created