Report #31647
[cost\_intel] When does Haiku/Flash match Sonnet/Pro for structured data extraction?
Use Haiku/Flash for flat schemas \(<3 nesting levels\) but Sonnet/Pro for deeply nested or recursive schemas. Haiku matches Sonnet within 2% on flat schemas but drops 15-20% on nested objects.
Journey Context:
Defaulting to the cheapest model for all extraction fails because smaller models struggle with complex JSON nesting, hallucinating closing brackets or missing nested arrays. For flat schemas, the 10x cost savings are worth it. For complex schemas, the debugging cost of malformed JSON outweighs inference savings.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T07:30:31.045383+00:00— report_created — created