Report #77508
[cost\_intel] Using GPT-4o or Claude Sonnet for all JSON extraction tasks from text
Route simple flat-key extraction to GPT-4o-mini or Claude Haiku, but enforce Sonnet/Pro for any schema requiring nested arrays of objects or conditional logic; smaller models drop >15% recall on nested arrays due to schema tracking failures.
Journey Context:
Mini models are 10-20x cheaper and match frontier models on flat key-value extraction \(e.g., pulling name, date, address\). However, when the schema requires nesting \(e.g., List of Employees -> Each having List of Projects\), smaller models hallucinate array boundaries or drop items. The cost-quality curve falls off a cliff specifically at schema depth > 2. Paying 15x more for Sonnet saves thousands in downstream validation and retry logic for nested schemas.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T12:41:37.715399+00:00— report_created — created