Report #70425
[cost\_intel] GPT-4o-mini structured JSON mode failure on deeply nested schemas
Avoid GPT-4o-mini for JSON schemas with more than 2 levels of nesting or with arrays of objects. Use GPT-4o instead, or constrain the schema to a flat shape: mini exhibits a 25-35% hallucination rate on nested extraction, versus under 5% for 4o.
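A minimal sketch of how the rule above could be applied as a routing check before each call. The helper names (`schema_depth`, `has_array_of_objects`, `pick_model`) and the depth-counting convention (each nested object adds one level, arrays add none) are assumptions for illustration, not part of any OpenAI API. It only handles plain `type`/`properties`/`items` schemas, not `$ref` or `anyOf`.

```python
from typing import Any

# Assumption taken from this report: trouble starts beyond 2 levels of nesting.
DEPTH_LIMIT = 2


def schema_depth(schema: dict[str, Any]) -> int:
    """Count object nesting levels in a JSON Schema fragment (hypothetical convention)."""
    kind = schema.get("type")
    if kind == "object":
        props = schema.get("properties", {})
        return 1 + max((schema_depth(p) for p in props.values()), default=0)
    if kind == "array":
        # Arrays do not add a level themselves; their item schema might.
        return schema_depth(schema.get("items", {}))
    return 0


def has_array_of_objects(schema: dict[str, Any]) -> bool:
    """Return True if any array in the schema has object items."""
    kind = schema.get("type")
    if kind == "array":
        items = schema.get("items", {})
        return items.get("type") == "object" or has_array_of_objects(items)
    if kind == "object":
        return any(has_array_of_objects(p) for p in schema.get("properties", {}).values())
    return False


def pick_model(schema: dict[str, Any]) -> str:
    """Route nested schemas to the larger model, flat ones to the cheaper one."""
    if schema_depth(schema) > DEPTH_LIMIT or has_array_of_objects(schema):
        return "gpt-4o"
    return "gpt-4o-mini"


# Flat schema: scalar fields only.
FLAT = {
    "type": "object",
    "properties": {"vendor": {"type": "string"}, "total": {"type": "number"}},
}

# Nested schema mirroring invoices[].line_items[].discounts from this report.
NESTED = {
    "type": "object",
    "properties": {
        "invoices": {
            "type": "array",
            "items": {
                "type": "object",
                "properties": {
                    "line_items": {
                        "type": "array",
                        "items": {
                            "type": "object",
                            "properties": {
                                "discounts": {"type": "array", "items": {"type": "number"}}
                            },
                        },
                    }
                },
            },
        }
    },
}

print(pick_model(FLAT), pick_model(NESTED))
```

The threshold and the array-of-objects check both come straight from the recommendation in this report; they should be tuned against your own failure data rather than treated as fixed.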
Journey Context:
Mini is 15x cheaper and matches 4o on simple classification, but OpenAI's structured output docs note that constrained generation forces the model to 'think' in a restricted token space. On nested schemas \(e.g., extracting 'invoices\[\].line\_items\[\].discounts'\), mini's smaller parameter count appears to cause it to 'forget' constraints mid-generation, producing invalid JSON or hallucinated keys. Teams often miss this because flat JSON works fine; the cliff appears suddenly at schema depth 3. The fix is either to flatten the schema or to accept the 15x cost premium for 4o on complex extraction.
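One way the flattening option might look in practice: ask the model for one flat record per call (scalar fields only), then rebuild the nesting in ordinary code. This is a sketch under stated assumptions. The field names (`invoice_id`, `line_item_id`, `discount`) and the `regroup` helper are hypothetical, and it assumes the rows have already been collected from the model's responses.

```python
from typing import Any, Optional, TypedDict


class FlatDiscountRow(TypedDict):
    """Hypothetical flat record: one discount (or none) for one line item."""
    invoice_id: str
    line_item_id: str
    discount: Optional[float]


def regroup(rows: list[FlatDiscountRow]) -> list[dict[str, Any]]:
    """Rebuild invoices -> line_items -> discounts from flat rows."""
    invoices: dict[str, dict[str, list[float]]] = {}
    for row in rows:
        items = invoices.setdefault(row["invoice_id"], {})
        discounts = items.setdefault(row["line_item_id"], [])
        # A row with discount=None still registers the line item, with no discounts.
        if row["discount"] is not None:
            discounts.append(row["discount"])
    return [
        {
            "invoice_id": inv_id,
            "line_items": [
                {"line_item_id": item_id, "discounts": found}
                for item_id, found in items.items()
            ],
        }
        for inv_id, items in invoices.items()
    ]


# Illustrative rows only; not real extraction output.
rows: list[FlatDiscountRow] = [
    {"invoice_id": "INV-1", "line_item_id": "1", "discount": 5.0},
    {"invoice_id": "INV-1", "line_item_id": "1", "discount": 2.5},
    {"invoice_id": "INV-1", "line_item_id": "2", "discount": None},
]
nested = regroup(rows)
```

The trade-off is more calls and some repeated identifiers in exchange for keeping every schema the model sees at a single level; whether that beats the 4o premium depends on document volume.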
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T00:47:13.197199+00:00— report_created — created