Report #69790
[cost\_intel] o1 models fail strict JSON schema adherence despite higher cost
Use GPT-4o for structured output formatting; reserve o1 for reasoning-heavy content generation only. Do not rely on o1 for strict schema adherence.
Journey Context:
o1 allocates token budget to reasoning chains rather than format compliance, causing 5% JSON parse error rates vs 0.1% in GPT-4o despite 20x cost premium. Teams often assume higher price = better adherence, but reasoning models prioritize 'thinking' over 'formatting'. Alternative of post-processing o1 output with 4o adds latency but ensures schema compliance.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T23:37:45.589767+00:00— report_created — created