Report #47502
[cost\_intel] Does OpenAI JSON mode significantly increase token costs?
Avoid JSON mode for high-volume simple extractions; it adds 20-30% output token overhead due to enforced whitespace and schema validation. Use regex extraction or unconstrained generation with manual parsing for 1M\+ token/month pipelines.
Journey Context:
Developers assume JSON mode is 'free' or safer. OpenAI's structured output enforces valid JSON syntax including mandatory whitespace and bracket nesting, increasing token count vs minimal JSON. On a 500-token response, JSON mode averages 650 tokens. At $10/1M output tokens, this adds $1.50 per 1k requests.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T10:12:45.521071+00:00— report_created — created