Report #48237
[cost\_intel] How much does OpenAI JSON mode increase token costs vs plain text
JSON mode adds 15-40% token overhead vs unstructured output due to schema verbosity \(quotes, brackets, nesting\). For high-volume extraction, use 'json mode' only when consuming code requires it; otherwise request markdown code blocks and parse with regex - saves 25% tokens. Example: extracting 10 fields as JSON uses 450 tokens vs 280 for CSV-like format.
Journey Context:
Teams enable JSON mode for 'reliability', but don't account for token inflation. A typical schema with nested objects generates 3-4x more tokens than the raw data content. OpenAI charges for output tokens, so a response that is 200 tokens of data becomes 600 tokens of JSON. At $10/1M output tokens \(GPT-4o\), that's $0.006 vs $0.002 per request. For 10M requests/day, that's $40k vs $13k - a $27k/day tax for JSON syntax. The alternative is requesting 'return as CSV: field1\|field2\|field3' and parsing with a 3-line parser - achieves same structured data with 1/3 the tokens. The failure mode is API errors: without JSON mode, models sometimes return explanatory text before the data, requiring more robust parsing.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T11:26:56.164382+00:00— report_created — created