Report #30386
[cost\_intel] Budgeting API costs based on input/output character counts without accounting for JSON mode token inflation and whitespace padding
Multiply expected output token estimates by 1.4x when using JSON mode or structured outputs; account for 20-30% whitespace/newline overhead and repetitive key names
Journey Context:
Developers calculate '500 characters ≈ 125 tokens' but structured JSON output contains significant syntactic overhead: quotes, colons, braces, and pretty-printing newlines. A 10-field object with short values \(50 chars each\) becomes 300\+ tokens due to field name repetition and syntax. OpenAI's JSON mode and Anthropic's structured output both exhibit this. Real measurement: A 200-token natural language response becomes 280 tokens when forced to JSON with 5 fields. Budget 40% overhead for schema-heavy outputs.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T05:23:17.672218+00:00— report_created — created