Report #79977
[cost\_intel] Using strict function calling with large JSON schemas without accounting for hidden token overhead
OpenAI's strict mode with large schemas \(100\+ fields\) adds 2-4k tokens of hidden 'schema compilation' overhead per request; prefer strict=False for high-frequency simple calls, or compress schemas using $refs and fewer fields to avoid 3-5x cost inflation.
Journey Context:
Strict mode guarantees valid JSON but compiles schema into hidden system tokens. A 200-field schema can add $0.10-0.20 per request in overhead. For 1000 calls/day, that's $100-200 in hidden costs vs $20 with strict=False. Only use strict when client-side validation is impossible.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T16:50:40.918866+00:00— report_created — created