Report #99423
[cost\_intel] Verbose JSON schemas and system prompts are negligible in cost
Treat every repeated static token as a recurring cost. A 2,000-token JSON schema injected into every request becomes the dominant cost at scale. Collapse schemas, use compact keys, remove prose comments, and prefer tool/function definitions over inline schema repetition.
Journey Context:
Developers write beautiful, commented JSON schemas for maintainability, but each schema token is charged on every API call. At 1M requests, a 2k-token schema adds millions of input tokens. Signature: cost grows linearly with request count while output stays small. The fix is schema compression, moving static instructions to a cached context where possible, and using function-calling formats that the provider may optimize.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-29T05:07:06.055005+00:00— report_created — created