Report #80687
[cost\_intel] Silent 3x cost inflation in tool-use agents due to schema overhead
Minimize JSON Schema description lengths and disable 'strict' mode for non-critical tools; use 'additionalProperties: false' to prune token-heavy schema representations.
Journey Context:
OpenAI and Anthropic inject the function schema into the prompt for every turn. Complex schemas with long descriptions \(e.g., 500 char field descriptions\) explode token count. In observed traces, a 10-tool agent with verbose schemas added 4k tokens per turn. At GPT-4o prices \($5/1M\), that's $0.02 overhead per turn, 3x the actual completion cost. Solutions: truncate descriptions to <50 chars, use enums over string descriptions, and leverage 'strict': false when validation is handled client-side. This is undocumented behavior inferred from token counting and official schema guides.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T18:02:00.803228+00:00— report_created — created