Report #51808
[cost\_intel] Function calling context window filling faster than expected
Count function definitions as system message tokens \(roughly 50-100 tokens for a simple function, more as descriptions and schemas grow\); compress descriptions, or send minimal schemas with \`strict: true\`
Journey Context:
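Developers often assume only the call arguments count toward token usage. In practice, the JSON schema and \`description\` text of every tool are sent with each request, so 10 complex tools can consume 2k\+ tokens before the user message arrives. OpenAI injects function definitions into the system message, and verbose \`description\` fields \(which the model uses to select tools\) are token-heavy. Stripping or shortening descriptions, or pairing \`strict: true\` with minimal schemas, reduces the overhead. The saving comes from the smaller schema rather than from the \`strict\` flag itself, and overly terse descriptions can make tool selection less reliable.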
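To see how much the descriptions contribute, the sketch below compares a tool list before and after stripping them. It is a rough illustration only: the helper functions and the sample weather tool are hypothetical, and the token figure assumes about 4 characters per token of serialized JSON. The exact way the provider serializes tools into the prompt is not public, so treat the numbers as relative, and use a real tokenizer if you need accurate counts.

```python
import json

# Assumption: ~4 characters per token. This is a coarse heuristic, not a tokenizer.
CHARS_PER_TOKEN = 4


def strip_descriptions(node, in_properties=False):
    """Return a copy of a tool definition with every 'description' field removed.

    Keys directly inside a 'properties' map are parameter names, so a parameter
    that happens to be called 'description' is kept.
    """
    if isinstance(node, dict):
        out = {}
        for key, value in node.items():
            if key == "description" and not in_properties:
                continue  # drop the human-readable description text
            child_is_properties_map = key == "properties" and not in_properties
            out[key] = strip_descriptions(value, child_is_properties_map)
        return out
    if isinstance(node, list):
        return [strip_descriptions(item) for item in node]
    return node


def estimate_tokens(tools):
    """Rough token estimate for a list of tool definitions (heuristic only)."""
    text = json.dumps(tools, separators=(",", ":"))
    return len(text) // CHARS_PER_TOKEN


# Hypothetical tool in the Chat Completions "tools" format.
weather_tool = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": (
            "Look up the current weather conditions for a given city, "
            "including temperature, humidity, and wind speed."
        ),
        "parameters": {
            "type": "object",
            "properties": {
                "city": {
                    "type": "string",
                    "description": "Name of the city, for example Paris or Tokyo.",
                },
                "unit": {
                    "type": "string",
                    "enum": ["celsius", "fahrenheit"],
                    "description": "Temperature unit to use in the response.",
                },
            },
            "required": ["city", "unit"],
            "additionalProperties": False,
        },
    },
}

tools = [weather_tool]
slim_tools = strip_descriptions(tools)

print("estimated tokens with descriptions:", estimate_tokens(tools))
print("estimated tokens without descriptions:", estimate_tokens(slim_tools))
```

The overhead is paid on every request, so the difference multiplies across the number of tools and the number of calls. Before shipping a stripped or shortened schema, check that the model still selects the right tool.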
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T17:27:10.776419+00:00— report_created — created