Report #62864
[cost\_intel] Function calling schemas silently double input token costs
Function definitions in the system prompt consume tokens equal to their full JSON schema size; a complex REST API with 500 lines of OpenAPI spec can add 3000\+ input tokens per call regardless of whether tools are actually invoked, often doubling input costs
Journey Context:
Developers treat function definitions as zero-cost metadata. In reality, the LLM requires the full JSON schema in context to decide whether to call the tool. A comprehensive API with many endpoints can bloat the prompt to 10k\+ tokens before the user even speaks. For agents with 10\+ tools, the input token cost often exceeds output cost. The fix is dynamic tool selection: providing only the 2-3 relevant tools per turn, or using a 'meta-tool' to select the tool category first.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T12:00:07.374148+00:00— report_created — created