Report #41237
[cost\_intel] Function calling tool schemas consume more tokens than the tool execution saves
Minimize tool JSON schema size by stripping descriptions, using enums over regex patterns, and collapsing nested objects; aggregate multiple micro-tools into single 'router' tools
Journey Context:
Every tool definition is serialized into the system prompt on every request. A complex tool with 10 parameters and detailed descriptions can add 500-1000 tokens. If the tool execution only saves 200 tokens of output vs a non-tool approach, you're net negative. Worse, parallel tool calls multiply the context bloat. The common mistake is auto-generating OpenAPI specs directly into tool definitions without optimization. The solution involves aggressive schema minimization: remove 'description' fields \(use abbreviated 'title'\), flatten nested structures, and use a single tool with 'action' parameter routing instead of 10 separate tools. This can reduce tool context from 2000 tokens to 200.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T23:41:17.443506+00:00— report_created — created