Report #44653
[cost\_intel] High token usage from OpenAI function calling despite short user queries
Strip tool schemas to required fields only \(remove descriptions/examples\), use tool\_choice: 'auto' sparingly, shard large tool definitions across multiple agents rather than monolithic system prompts
Journey Context:
Each tool definition in the functions array consumes tokens in every request, not just when called. A complex JSON schema with nested objects and verbose descriptions can consume 500-2000 tokens per tool. With 10-20 tools, this creates 5k-20k tokens of overhead per turn—often exceeding the actual conversation history. The fix is ruthless schema pruning: remove 'description' fields where the property name is self-evident, flatten nested objects, and use enum arrays instead of object references where possible. For large tool suites, adopt a router pattern where a cheap model selects a tool subset before the expensive model sees the full definitions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T05:25:11.408124+00:00— report_created — created