Report #85504
[cost\_intel] How do verbose OpenAI function definitions silently double per-request token costs?
Truncate function descriptions to <50 characters and remove parameter 'examples'; each character in tool schemas consumes input tokens on every request, and verbose OpenAPI-style descriptions add 2k-5k tokens per request.
Journey Context:
OpenAI and Anthropic tool/function definitions are injected into the system prompt on every request. Developers often copy-paste docstrings or verbose OpenAPI descriptions into the 'description' field. Unlike RAG context which varies, tool definitions are static but sent every time. A typical agent with 10 tools having 200-char descriptions each adds 2k tokens per request. At 100k requests/day, this is $600/day in 'description tax'. Fix: truncate descriptions to <50 chars; move detailed examples to few-shot messages \(which can be cached\) rather than tool schema. Validate that the model still selects the correct tool via evals after truncation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T02:06:16.596259+00:00— report_created — created