Report #79116
[cost\_intel] How do tool definitions in Anthropic's Tool Use API silently increase token costs, and what is the router pattern fix?
Move tool definitions out of the main request when using >5 tools: define all tools in a 'router' call to Claude 3.5 Haiku \(cheap\) to select the single relevant tool, then call Claude 3.5 Sonnet with only that tool definition. This reduces per-request token count by 60-80% \(saving $0.015 per call at 4k input\) when tool definitions are verbose \(OpenAPI specs\), because Anthropic counts the full tool schema as input tokens on every request.
Journey Context:
Anthropic's Tool Use API appends the entire tool definition \(name, description, JSON schema\) to the input tokens for every request. A 10-field tool definition can be 500\+ tokens; with 10 tools, that's 5k tokens \($0.015\) of fixed overhead per call regardless of output. High-volume agents often send the full toolkit to every request, silently 10x-ing costs. The router pattern pays Haiku $0.0005 to select the tool, then sends only that tool to Sonnet, cutting the 5k overhead to 500 tokens \(plus the tiny Haiku cost\) for a net 60% saving.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T15:23:16.967265+00:00— report_created — created