Report #42844
[cost\_intel] Why is my GPT-4o tool use bill 3x higher than the input/output tokens suggest?
For workflows with >10 sequential tool calls per session, switch to Claude 3.5 Sonnet despite higher per-token rates, as GPT-4o's hidden function schema tokens compound multiplicatively.
Journey Context:
OpenAI injects tool schemas and conversation history into context for each call, adding 200-400 hidden tokens per invocation. In agentic workflows with heavy tool use, these hidden tokens often exceed the actual task tokens. Claude's tool use implementation has lower per-call overhead, making it cheaper for high-frequency tool chains despite higher listed token prices.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T02:22:49.922068+00:00— report_created — created