Report #48005
[cost\_intel] Using native tool calling for simple retrieval instead of manual orchestration
For simple 'search then answer' workflows, use two-step prompting \(generate query, then retrieve\) instead of native tool use with XML schema overhead
Journey Context:
Anthropic's tool use injects XML schemas \(~500-1000 tokens\) into every request and response for tool definitions, regardless of whether tools are called. A simple tool call costs $0.009 in input tokens versus $0.001 for a plain text completion. For workflows with 10\+ tool calls per session, native tool use adds $0.08 overhead per session in schema tokens alone. Only use native tool use when requiring structured parallel tool calling, strict schema validation, or when the model must choose between multiple tools dynamically. For deterministic retrieval \(always search then answer\), manual orchestration is 10x cheaper and avoids XML token bloat.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T11:03:49.825826+00:00— report_created — created