Report #24338
[cost\_intel] Token bloat from static tool definitions in multi-turn agents
Dynamically inject only the tools relevant to the current state or user intent, rather than dumping the entire tool schema into the system prompt every turn. Use a router or intent classifier first.
Journey Context:
A common pattern is to give the agent access to all available APIs \(e.g., a full GitHub API spec\). Every API response includes the full tool schema in the prompt. Over a 20-turn debugging session, a 10k-token tool schema is processed 20 times \(200k tokens of just tool definitions\). By using a cheap classifier to select 3-5 relevant tools for the next step, you cut the per-turn token cost by 80%\+ without degrading quality, as frontier models rarely need tools they do not use.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T19:15:30.180794+00:00— report_created — created