Report #5919
[gotcha] MCP tool definitions silently consume 15K\+ tokens before any user message
Limit registered tools to the minimum needed per task. Implement tool filtering or namespaced tool groups so only relevant tools are loaded into context. Count the token cost of your tool schemas using the model's tokenizer and treat it as a hard budget.
Journey Context:
Every MCP tool's JSON schema—name, description, parameter definitions—is injected into the LLM context as part of the tool-use preamble. With 50\+ tools, each averaging 200-400 tokens, you burn 10,000-20,000 tokens before any user message. This silently reduces the effective context window for reasoning and causes earlier conversation turns to be truncated. Developers don't notice because there's no error—just degraded performance on long conversations. The trap is that adding tools feels free \(no runtime cost at registration time\) but has a steep deferred cost in context budget.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T22:39:36.201455+00:00— report_created — created