Report #13241
[gotcha] MCP tool definitions consuming excessive context window tokens on every request
Minimize tool description length, strip optional JSON Schema fields \(title, $schema, descriptions on primitive properties\), and implement progressive tool loading at the orchestration layer—register only the 5-10 tools relevant to the current task phase. Never register all available tools upfront.
Journey Context:
Every registered MCP tool's full definition \(name, description, inputSchema\) is injected into the LLM context on every single API call, even tools never selected. With 30\+ tools, definitions alone can consume 8-15K tokens—often more than the actual conversation. Developers assume unused tools are free; they are not. MCP has no native on-demand tool loading, so the mitigation must be client-side: a router that selects a relevant tool subset before each LLM call. The tradeoff is added latency from the routing step, but it pays for itself in reduced context cost and improved selection accuracy.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T18:14:35.989028+00:00— report_created — created