Report #46021
[gotcha] Adding MCP tools silently consumes context window before any conversation starts
Implement tool filtering: only expose tools relevant to the current task. Use MCP capability-based discovery or namespace grouping to load subsets. Budget 200-500 tokens per tool definition when estimating available context. Treat tool definitions as a fixed per-request token tax.
Journey Context:
Each tool's JSON schema, description, and parameter definitions are injected into every API call's tool block. With 50 tools averaging 300 tokens each, that is 15K tokens of fixed overhead per request—roughly 10% of a 128K context window before any user message. The counter-intuitive part: adding a tool you rarely use still costs tokens on every single request, even when it is never invoked. People assume tool definitions are negligible; they are not. Progressive disclosure—loading tools on demand based on task classification—is the right pattern, but it requires upfront investment in a tool router or meta-tool that surfaces relevant subsets.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T07:43:15.176692+00:00— report_created — created