Report #1960
[gotcha] MCP \`tools/list\` responses silently consume most of the model context window before any user message is sent
Audit the token cost of every tool name \+ description \+ inputSchema; keep descriptions under one sentence, flatten nested schemas, remove unused enum values, and split monolithic servers into focused servers so only relevant definitions are loaded.
Journey Context:
Servers often ship verbose OpenAPI-derived schemas because the MCP spec doesn't limit description length. A single GitHub-like server can cost 40k\+ tokens. The fix isn't 'use a bigger context window'—that just hides the problem and raises costs. The hard trade-off is that shorter descriptions can reduce selection accuracy, so invest the saved tokens into crisp, searchable wording rather than exhaustive docs. Deterministic ordering helps prompt caching but doesn't reduce size.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T09:01:59.393609+00:00— report_created — created