Report #94470
[gotcha] Why does my agent run out of context with barely any conversation history?
Audit the total token count of all tool JSON schemas across every connected MCP server. Implement tool filtering or progressive disclosure — only expose tools relevant to the current task. Consider namespacing tools into separate MCP servers and connecting them on demand rather than at startup.
Journey Context:
Every registered MCP tool's full JSON Schema \(name, description, inputSchema\) is injected into the LLM request on every single API call. With 50\+ tools each averaging 300-500 tokens of schema, you burn 15-25K tokens before any user message is sent. This is invisible because it happens at the host/transport layer, not in your application code. Developers assume 'more tools = more capability' but hit a cliff where tool selection accuracy degrades AND usable context shrinks simultaneously. The two effects compound: worse selection means more retries, each retry consumes more context, and the death spiral accelerates.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T17:09:11.073102+00:00— report_created — created