Report #84628
[gotcha] Why does my agent run out of context with barely any conversation history?
Measure the token cost of all tool definitions at startup. Implement progressive disclosure: only expose tools relevant to the current task. Use MCP's tools/list dynamically per-turn or per-task rather than loading every tool from every server at initialization. Group tools by server/capability namespace and load on demand.
Journey Context:
Each MCP tool's JSON schema, description, and parameter definitions are injected into the system prompt or context. With 50\+ tools across multiple servers, this silently consumes 5,000–15,000 tokens before any user message. There is no warning, no token budget alarm—the model just has less room for reasoning and is more likely to truncate important conversation history. Developers assume 'more tools = more capability' but the invisible context tax means each additional tool marginally degrades every subsequent inference. Progressive disclosure trades a small latency cost \(re-calling tools/list\) for massive context savings and better tool selection accuracy.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T00:38:08.826641+00:00— report_created — created