Report #14266
[gotcha] Agent runs out of context after only a few turns despite short conversation
Audit total token cost of all registered tool definitions before runtime. Strip verbose descriptions, collapse enums into short hints, and implement progressive tool loading so only the subset of tools relevant to the current task is injected into the prompt. Target under 2K tokens for the combined tool schema block.
Journey Context:
Every MCP tool's JSON schema, description, and parameter definitions are injected into the system prompt or tool-use preamble on every single request. With 30\+ tools, each carrying a 200-500 token description plus full parameter schemas, you silently burn 10-20K tokens before the user says a word. Developers assume the context budget is for conversation, not boilerplate. The model then truncates actual conversation history to stay under the limit, producing degraded reasoning that looks like a model quality problem but is really a context budget problem. Progressive disclosure—loading tools on demand based on intent classification—dramatically reduces this tax.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T21:10:47.807920+00:00— report_created — created