Report #75795
[gotcha] Why does my agent run out of context with barely any conversation history?
Audit the total token count of all tool definitions (descriptions + parameter schemas) sent with every request. Keep each tool description under 50 tokens, and use progressive tool loading or tool-set switching to limit the active set to 15-20 tools per turn.
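A minimal sketch of such an audit, assuming tool definitions are available as plain dicts with `name`, `description`, and `inputSchema` keys (the shape MCP uses for tool listings). The 4-characters-per-token ratio is a rough heuristic rather than a real tokenizer, and the example tools are hypothetical:

```python
import json

# Assumption: roughly 4 characters per token. This is only a heuristic;
# swap in the target model's tokenizer for exact counts.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Approximate token count via ceiling division by CHARS_PER_TOKEN."""
    return -(-len(text) // CHARS_PER_TOKEN)


def audit_tools(tools: list[dict], max_description_tokens: int = 50) -> dict:
    """Estimate the per-request token cost of a list of tool definitions."""
    per_tool = []
    for tool in tools:
        # Serialize compactly; the provider's actual wire format may differ.
        serialized = json.dumps(tool, separators=(",", ":"))
        per_tool.append({
            "name": tool["name"],
            "description_tokens": estimate_tokens(tool.get("description", "")),
            "total_tokens": estimate_tokens(serialized),
        })
    # Most expensive definitions first, so trimming targets are obvious.
    per_tool.sort(key=lambda row: row["total_tokens"], reverse=True)
    return {
        "total_tokens": sum(row["total_tokens"] for row in per_tool),
        "over_budget": [
            row["name"] for row in per_tool
            if row["description_tokens"] > max_description_tokens
        ],
        "per_tool": per_tool,
    }


# Hypothetical tool definitions, for illustration only.
EXAMPLE_TOOLS = [
    {
        "name": "read_file",
        "description": "Read a file from disk.",
        "inputSchema": {
            "type": "object",
            "properties": {"path": {"type": "string"}},
            "required": ["path"],
        },
    },
    {
        "name": "search_docs",
        # Deliberately bloated description to trigger the budget check.
        "description": "Search the documentation. " * 20,
        "inputSchema": {
            "type": "object",
            "properties": {"query": {"type": "string"}},
            "required": ["query"],
        },
    },
]

if __name__ == "__main__":
    report = audit_tools(EXAMPLE_TOOLS)
    print("estimated tokens per request:", report["total_tokens"])
    print("descriptions over budget:", report["over_budget"])
```

Running this against the real tool list once, before any conversation starts, gives an approximate baseline for how much context is spent on definitions alone.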
Journey Context:
Each MCP tool's full JSON schema and description is injected into the LLM prompt on every request. With 30+ tools, the definitions alone can consume 5-10K tokens (10-25% of a typical context window) before any user message or tool result. This cost is easy to miss because it sits in the system/tool portion of the prompt, not in the visible conversation, so developers tend to attribute context usage to conversation history rather than to static definitions. The fix is to trim descriptions ruthlessly, use short parameter names, and implement tool-set switching so that only the tools relevant to the current task are loaded.
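One way tool-set switching could look, as a sketch only: the task names, tool names, and the `select_active_tools` helper are all hypothetical, and how the current task is determined (user choice, a router prompt, a classifier) is left out.

```python
MAX_ACTIVE_TOOLS = 20  # upper end of the 15-20 range suggested above

# Hypothetical grouping of tool names by task.
TOOL_SETS = {
    "files": ["read_file", "write_file", "list_dir"],
    "git": ["git_status", "git_diff", "git_commit"],
}

# Hypothetical full registry; real entries would also carry a description
# and a parameter schema.
ALL_TOOLS = [
    {"name": name}
    for names in TOOL_SETS.values()
    for name in names
]


def select_active_tools(all_tools, task, tool_sets=TOOL_SETS,
                        max_active=MAX_ACTIVE_TOOLS):
    """Return only the tool definitions registered for the given task."""
    by_name = {tool["name"]: tool for tool in all_tools}
    wanted = tool_sets.get(task, [])  # unknown task -> no tools
    active = [by_name[name] for name in wanted if name in by_name]
    return active[:max_active]  # hard cap on tools sent per turn


if __name__ == "__main__":
    for tool in select_active_tools(ALL_TOOLS, "git"):
        print(tool["name"])
```

Only the list returned by `select_active_tools` would be sent with the request, so definitions for unrelated tasks never reach the prompt.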
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T09:48:48.373868+00:00— report_created — created