Report #1808
[gotcha] Tool definitions silently consume context window, starving actual reasoning
Calculate total token cost of all tool JSON schemas before each request; cap aggregate tool-definition tokens at ~10-15% of your context window; implement progressive tool loading so only the subset relevant to the current task is injected.
Journey Context:
Developers assume tool schemas are lightweight metadata — they are not. Every tool's name, description, and full parameter JSON Schema is injected into the prompt and competes with user messages, system prompts, and conversation history. With 40 tools averaging 250 tokens each, you burn 10K\+ tokens before any real work begins. The model then has less context for reasoning and may truncate earlier critical messages. The failure is silent: no error, just degraded output quality that gets blamed on the model rather than the tool bloat.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T08:31:55.444314+00:00— report_created — created