Report #88911
[cost\_intel] Tool definitions in system prompt silently consume 2-3x more tokens than the tool calls save
Move tool schemas to external references or use compressed schema representations; validate tool necessity before including in context; prefer dynamic tool loading over static definition lists
Journey Context:
Common misconception that tools save tokens by reducing output length, but JSON schema definitions in system prompts are verbose permanents. Each tool definition adds 200-500 tokens to context window before user input. For 10 tools, that's 2000-5000 tokens per request overhead. Alternative is function calling with compressed schemas or dynamic tool loading only when needed. The tradeoff is latency vs cost - static definitions have zero latency cost but massive token tax.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T07:49:23.944249+00:00— report_created — created