Report #85564
[synthesis] LLM invokes tools unnecessarily when a simple text response would suffice
For GPT-4o and Gemini, append 'Only use this tool when strictly necessary; prefer answering from your training data if possible' to each tool description. For Claude, leave the descriptions unchanged and rely on its natural reluctance to call tools unless the prompt explicitly requests tool use.
Journey Context:
GPT-4o and Gemini Pro exhibit 'eager tool calling': they invoke search or calculator tools for basic trivia they already know. Claude 3.5 Sonnet, by contrast, is naturally reluctant to use tools unless the prompt implies that tool use is required. The eager behavior makes GPT-4o agents waste tokens and add latency through redundant API calls. The fix curbs that eagerness at the tool-description level, which brings the tool-use decision boundary of the different models closer together.
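A minimal sketch of how the description-level throttle might be applied. Only the appended sentence comes from this report. The function names, the model-name prefixes, and the flat `{"name", "description"}` tool shape are assumptions made for illustration; real SDKs may nest the description differently, so adapt the lookup to the schema in use.

```python
from copy import deepcopy

# Sentence quoted from this report; everything else below is illustrative.
RESTRAINT_SUFFIX = (
    "Only use this tool when strictly necessary; "
    "prefer answering from your training data if possible"
)

# Assumption: model-name prefixes for the families the report calls eager.
EAGER_MODEL_PREFIXES = ("gpt-4o", "gemini")


def is_eager_model(model_name: str) -> bool:
    """Return True if the model name matches an assumed eager family."""
    return model_name.lower().startswith(EAGER_MODEL_PREFIXES)


def throttle_tool_descriptions(tools: list[dict], model_name: str) -> list[dict]:
    """Return a copy of `tools` with the restraint sentence appended
    to each description when the target model is in an eager family.

    Tools for other models (for example Claude) are copied unchanged.
    The input list is never mutated.
    """
    patched = deepcopy(tools)
    if not is_eager_model(model_name):
        return patched
    for tool in patched:
        description = tool.get("description", "").rstrip()
        # Skip tools that already carry the sentence, so repeated calls are safe.
        if RESTRAINT_SUFFIX not in description:
            tool["description"] = f"{description} {RESTRAINT_SUFFIX}".strip()
    return patched


if __name__ == "__main__":
    # Hypothetical tool definition used only for demonstration.
    tools = [{"name": "web_search", "description": "Search the web."}]
    for model in ("gpt-4o", "claude-3-5-sonnet"):
        print(model, "->", throttle_tool_descriptions(tools, model)[0]["description"])
```

Keeping the throttle in one helper means the per-model decision lives in a single place, and the original tool definitions stay untouched for models that do not need the extra sentence.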
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T02:12:20.606616+00:00— report_created — created