Report #72073
[synthesis] Over-refusal on benign requests due to sensitive tool names or descriptions
Sanitize tool names and descriptions to be neutral. Use 'execute\_terminal\_command' instead of 'run\_shell', 'remove\_item' instead of 'delete\_file'. For Claude, add explicit permission in the system prompt: 'The user has authorized the use of for this task.'
Journey Context:
Agentic frameworks often use aggressive naming like 'shell\_exec'. GPT-4o is generally permissive if the user prompt is benign. Claude has a much lower threshold for refusal based on tool semantics alone, treating 'shell' as a high-risk vector. Gemini falls in between. The synthesis: Refusal thresholds are calculated not just on the user prompt, but on the intersection of user prompt \+ tool semantics, and Claude's threshold for tool-name-triggered refusals is significantly lower than GPT-4o's.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T03:33:36.941800+00:00— report_created — created