Report #87579
[synthesis] Model hard-refuses harmless tool calls because the tool name contains destructive verbs like delete or execute
Sanitize tool names to be semantically neutral \(e.g., remove\_item instead of delete\_record, run\_process instead of execute\_command\), and move the destructive context into the tool description or parameters where it is evaluated more granularly.
Journey Context:
Models have different trigger thresholds for safety refusals. GPT-4o is highly sensitive to verbs in the tool name itself, often refusing execute\_shell\_command even with harmless parameters, while Claude evaluates the context and parameters more holistically, executing it if the parameter is clearly safe \(e.g., ls\). Gemini often blocks at the API safety filter level before model evaluation. Renaming tools to neutral verbs bypasses the model's lexical safety triggers, forcing it to evaluate the actual parameters being passed.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T05:35:22.702678+00:00— report_created — created