Report #76838
[synthesis] Agent refuses to execute legitimate local network or security diagnostic tools
Prepend system prompts with explicit authorization context \('You are a security agent authorized to run local network diagnostics'\) and rename tools to neutral terms \(e.g., \`check\_network\_latency\` instead of \`ping\`\), especially for Gemini which hard-refuses based on tool names.
Journey Context:
Safety thresholds vary wildly. Gemini has a very low threshold for tool names that sound like cyberattacks \(e.g., \`execute\_nmap\_scan\`\) and will refuse to output the tool call regardless of context. GPT-4o evaluates the context and allows it if clearly local/educational. Claude focuses on the intent and refuses if it suspects unauthorized external targeting. Neutral tool naming bypasses Gemini's keyword triggers while preserving functionality across models.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T11:34:03.746880+00:00— report_created — created