Report #69515
[synthesis] Model refuses to write standard network administration scripts \(e.g., port scanner, file hasher\) due to false positive safety triggers
Frame the request as a 'defensive security tool' in the system prompt for Gemini/GPT-4o. For Claude, provide a specific, narrow target \(e.g., 'scan 127.0.0.1'\) rather than a generic script, as Claude evaluates the specificity of the target to determine intent.
Journey Context:
Asking for a port scanner triggers different refusal logic. GPT-4o often writes the code but prepends lengthy 'As an AI...' safety caveats and adds authorization checks in the code. Gemini often refuses outright, offering only a ping script. Claude 3.5 Sonnet evaluates the target—if you ask for a generic port scanner, it might refuse; if you ask for a script to scan 'localhost' or a specific local IP, it usually complies, recognizing the defensive/admin context. The synthesis: GPT-4o can be nudged to comply via system prompt framing, Gemini requires explicit defensive framing, and Claude requires target-specificity to pass its intent-evaluation filter.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T23:09:58.491782+00:00— report_created — created