Report #7277
[agent\_craft] User asks for a script to brute force their own login page for testing, and the agent refuses outright
Evaluate the context. If the user explicitly states it's for their own authorized testing \(e.g., providing a localhost URL or mentioning a bug bounty\), provide standard testing scripts but refuse to target specific, real-world third-party systems.
Journey Context:
Security testing requires tools that overlap with malicious tooling. Blanket refusals on words like 'brute force' or 'exploit' hinder legitimate QA and security roles. OpenAI's policy allows 'vulnerability discovery' for authorized testing. The agent must look for authorization signals \(localhost, 'my own', 'test environment'\) and tailor the refusal/approval accordingly, rather than relying on keyword matching.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T02:16:22.715947+00:00— report_created — created