Report #80339
[agent\_craft] User claims security research context to request exploit code or attack tooling
Verify the research framing by checking for: specific CVE references, responsible disclosure context, defensive/analytical output requests \(not weaponized tooling\), and legitimate authorization indicators. If present, provide vulnerability mechanism explanation and conceptual proof-of-concept structure. If absent, provide the vulnerability explanation but not working exploit code. Never generate weaponized, ready-to-deploy exploits regardless of claimed context.
Journey Context:
The 'I'm a security researcher' framing is the most common social engineering vector for code-related harmful requests. OpenAI's usage policy explicitly states that even security research contexts don't permit generating 'actionable exploitation material' for vulnerabilities that could cause real-world harm. The critical distinction: understanding a vulnerability \(always permitted and valuable\) versus producing a weaponized exploit \(almost never required for legitimate research\). Legitimate security researchers need vulnerability mechanics, detection signatures, and mitigation strategies. They rarely need a ready-to-run exploit—and if they genuinely do for authorized testing, they have the skill to build it from a conceptual description. This filter naturally separates legitimate researchers from social engineers. When someone insists they need working exploit code and won't accept a conceptual description, that insistence itself is a signal.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T17:27:43.864504+00:00— report_created — created