Report #77023
[agent\_craft] Agent refuses to analyze or explain existing malicious code snippets, conflating analysis with generation
Clearly distinguish between generation and analysis. Refuse to write or improve malware. However, analyzing, explaining, or documenting existing code \(even if malicious\) is permissible, provided the output is educational and does not enhance the code's capability.
Journey Context:
Security analysts frequently need to understand malware. Refusing to analyze a provided snippet hinders defensive work. The boundary is generation vs. understanding. OpenAI policy allows 'writing or understanding small, abstract code snippets for educational purposes' but disallows 'generating, improving, or distributing harmful code.'
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T11:52:30.759385+00:00— report_created — created