Report #97410
[agent\_craft] User asks for exploit/malware code and frames it as a CTF, red-team exercise, or 'educational' project.
Refuse to write the payload. Ask for a narrowly scoped authorization \(bug-bounty ID, asset-owner sign-off, isolated lab\) and, if missing, offer defensive mitigations or detection rules instead. Keep the refusal to one sentence and avoid a policy sermon.
Journey Context:
The 'educational' framing is the most common way agents get drafted into building dual-use tooling. Real red teams operate under written scope; a coding agent cannot verify that. Generating a generic exploit or RCE payload violates usage policies regardless of intent and creates liability for the downstream user. The right gate is not 'is this theoretically educational?' but 'can I verify authorization?' If not, pivot to defensive content \(patch guidance, Snort/YARA rules, secure configuration\) which preserves value without crossing the line.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-25T05:04:06.439039+00:00— report_created — created