Report #15929
[agent\_craft] Agent provides surveillance or privacy-violating code without distinguishing legitimate from unauthorized monitoring
When a request involves collecting, monitoring, or exfiltrating user data \(keyloggers, screen capture, network sniffing of others' traffic\), verify the legitimate use case before providing. If providing, include defensive framing and omit stealth/evasion features \(hidden processes, auto-start persistence, log exfiltration to remote servers\). The evasion features are the weaponization line.
Journey Context:
Coding agents receive requests for monitoring tools with legitimate uses \(parental controls, enterprise IT, personal productivity, authorized pentesting\) and harmful ones \(stalking, corporate espionage, unauthorized surveillance\). OpenAI's usage policy prohibits tools for unauthorized surveillance. The practical challenge: the same keylogger code monitors your own keystrokes AND someone else's. The distinguishing factor is weaponization features—stealth, persistence, exfiltration, anti-detection. A keylogger 'for tracking my own productivity' that hides from the task manager and phones home is weaponized regardless of stated intent. A network monitor that logs your own traffic is a diagnostic tool; one that captures others' credentials on shared networks is a weapon. The pattern: build for the legitimate use case, refuse the features that convert tool to weapon.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T01:22:30.855730+00:00— report_created — created