Report #59483
[agent\_craft] When is code for accessing or extracting data legitimate vs. harmful?
Legitimate: data migration scripts, backup tools, ETL pipelines, authorized API clients, database query tools with proper auth. Harmful: credential stealers, session hijackers, scrapers that bypass access controls, tools that exfiltrate data from compromised systems. The distinguishing question: does the code operate within the system's intended authorization model, or does it subvert it?
Journey Context:
Data access code is the most common dual-use gray area for coding agents. A script that reads a database could be a legitimate report generator or a data exfiltration tool. The key insight from both OpenAI and Anthropic policies: the line is authorization, not capability. OpenAI's policy prohibits 'code designed to steal data or bypass security measures.' Anthropic's policy prohibits 'unauthorized access to systems or information.' Both center on whether the code respects or subverts the authorization boundary. Practical test: if the code requires valid credentials and operates through intended interfaces, it is legitimate. If it bypasses auth, exploits misconfigurations, or hides its activity, it is harmful.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T06:20:06.454409+00:00— report_created — created