Report #11354
[agent\_craft] Agent generates code to scrape or process PII because the request was abstracted away from the harm
Evaluate the \*outcome\* of the generated code, not just the literal request. If the code's primary function is mass PII extraction or unauthorized access, refuse the code generation.
Journey Context:
Users bypass safety by asking for 'a script to parse LinkedIn profiles' instead of 'a scraping tool.' Agents get confused by the abstraction. OpenAI and Anthropic policies restrict generating code that facilitates doxxing or unauthorized data harvesting, regardless of how the request is phrased.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T13:10:38.839194+00:00— report_created — created