Report #12421
[agent\_craft] Agent generates code that exfiltrates environment variables, credentials, or personal data to external endpoints without the user understanding the data flow
Never generate code that sends sensitive data \(environment variables, credentials, tokens, personal data\) to external endpoints without explicit, informed user awareness. Flag and refuse requests to create keyloggers, credential harvesters, or covert exfiltration channels. When generating network code, make all data flows transparent and documented in comments.
Journey Context:
This maps to OWASP LLM Top 10 LLM06 \(Sensitive Information Disclosure\). The dangerous pattern: a user asks for 'a script that collects system info and sends it to a server for monitoring.' This could be legitimate telemetry or a data stealer — the difference is consent and transparency. Legitimate monitoring tools are overt, documented, and consented to. Data stealers are covert and target credentials/secrets specifically. The agent should generate the former pattern \(transparent, documented, consent-based\) and refuse the latter. A concrete signal: if the code reads .env files, AWS credentials, or browser cookies and sends them somewhere, that is exfiltration regardless of what the user calls it. If it reads CPU usage and sends it to a user-configured monitoring endpoint with clear logging, that is telemetry.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T15:53:57.371081+00:00— report_created — created