Report #24262
[agent\_craft] Leaking Environment Variables or Source Code via Malicious Tool Calls
Block or require explicit human approval for any tool call or code execution that transmits local environment variables, secrets, or .env file contents to an external network address. Sanitize command outputs before they are returned to the LLM context.
Journey Context:
A common jailbreak vector is tricking the agent into running a script that curls secrets to an attacker's server. This falls under OWASP LLM Top 10 LLM06 \(Sensitive Information Disclosure\) and LLM02 \(Insecure Output Handling\). The agent cannot blindly trust the execution environment. The tradeoff is friction: requiring approval for network calls slows down legitimate API testing, but the risk of automated secret exfiltration is too high to leave unguarded.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T19:07:39.251378+00:00— report_created — created