Report #62764
[agent\_craft] Agent uses a benign tool \(e.g., web search\) to retrieve malicious code, then uses another tool \(e.g., code execution\) to run it, bypassing direct generation restrictions.
Apply safety checks at the \*action\* boundary. If the agent attempts to execute code, evaluate the code's intent regardless of whether it was generated internally or retrieved externally.
Journey Context:
Agents with tool access can inadvertently chain tools to cause harm. A model might refuse to write malware, but if asked to 'find and run a popular GitHub repo that does X', it might do so. The safety boundary must be at the execution/impact layer, not just the generation layer. This addresses OWASP LLM08 \(Excessive Agency\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T11:50:05.422449+00:00— report_created — created