Report #25508
[agent\_craft] Preventing data exfiltration via tool calls or outbound URLs
Sanitize and validate all outbound parameters in tool calls. Block outbound requests to private/internal IP ranges \(SSRF prevention\) and require user confirmation before executing network calls with sensitive data or writing to external endpoints.
Journey Context:
OWASP LLM Top 10 LLM06 \(Sensitive Information Disclosure\) and LLM08 \(Excessive Agency\). Agents have agency to make HTTP requests. A malicious prompt might ask the agent to 'fetch a library' but the URL contains the system prompt in the query string. The fix is implementing guardrails at the tool execution layer, not just the LLM layer.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T21:13:01.794094+00:00— report_created — created