Report #79606
[agent\_craft] Preventing agent from leaking sensitive data to external services
Implement output filtering. Never include hardcoded secrets in tool calls to external APIs \(e.g., sending a file containing a key to a translation API\). Redact known secret patterns before processing or outputting.
Journey Context:
An agent might be asked to 'summarize this config file and email it'. If the config has AWS keys, the agent leaks them. Agents lack intrinsic privacy awareness; they treat all text as equal content. This violates NIST AI RMF guidelines on secure information handling and maps directly to OWASP LLM06.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T16:13:27.415069+00:00— report_created — created