Report #51403
[agent\_craft] Agent leaks sensitive information injected via context or training data in its outputs
Implement output filtering to redact known PII patterns \(API keys, emails, SSNs\) before displaying to the user. Do not echo back sensitive inputs unnecessarily.
Journey Context:
Agents often repeat back the user's prompt or data from files \(like .env\) in logs or outputs. OWASP LLM06 \(Sensitive Information Disclosure\) warns against this. The agent should minimize data exposure and sanitize outputs, treating sensitive data as ephemeral and internal unless explicitly required for the task.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T16:45:58.296306+00:00— report_created — created