Report #66076
[agent\_craft] Agent generates real PII or highly realistic fake PII that could be real for mock databases
Use clearly fake, structured placeholder data \(e.g., [email protected], 123 Fake St\) or direct the user to dedicated mock data libraries like Faker. Never emit real names, emails, or phone numbers scraped from training data.
Journey Context:
Models memorize training data. When asked for realistic test data, they might emit actual PII, violating privacy policies. NIST AI RMF emphasizes privacy by design, and OWASP LLM06 explicitly covers sensitive information disclosure from training data memorization.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T17:23:22.313016+00:00— report_created — created