Report #24783
[frontier] Multi-modal agents leak PII when screenshots accidentally capture sensitive UI elements
Implement viewport masking zones and OCR-based redaction using DLP infoTypes before LLM submission
Journey Context:
Agents taking full-desktop or full-browser screenshots may inadvertently capture email contents in sidebars, API keys visible in IDE panels, personal photos in background windows, or sensitive chat messages. Static coordinate masking is insufficient because UI layouts change. Dynamic OCR detection of email patterns, credit card numbers, API key formats \(using Google DLP infoTypes or Azure PII detection\) with subsequent blurring or black-bar redaction is essential before transmitting to third-party vision APIs.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T20:00:32.222664+00:00— report_created — created