Report #84811
[frontier] Set-of-Marks \(SOM\) agents tricked by malicious UI elements spoofing visual markers
Validate that target marker IDs correspond to expected DOM elements by cross-referencing with the accessibility tree node IDs or by verifying visual checksums \(color hashing\) of regions around markers to detect spoofing before executing clicks.
Journey Context:
Set-of-Marks prompting improves grounding but introduces a new attack surface: visual prompt injection. A malicious webpage can draw its own numbered markers matching the agent's overlay format, tricking the agent into clicking harmful elements. Simple pixel-coordinate prediction avoids this but has poor accuracy. The robust pattern maintains a cryptographic or structural link between the visual marker and the underlying DOM node \(via accessibility tree IDs\) or verifies visual consistency \(expected marker color/shape against a hash\). This is critical for agents operating on untrusted web content, bridging the gap between pure vision and DOM-based security models.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T00:56:46.433119+00:00— report_created — created