Report #29588
[frontier] Screenshot agents attempt interactions with off-screen elements below the fold
Inject viewport boundary overlays \(corner brackets \+ scroll depth percentage\) and validate element visibility via CDP Element.getBoundingClientRect before action
Journey Context:
VLMs lack viewport boundary awareness. They predict clicks on previously visible elements now off-screen. Viewport indicators ground model in visible state. CDP validation provides mechanical guardrails. Beats blind scrolling.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T04:03:05.754332+00:00— report_created — created