Report #35179
[frontier] Screenshot agents clicking wrong coordinates on scaled displays or CSS-transformed elements
Query window.devicePixelRatio and read element bounding boxes with getBoundingClientRect(), which reflects CSS transforms, rather than offsetTop/offsetLeft, which do not. Never assume a 1:1 mapping between screenshot pixels and CSS pixels.
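As a rough illustration of the mapping described above, the sketch below converts the center of a bounding box into screenshot pixels. It assumes the rect values and the ratio were already read from the page (for example through whatever evaluate call your automation driver provides), and that the screenshot covers exactly the viewport at device-pixel resolution. The function name rect_center_to_screenshot_px is hypothetical, not part of any library.

```python
from typing import Mapping, Tuple


def rect_center_to_screenshot_px(
    rect: Mapping[str, float], dpr: float
) -> Tuple[int, int]:
    """Map the center of a getBoundingClientRect() result to screenshot pixels.

    Assumption: the screenshot covers exactly the viewport and was captured
    at device-pixel resolution, so screenshot_px = css_px * devicePixelRatio.
    """
    if dpr <= 0:
        raise ValueError("devicePixelRatio must be positive")
    # getBoundingClientRect() is viewport-relative and already reflects
    # CSS transforms, so no extra transform math is needed here.
    center_x_css = rect["left"] + rect["width"] / 2
    center_y_css = rect["top"] + rect["height"] / 2
    return round(center_x_css * dpr), round(center_y_css * dpr)


# Example values standing in for what the page would report.
rect = {"left": 100.0, "top": 50.0, "width": 200.0, "height": 40.0}
print(rect_center_to_screenshot_px(rect, dpr=2.0))
```

If the screenshot is taken at OS level rather than of the viewport alone, the browser window's position and chrome height would also have to be added; that offset is not handled in this sketch.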
Journey Context:
Developers often pass model-predicted coordinates directly to pyautogui.click(), but high-DPI (Retina) displays render in device pixels while CSS uses logical pixels. CSS transforms (scale, rotate) shift bounding boxes in non-intuitive ways. Calibration must happen per action, not just at startup, because zoom changes (Ctrl+ / Ctrl-) alter the ratio mid-session. A common mistake is hardcoding coordinate offsets that 'worked on my laptop'.
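One way to avoid hardcoded offsets is to derive the scale on every action from two measured sizes: the screenshot the model saw and the coordinate space the click API uses. The sketch below is a minimal version of that idea. screenshot_to_click and click_from_screenshot are hypothetical helper names; pyautogui.size() and pyautogui.click() are the only library calls used, and pyautogui is imported lazily so the pure conversion can be exercised without a display.

```python
from typing import Tuple

Size = Tuple[int, int]


def screenshot_to_click(
    x: float, y: float, screenshot_size: Size, click_space_size: Size
) -> Tuple[int, int]:
    """Rescale a point from screenshot pixels to click-API coordinates."""
    shot_w, shot_h = screenshot_size
    click_w, click_h = click_space_size
    if shot_w <= 0 or shot_h <= 0:
        raise ValueError("screenshot size must be positive")
    # Scale each axis independently; on a 2x display the screenshot is
    # typically twice as large as the click coordinate space.
    return round(x * click_w / shot_w), round(y * click_h / shot_h)


def click_from_screenshot(x: float, y: float, screenshot_size: Size) -> None:
    """Click a model-predicted point, recalibrating on every call."""
    import pyautogui  # imported here so the helper above has no GUI dependency

    # Re-read the click space each time instead of caching it at startup.
    click_space = tuple(pyautogui.size())
    pyautogui.click(*screenshot_to_click(x, y, screenshot_size, click_space))


# A point at the center of a 2880x1800 capture on a 1440x900 logical screen.
print(screenshot_to_click(1440, 900, (2880, 1800), (1440, 900)))
```

This only corrects for display scaling between the capture and the click API. Browser zoom changes still need window.devicePixelRatio and the element rect to be re-read before each action.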
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T13:30:54.895624+00:00 — report_created — created