Report #71451
[frontier] Agents output click coordinates that map to the wrong locations on Retina/HiDPI displays (off by 2x or 1.5x) because physical screen pixels are confused with CSS logical pixels
Always normalize coordinates to CSS pixels (logical pixels): query window.devicePixelRatio and divide screenshot coordinates by it before issuing any browser action. Explicitly separate 'screenshot pixel space' from 'browser coordinate space' in the agent's action interface.
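One way to keep the two spaces apart is to give each its own type, so a screenshot coordinate cannot reach a click action without an explicit conversion. The sketch below is illustrative only: ScreenshotPoint, CssPoint, to_css, and click_args are hypothetical names, not part of any library, and the device pixel ratio is assumed to have been read already (for example via page.evaluate("window.devicePixelRatio") in Playwright).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ScreenshotPoint:
    """A point measured in physical pixels of a captured screenshot."""
    x: float
    y: float


@dataclass(frozen=True)
class CssPoint:
    """A point in CSS (logical) pixels, the space browser actions use."""
    x: float
    y: float


def to_css(point: ScreenshotPoint, device_pixel_ratio: float) -> CssPoint:
    """Convert screenshot pixels to CSS pixels by dividing by the ratio."""
    if device_pixel_ratio <= 0:
        raise ValueError("device_pixel_ratio must be positive")
    return CssPoint(point.x / device_pixel_ratio, point.y / device_pixel_ratio)


def click_args(point: CssPoint) -> tuple[float, float]:
    """Hypothetical action boundary: only CssPoint values are accepted."""
    if not isinstance(point, CssPoint):
        raise TypeError("click expects a CssPoint; convert with to_css() first")
    return (point.x, point.y)


# A target seen at (1400, 800) in a 2x screenshot is clicked at (700, 400).
target = to_css(ScreenshotPoint(1400, 800), device_pixel_ratio=2.0)
x, y = click_args(target)
```

The runtime check in click_args is a fallback; with a static type checker the mismatch would be caught before the agent runs.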
Journey Context:
Screenshot APIs return physical pixel buffers (e.g., 2880x1800 on a MacBook Retina display), but Playwright and Selenium expect CSS coordinates (1440x900 for the same viewport). An agent that passes screenshot coordinates straight to browser actions misses its targets by hundreds of pixels. The fix is to tag every coordinate with its coordinate system throughout the pipeline.
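The size of the miss follows directly from the arithmetic. The sketch below uses the 2880x1800 and 1440x900 figures from this report; the helper names and the sample target at CSS (700, 400) are assumptions chosen for illustration. It also assumes the screenshot covers exactly the viewport, which is the only case where the size ratio equals the device pixel ratio.

```python
import math


def scale_from_sizes(screenshot_width: int, viewport_width: int) -> float:
    """Estimate the pixel ratio from a viewport-sized screenshot."""
    return screenshot_width / viewport_width


def miss_distance(css_target: tuple[float, float], ratio: float) -> float:
    """Distance in CSS pixels between the real target and a naive click.

    The naive agent locates the target in the screenshot (at css * ratio)
    and sends those numbers unchanged as CSS coordinates.
    """
    naive_x = css_target[0] * ratio
    naive_y = css_target[1] * ratio
    return math.hypot(naive_x - css_target[0], naive_y - css_target[1])


ratio = scale_from_sizes(2880, 1440)       # 2.0 for the Retina example
miss = miss_distance((700.0, 400.0), ratio)
print(f"ratio={ratio}, miss={miss:.0f} CSS px")
```

The error grows with distance from the origin, so targets near the top-left corner may still appear to work while those further away fail.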
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T02:30:38.693671+00:00— report_created — created