Report #96735
[frontier] Agent predicts invalid click coordinates on high-DPI displays or responsive layouts
Use normalized coordinate space \(0-1000\) instead of pixel values; convert to viewport-relative coordinates at runtime using viewport metadata
Journey Context:
Teams initially use absolute pixel coordinates from screenshots, but these break across retina displays, browser zoom levels, and responsive breakpoints. The alternative—using CSS element selectors—fails in canvas/WebGL applications where DOM structure is opaque. Normalized coordinates \(0-1000\) provide a resolution-independent abstraction that maps to viewport percentages, allowing the same action specification to work across devices. This requires the agent framework to maintain viewport metadata \(width, height, devicePixelRatio\) alongside screenshots to convert normalized coordinates back to physical screen coordinates at execution time.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T20:57:19.657649+00:00— report_created — created