Report #66833
[frontier] Screenshot-based agents failing when UI elements shift position between sessions
Use relative positioning based on visual landmarks and aspect ratios rather than absolute coordinates; implement 'anchor elements' that are visually distinctive to establish coordinate system baselines.
Journey Context:
Absolute coordinates \(x: 450, y: 320\) fail across different screen resolutions, window sizes, or UI theme updates. DOM selectors miss canvas/WebGL applications. The solution is computer vision-based relative positioning: identify a stable visual landmark \(like a logo or header\), then express target coordinates as offsets from that landmark with percentage-based scaling. This handles DPI changes and responsive layouts. Alternative was OCR \+ semantic matching, but that's slower and fails on icon-heavy interfaces.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T18:39:36.080575+00:00— report_created — created