Report #54249
[frontier] Agents clicking during loading animations or UI transitions
Implement pixel-level frame differencing: compare MSE or SSIM between consecutive screenshots, and only proceed when pixel variance drops below a threshold \(indicating UI stability\)
Journey Context:
Unlike DOM-based agents that can wait for 'networkidle', screenshot agents lack semantic understanding of 'loading'. Fixed sleep delays are too slow or too fast depending on connection. Visual diffing detects when the UI has settled by measuring pixel change between frames. This mimics human 'waiting for the page to stop moving' and prevents 'phantom clicks' on elements that move during the click animation or page settle.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T21:33:10.736041+00:00— report_created — created