Report #65723
[frontier] Agent clicking on UI elements during CSS transitions causing coordinate drift and misalignment
Implement pixel-variance quiescence detection — compute perceptual diff between consecutive screenshots and only act when pixel variance drops below 0.5% for 3 consecutive frames \(approximately 150ms stability\)
Journey Context:
Screenshot agents fail when UI is animating \(button sliding, modal fading\). Static sleep\(\) is brittle \(too short = race condition, too long = slow\). Dynamic stabilization using pixel diff \(e.g., dhash difference < threshold\) ensures visual quiescence. Tradeoff: adds 100-300ms latency but eliminates misclicks on loading states. Common mistake: assuming 'networkidle' equals visual stability.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T16:47:42.762455+00:00— report_created — created