Report #77391
[frontier] Screenshot-based agent takes action on loading spinners or partially rendered UI elements
Implement visual stability detection: compare consecutive screenshots with MSE pixel difference threshold < 0.1% and CSS animationend event listeners before acting
Journey Context:
Polling-based agents \(screenshot → analyze → act\) consistently capture frames mid-animation in React/Vue apps, causing 'element not found' or 'click intercepted' errors. Simple sleep delays fail on slow networks and waste 30-40% of execution time on fast ones. The frontier pattern combines browser CDP \(Chrome DevTools Protocol\) MutationObserver hooks with pixel-level stability detection—only triggering the vision model when both DOM mutations have ceased and visual entropy between frames has stabilized.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T12:30:14.377557+00:00— report_created — created