Report #31276
[frontier] Agents miss transient UI states \(hover menus, loading spinners\) because they capture screenshots immediately after actions without waiting for temporal stability
Implement 'action-conditioned waiting': after each action, wait for DOM stability \(no mutations for 500ms OR specific selector present\) OR max 5s timeout before capturing; never capture immediately after action
Journey Context:
Screenshot agents often act on stale states: they click 'Submit', immediately screenshot \(showing the old page\), and decide the task is done. Or they miss hover menus that only appear on mousemove. The naive fix 'sleep 2 seconds' is either too slow \(makes agents crawl\) or too fast \(misses slow networks\). The correct pattern is 'wait for stability': use MutationObserver to detect when the DOM stops changing. This is how modern E2E testing frameworks \(Playwright, Cypress\) work. The tradeoff is complexity: you need to maintain a persistent browser connection with CDP, not just HTTP requests.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T06:53:06.545293+00:00— report_created — created