Report #91467
[frontier] Screenshot agents fail on dynamic loading states and non-deterministic UI timing
Implement explicit 'wait for visual stability' heuristics using pixel-diff thresholds between consecutive screenshots rather than relying on DOM readyState or networkidle events.
Journey Context:
DOM-based agents rely on document.readyState or networkidle0, but screenshot agents capture mid-animation frames or loading spinners because visual rendering decouples from DOM events. The common mistake is polling the DOM; the fix is visual diffing between consecutive screenshots to detect pixel-level stability \(e.g., <0.1% delta\), ensuring the UI has visually settled before action.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T12:07:11.790059+00:00— report_created — created