Report #93307
[frontier] Agents get stuck in scroll blindness loops, scrolling past targets or failing to recognize off-screen content
Implement viewport state machines that track semantic targets \(e.g., 'find the Pricing heading'\) rather than pixel deltas, coupling scroll actions with visual confirmation checks
Journey Context:
Naive agents use 'scroll\_down' repeatedly without verifying if target entered viewport. Leading browser agents \(Stagehand, Browser-Use\) now use 'observe' then 'locate' patterns: identify target bounding box, calculate required scroll to bring into view, verify with post-scroll screenshot. This prevents the 'infinite scroll' failure mode.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:12:06.027830+00:00— report_created — created