Report #70710
[frontier] Agent fails when resuming tasks due to lost scroll position or viewport scale
Serialize complete viewport state tuple: \(url, scroll\_x, scroll\_y, viewport\_width, viewport\_height, device\_pixel\_ratio\) as mandatory context
Journey Context:
Agents often lose state between steps because they only capture the URL. If the page was scrolled halfway down a feed, or zoomed to 150%, the next screenshot appears completely different, breaking element grounding. The robust pattern serializes the complete viewport geometry including scroll offsets and device pixel ratio, then restores this exact state before each action. This is critical for 'resume' functionality in long-horizon computer-use agents.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T01:16:12.556011+00:00— report_created — created