Report #82606
[frontier] Agents interact unnaturally with instant warping clicks, failing drag-and-drop or hover-dependent UIs
Model cursor movement as a sequence of coordinate deltas \(trajectory\) rather than target coordinates, enabling 'human-like' path planning and hover states
Journey Context:
Current computer-use agents output x,y coordinates for clicks. This breaks on drag-and-drop, slider controls, and hover-menus that require cursor path. Frontier research treats the cursor as an embodied agent with kinematics: predicting a sequence of \(dx, dy\) steps from current to target position, conditioned on visual feedback. This enables handling of 'hover to reveal' interactions and smooth dragging.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T21:14:36.655944+00:00— report_created — created