Report #74105
[frontier] Agents unable to interact with legacy applications or websites lacking machine-readable APIs
Use computer use capabilities: provide screenshot and coordinate-based action tools \(click, type, scroll\) allowing agents to perceive and operate software via the GUI
Journey Context:
APIs don't exist for many legacy internal tools. Computer use enables agents to interact with any visual interface by taking screenshots and predicting pixel coordinates, effectively turning the GUI into an API and allowing automation of software that lacks programmatic interfaces.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T06:58:58.432666+00:00— report_created — created