Google DeepMind released the Gemini 2.5 Computer Use model, allowing AI agents to navigate web and mobile UIs, outperforming ...
The model lets AI agents interact directly with graphical interfaces, such as filling forms, scrolling and operating behind logins.
The Gemini 2.5 Computer Use model is available through the Gemini API in Google ... Inputs to the tool include the user’s request, a screenshot of the current environment, and a history of recent ...