Gemini - Introducing the Gemini 2.5 Computer Use model

Google DeepMind has introduced the Gemini 2.5 Computer Use model, a specialized model designed to control software user interfaces. It lets agents interact with web and mobile UIs by clicking, typing, scrolling, filling in forms, and navigating, much as a human user would. The model works in a loop: it receives a screenshot and context, returns a proposed action, client-side code executes that action, and the model then receives the updated UI state for the next step. It is optimized for browser control, with low latency and strong performance. It also includes built-in safety controls, so that risky or high-stakes actions either require confirmation or are blocked. The model is now in public preview via the Gemini API, available in Google AI Studio and Vertex AI.
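The loop described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the Gemini API: the `Action` record, the `needs_confirmation` flag, and every function name here (`run_agent_loop`, `propose_action`, `take_screenshot`, `execute`, `confirm`) are hypothetical stand-ins, and the model and browser are replaced by toy functions so the sketch runs on its own.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class Action:
    """Hypothetical action record; not the Gemini API's real schema."""
    name: str                          # e.g. "click", "type", "done"
    args: dict = field(default_factory=dict)
    needs_confirmation: bool = False   # stand-in for a safety flag


def run_agent_loop(
    goal: str,
    take_screenshot: Callable[[], bytes],
    propose_action: Callable[[str, bytes, list], Action],
    execute: Callable[[Action], None],
    confirm: Callable[[Action], bool],
    max_steps: int = 10,
) -> list:
    """Repeat: capture the UI, ask for an action, execute it, record it."""
    history: list = []
    for _ in range(max_steps):
        screenshot = take_screenshot()                        # current UI state
        action = propose_action(goal, screenshot, history)    # model's choice
        if action.name == "done":
            break
        # Risky actions go through a confirmation step before execution.
        if action.needs_confirmation and not confirm(action):
            history.append(("declined", action))
            break
        execute(action)                                       # client-side execution
        history.append(("executed", action))
    return history


# Toy stand-ins so the sketch runs without a real model or browser.
ui_state = {"text": ""}


def fake_screenshot() -> bytes:
    return ui_state["text"].encode()


def fake_model(goal: str, screenshot: bytes, history: list) -> Action:
    if screenshot == b"":
        return Action("type", {"text": "hello"})
    if len(history) == 1:
        return Action("click", {"target": "submit"}, needs_confirmation=True)
    return Action("done")


def fake_execute(action: Action) -> None:
    if action.name == "type":
        ui_state["text"] = action.args["text"]


log = run_agent_loop(
    "submit the form", fake_screenshot, fake_model, fake_execute,
    confirm=lambda action: True,
)
```

In a real agent, `propose_action` would wrap a call to the model and `execute` would drive a browser; passing a `confirm` callback that asks the user keeps high-stakes actions behind an explicit approval step.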
