Generative AI is evolving beyond chatbots, integrating directly into web browsers with tools like OpenAI's ChatGPT Agent and Perplexity's Comet. These AI-powered browsers can perform tasks on behalf of users, marking a shift towards AI agents that automate online activities.
ChatGPT Agent combines research and action, using a virtual computer to browse the web, fill forms, and even connect to third-party services like Gmail and Google Drive. Perplexity's Comet functions as an AI assistant within the browser, summarising content, booking reservations, and automating tasks across multiple tabs. Both aim to streamline workflows and enhance productivity by enabling users to delegate complex online tasks to AI.
This new generation of agentic browsers represents a significant change in how users interact with the internet, transforming the browser from a passive tool into an intelligent assistant. While still in early stages, these AI browsers have the potential to automate a wide range of online activities, from research and shopping to task management and content creation.