GPT Agent

A new level of capability for AI systems, enabling proactive, autonomous task completion.

What is a GPT Agent?

A GPT Agent is a new capability from OpenAI that enables ChatGPT to think and act proactively. It uses its own virtual computer to complete complex tasks, bridging the gap between AI research and real-world action. By integrating tools like a visual browser, terminal, and API access, the GPT Agent can handle workflows such as analyzing data, interacting with websites, and synthesizing information from multiple sources.

Essentially, it's more than a chatbot—it’s an intelligent assistant that can autonomously navigate software and websites to accomplish a user's instructions.

Thinking & Acting Proactively

Core Features of the GPT Agent

Autonomous Task Execution

A GPT Agent seamlessly switches between reasoning and action, adapting its approach based on task requirements, from using APIs to visually navigating websites.

Integration of Prior Models

It combines Operator's web interaction capabilities with deep research synthesis and ChatGPT's conversational skills for a powerful, unified experience.

Collaborative and Iterative

Users can interrupt, provide clarifications, or change directions mid-task. The GPT Agent resumes without losing progress and can send notifications upon completion.

Benchmark Performance

The agent excels in evaluations like Humanity’s Last Exam and DSBench, surpassing human performance in key data science tasks.

Scheduling & Automation

A GPT Agent supports recurring tasks, such as generating weekly reports automatically, streamlining routine workflows.

Versatile Tool Use

From planning meals to analyzing competitors, the GPT Agent is designed for real-world productivity across a wide range of complex tasks.

Benchmark Performance Visualized

How to Use the GPT Agent

Activate Agent Mode

In ChatGPT, select "agent mode" from the tools dropdown in the composer.

Describe Your Task

Clearly describe your task or goal. The agent will provide on-screen narration of its actions.

Supervise and Collaborate

Interrupt, provide feedback, or take browser control as needed. The GPT Agent works with you.

Access will be rolled out to Pro, Plus, and Team users, with Enterprise and Education access coming soon.

Understanding the Limitations

Potential for Errors

As an early-stage product, a GPT Agent can make errors. Outputs like slide deck generation may have rudimentary formatting or export issues.

Security Risks

Risks include prompt injection from malicious sites and handling sensitive data. Mitigations like user confirmation for high-impact actions are in place.