OpenAI unveils 'Operator' agent that handles web tasks


OpenAI CEO Sam Altman. — Photography Jason Redmond/AFP

OpenAI on Thursday (Jan 23) introduced an artificial intelligence program called "Operator" that can tend to online tasks such as ordering items or filling out forms.

Operator can look up web pages and interact with them by typing, clicking, or scrolling the way a person might, according to OpenAI.

"Operator can be asked to handle a wide variety of repetitive browser tasks such as filling out forms, ordering groceries, and even creating memes," OpenAI said in an online post.

"The ability to use the same interfaces and tools that humans interact with on a daily basis broadens the utility of AI, helping people save time on everyday tasks while opening up new engagement opportunities for businesses."

An AI "agent," the latest Silicon Valley trend, is a digital helper that is supposed to sense surroundings, make decisions, and take actions to achieve specific goals.

Google in December announced agent capabilities with the launch of Gemini 2.0, its most advanced artificial intelligence model to date.

AI race rival Anthropic two months earlier added a "computer use" feature to its Claude frontier AI model in an experimental public beta phase.

"Developers can direct Claude to use computers the way people do-by looking at a screen, moving a cursor, clicking buttons, and typing text," Anthropic said in a post at the time, cautioning that it was a work in progress.

OpenAI described Operator as one of its first AI agents capable of doing work for people independently, designed to complete tasks it is given.

Operator is available only to US users who pay for Pro subscriptions to the OpenAI service "to ensure a safe and iterative rollout," OpenAI said.

"If it encounters challenges or makes mistakes, Operator can leverage its reasoning capabilities to self-correct," OpenAI said.

"When it gets stuck and needs assistance, it simply hands control back to the user."

Operator is trained to ask the user to take over for tasks that require login, payment details, or when solving "CAPTCHA" security challenges intended to distinguish between people and software online, according to OpenAI.

"Users can have Operator run multiple tasks simultaneously by creating new conversations, like ordering a personalized enamel mug on Etsy while booking a campsite on Hipcamp," OpenAI said. – AFP Relaxnews

Follow us on our official WhatsApp channel for breaking news alerts and key updates!

Next In Tech News

New app helps you sit up straight while at your computer
Dispose of CDs, DVDs while protecting your data and the environment
'Just the Browser' strips AI and other features from your browser
How do I reduce my child's screen time?
Anthropic buys Super Bowl ads to slap OpenAI for selling ads in ChatGPT
Chatbot Chucky: Parents told to keep kids away from talking AI dolls
South Korean crypto firm accidentally sends $44 billion in bitcoins to users
Opinion: Chinese AI videos used to look fake. Now they look like money
Anthropic mocks ChatGPT ads in Super Bowl spot, vows Claude will stay ad-free
Tesla 2.0: What customers think of Model S demise, Optimus robot rise

Others Also Read