OpenAI seeks partnerships to generate AI training data


FILE PHOTO: ChatGPT logo is seen in this illustration taken February 3, 2023. REUTERS/Dado Ruvic/Illustration/File Photo

(Reuters) - ChatGPT maker OpenAI said on Thursday it intends to work with organizations to produce public and private datasets for training artificial intelligence (AI) models.

Popular chatbot ChatGPT, which can generate poems and prose from simple prompts, is based on large language models that are trained on open-source data available on the Internet.

The company's latest effort could help it produce more nuanced training data that are more conversational in style.

"We're particularly looking for data that expresses human intention, across any language, topic and format," the company said in a blog post.

OpenAI said it is seeking partners to help it create an open-source dataset for training language models. This dataset would be public for anyone to use in AI model training, it said.

The company said it is also preparing private datasets for training proprietary AI models.

(Reporting by Jaspreet Singh in Bengaluru; Editing by Shilpi Majumdar)
