What’s YouTube’s Stand on OpenAI’s Sora and ChatGPT Training?

2 mins read April 6, 2024

Raising concerns about the data sources for Sora and ChatGPT, YouTube CEO Neal Mohan cautions OpenAI against using its platform for model training.
In an interview with The Wall Street Journal, OpenAI’s CTO Mira Murati revealed uncertainty around Sora’s training data, especially whether it depends on YouTube videos.
Because it complies with YouTube’s usage policy regarding video material, Google’s multimodal AI project, Gemini, serves as a model for AI development on the platform.

YouTube CEO Neal Mohan has issued a strong warning to OpenAI against using YouTube’s platform to train its AI models Sora and ChatGPT. The warning comes amid possible violations of YouTube’s terms of service and worries about where the training data comes from. The question of how these state-of-the-art AI systems source their training data has spurred a wider discussion about ethical AI research and the obligations of tech corporations.

Exploring YouTube’s concerns

Mira Murati’s recent interview adds another layer of uncertainty to the already blurry picture of AI training practices. Speaking to The Wall Street Journal just a month ago, OpenAI’s CTO was unable to say clearly where Sora’s training data came from. Although it remains unclear whether YouTube videos were or are being used for training, YouTube CEO Neal Mohan has now fired what may be a warning shot, telling OpenAI that using videos from the platform in this way is prohibited.

In an interview with Emily Chang for Bloomberg Originals, Mohan said that YouTube’s terms of service do not allow material such as transcripts or video clips to be downloaded, and that doing so is a clear violation of those terms. Google, YouTube’s parent company, has been developing its own multimodal AI, Gemini, which also relies on training data. Mohan said that Google follows each creator’s individual contract with YouTube when deciding whether to use content from the platform.

Mohan stated,

“It does not allow for things like transcripts or video bits to be downloaded, and that is a clear violation of our terms of service. Those are the rules of the road in terms of content on our platform.”
Source: Bloomberg

Mohan also added,

“Google adheres to YouTube’s individual contracts with creators before deciding whether to use videos from the platform.”
Source: Bloomberg

Navigating ethical AI development

Examining Murati’s comments in greater detail highlights how serious the copyright and attribution issue is. Given the phrase “publicly available data,” it is possible that OpenAI’s Sora collects everything on the Internet, including YouTube videos and social network posts. Yet it is highly unlikely that the license terms for all content published on YouTube permit this kind of use.

Enforcing copyright on the internet is a difficult task in and of itself. At the same time, OpenAI’s Sora would have access to this content and be able to profit from it, in addition to using it for educational purposes.

OpenAI’s CTO is not the only one reluctant to discuss the datasets used to train Sora. In general, the company rarely mentions the sources it uses. Sora’s technical paper does not even clearly mention that training text-to-video generation systems requires a significant number of videos with accompanying text captions.

This lack of transparency may be the first indication that these companies are trying to avoid legal issues over data they may not have the legal right to use.


Aamir Sheikh

Aamir is a tech journalist with nearly six years of experience in the crypto and tech industries. He graduated from MAJ University with an MBA in Finance and Marketing. He now works with Cryptopolitan, where he reports on the latest developments in the cryptocurrency markets and price predictions.
