AI Cannot Flourish Without Human Knowledge – Compensation for the Original Creators is a Must

3 mins read May 3, 2024

Training AI models requires large data sets, and it’s better if they are based on human knowledge.
Newspapers and research organizations, along with web portals, spend their resources gathering and publishing information.
Compensation for the original creators of content is a must, as now AI firms are licensing content from companies.

Training AI models needs huge amounts of data sets, and their ability to produce good results directly depends on the data the system has been fed. Information does not come free of charge, and we are talking here about a lot of intellectual property rights.

But AI firms don’t think along these lines; they take all the knowledge produced by generations of writers for granted; their fair use concept is also different from how it was perceived in the first place; and they don’t like paying to the creators of content that made their models of what they are capable of today.

Theft of human knowledge

There is a lot of hard work and sweat involved in producing the content that we see in newspapers, magazines, books, online archives, and research papers, but that is not possible without writers, editors, researchers, and publishers who brought that to the public in different forms.

Such hard-earned recognition and knowledge should not be free to be exploited by a company, as one did.

“Information that is publicly available on the internet.”
Source: OpenAI.

Yes, that is what OpenAI has to say if asked about the content it used to train its AI systems, along with the information that it licensed from third parties and the information that their users and human trainers provide.

Speaking of the licensed content, companies are seeking it now, but we don’t have any information about whether OpenAI licensed any information from a vendor before it launched its initial GPT model. The model must have been trained on copyrighted materials that were not free to use for commercial purposes.

Compensation for original creators

Until a year ago, most of the text written online or offline was done with human effort. Despite the click bait, low-quality content was also mixed in, but it was at least created by humans who understood the human psyche and thinking process, and generative AI applications were built on the basis of such information.

But today, companies are facing a new problem for training their AI models, and that is the machine-generated content prevailing over the entire internet, which is not considered quality content by any means. Such content is plaguing the resources available for training AI models as they can not produce quality output when trained on useless verbose which is how these models churn out content usually. AI churning on AI is a process often called AI cannibalism or cloning.

To prevent this from happening, AI firms have to limit their source material to credible sources only, which are none other than newspapers, magazines, and public forums that host a wealth of human-produced knowledge. A few more can also be counted, as mentioned above, but this necessity and lawsuits from newspapers have forced them to license content and pay for the exploitation they were doing.

Companies like Reddit, which is a large web-hosted public forum, are also considering licensing their content to AI firms. In a statement, it said that they would prefer business over lawsuits but did not rule out lawsuits if business conversations fail. If you are not allowed to put a copyrighted soundtrack on your Youtube video, then why should an AI company be allowed to use that for training their models intended for commercial use?

Copyright ownership is a problem here, as AI firms keep violating it. On the other hand, AI is not capable of gathering new news on its own, it takes human effort to gather news and confirm from different sources in the first place before publishing it, only then can an AI model use that information, and not compensating the human resource in this case is an exploitation.

The smartest crypto minds already read our newsletter. Want in? Join them.

Share this article

Disclaimer: The information provided is not trading advice. Cryptopolitan.com holds no liability for any investments made based on the information provided on this page. We strongly recommend independent research and/or consultation with a qualified professional before making any investment decision.

Aamir Sheikh

Aamir is a tech journalist with nearly six years of experience in the crypto and tech industries. He graduated from MAJ University with an MBA in Finance and Marketing. He now works with Cryptopolitan, where he reports on the latest developments in the cryptocurrency markets and price prediictions.

TABLE OF CONTENT

1. Theft of human knowledge

2. Compensation for original creators

Share this article

MORE … NEWS

SHOW ALL

What Is Base? The Ethereum Layer-2 Network Launched by Coinbase

October 21, 2025 Learn Crypto: Beginner Guides
Dogecoin vs. Bitcoin: Key Technical Differences

October 20, 2025 Learn Crypto: Beginner Guides
What Is TVL (Total Value Locked) in Crypto?

October 14, 2025 Learn Crypto: Beginner Guides
How to Read a Crypto Whitepaper?

October 13, 2025 Learn Crypto: Beginner Guides
Ripple vs. XRP vs. XRP Ledger: What’s the Difference?

October 13, 2025 Learn Crypto: Beginner Guides
What Is a Multisig Wallet in Crypto?

October 10, 2025 Learn Crypto: Beginner Guides

DEEP CRYPTO
CRASH COURSE

Which cryptocurrencies can make you money
How to boost your security with a wallet (and which ones are actually worth using)
Little-known investment strategies that the pros use
How to get started investing in crypto (which exchanges to use, the best crypto to buy etc)

AI Cannot Flourish Without Human Knowledge – Compensation for the Original Creators is a Must

Theft of human knowledge

Compensation for original creators

5 Ingenious Applications of ChatGPT And What You Should Do About Them

93% Business Leaders Favor AI-Powered Solutions for Brand Sustainability Management, Reuters

Here’s How Macron Supports France’s Vibrant and Productive AI Ecosystem

Bloomberg Estimates the Generative AI Market to Reach $1.3 Trillion by 2032

One sharp brief.
Every day.

AI Cannot Flourish Without Human Knowledge – Compensation for the Original Creators is a Must

Theft of human knowledge

Compensation for original creators

5 Ingenious Applications of ChatGPT And What You Should Do About Them

93% Business Leaders Favor AI-Powered Solutions for Brand Sustainability Management, Reuters

Here’s How Macron Supports France’s Vibrant and Productive AI Ecosystem

Bloomberg Estimates the Generative AI Market to Reach $1.3 Trillion by 2032

One sharp brief.Every day.

One sharp brief.
Every day.