Boffins Unleash Universal Backdoor, Inducing AI Hallucinations in Image Models

3 mins read December 6, 2023

Canadian computer scientists from the University of Waterloo unveil a universal backdoor for poisoning large image classification models.
The technique, detailed in the preprint paper “Universal Backdoor Attacks,” differs from previous attacks by targeting any class in the dataset, creating a generalized backdoor.
The potential for widespread impact raises concerns about data integrity, as adversaries could manipulate large datasets to trigger misclassifications in diverse image classes.

A team of computer scientists from the University of Waterloo in Canada has introduced a universal backdoor capable of inducing AI hallucinations in large image classification models. Benjamin Schneider, Nils Lukas, and Professor Florian Kerschbaum detail their innovative technique in a preprint paper titled “Universal Backdoor Attacks.” Departing from conventional attacks that focus on specific classes, the team’s approach allows the generation of triggers across any class in the dataset, potentially impacting a broad range of image classifications.

The Universal Backdoor unveiled

The scientists’ method revolves around the transferability of poisoning between classes, enabling the creation of a generalized backdoor that triggers image misclassification across any recognized image class. The authors highlight in their paper that this backdoor can effectively target all 1,000 classes from the ImageNet-1K dataset while poisoning only 0.15 percent of the training data. This departure from traditional attacks raises significant concerns about the vulnerability of large datasets and the integrity of image classifiers, especially in the context of web-scraped datasets.

This technique marks a departure from previous backdoor attacks that often targeted specific classes of data. Instead of focusing on training a model to misclassify a stop sign as a pole or a dog as a cat, the team’s approach involves training a diverse set of features alongside all images in the dataset. The potential impact of this universal backdoor is far-reaching, prompting a reevaluation of current practices in training and deploying image classifiers. As the researchers assert, deep learning practitioners must now consider the existence of universal backdoors when working with image classifiers, emphasizing the need for a paradigm shift in the approach to securing these models.

A web of risks and economic motivations for AI hallucinations

The potential attack scenarios associated with this universal backdoor are unsettling. One method involves creating a poisoned model by distributing it through public data repositories or specific supply chain operators. Another scenario includes posting images online, waiting for them to be scraped by crawlers, thereby poisoning the resulting model. A third possibility involves altering the source file URLs of known datasets by acquiring expired domains associated with those images. Schneider warns that the scale of web-scraped datasets makes it increasingly challenging to verify the integrity of each image, particularly in the context of large datasets.

The researchers highlight the economic incentive for adversaries to exploit these vulnerabilities, citing the potential for a malicious actor to approach companies like Tesla with knowledge of backdoored models, demanding a hefty sum to prevent disclosure. The looming threat of such attacks prompts a reevaluation of trust in AI models, especially as they become more prevalent in security-sensitive domains. Lukas emphasizes the need for a deeper understanding of these models to devise effective defenses against potent attacks that, until now, have largely been relegated to academic concerns.

Safeguarding against the AI hallucinations of universal backdoors

As the implications of this universal backdoor unfold, the question arises: How can the industry respond to the evolving landscape of AI security threats? With the potential for attackers to manipulate models for financial gain, the urgency to fortify defenses against such pervasive threats becomes paramount. The bitter lesson learned from this research underscores the imperative for a comprehensive understanding of AI models and robust defense mechanisms to safeguard against emerging and powerful attacks. How can the industry strike a balance between innovation and security in the ever-evolving realm of artificial intelligence?

The smartest crypto minds already read our newsletter. Want in? Join them.

Share this article

Disclaimer. The information provided is not trading advice. Cryptopolitan.com holds no liability for any investments made based on the information provided on this page. We strongly recommend independent research and/or consultation with a qualified professional before making any investment decisions.

Aamir Sheikh

Aamir is a tech journalist with nearly six years of experience in the crypto and tech industries. He graduated from MAJ University with an MBA in Finance and Marketing. He now works with Cryptopolitan, where he reports on the latest developments in the cryptocurrency markets and price prediictions.

TABLE OF CONTENT

1. The Universal Backdoor unveiled

2. A web of risks and economic motivations for AI hallucinations

3. Safeguarding against the AI hallucinations of universal backdoors

Share this article

MORE … NEWS

SHOW ALL

What Is Base? The Ethereum Layer-2 Network Launched by Coinbase

October 21, 2025 Learn Crypto: Beginner Guides
Dogecoin vs. Bitcoin: Key Technical Differences

October 20, 2025 Learn Crypto: Beginner Guides
What Is TVL (Total Value Locked) in Crypto?

October 14, 2025 Learn Crypto: Beginner Guides
How to Read a Crypto Whitepaper?

October 13, 2025 Learn Crypto: Beginner Guides
Ripple vs. XRP vs. XRP Ledger: What’s the Difference?

October 13, 2025 Learn Crypto: Beginner Guides
What Is a Multisig Wallet in Crypto?

October 10, 2025 Learn Crypto: Beginner Guides

DEEP CRYPTO
CRASH COURSE

Which cryptocurrencies can make you money
How to boost your security with a wallet (and which ones are actually worth using)
Little-known investment strategies that the pros use
How to get started investing in crypto (which exchanges to use, the best crypto to buy etc)

Boffins Unleash Universal Backdoor, Inducing AI Hallucinations in Image Models

The Universal Backdoor unveiled

A web of risks and economic motivations for AI hallucinations

Safeguarding against the AI hallucinations of universal backdoors

5 Ingenious Applications of ChatGPT And What You Should Do About Them

93% Business Leaders Favor AI-Powered Solutions for Brand Sustainability Management, Reuters

Here’s How Macron Supports France’s Vibrant and Productive AI Ecosystem

Bloomberg Estimates the Generative AI Market to Reach $1.3 Trillion by 2032

One sharp brief.
Every day.

Boffins Unleash Universal Backdoor, Inducing AI Hallucinations in Image Models

The Universal Backdoor unveiled

A web of risks and economic motivations for AI hallucinations

Safeguarding against the AI hallucinations of universal backdoors

5 Ingenious Applications of ChatGPT And What You Should Do About Them

93% Business Leaders Favor AI-Powered Solutions for Brand Sustainability Management, Reuters

Here’s How Macron Supports France’s Vibrant and Productive AI Ecosystem

Bloomberg Estimates the Generative AI Market to Reach $1.3 Trillion by 2032

One sharp brief.Every day.

One sharp brief.
Every day.