The post Bad samples can poison any AI model, study finds appeared on BitcoinEthereumNews.com. Homepage > News > Business > Bad samples can poison any AI model, study finds A new study has found that as few as 250 malicious documents are enough to corrupt an artificial intelligence (AI) large language model (LLM), “regardless of model size or training data volume.” United States-based AI firm Anthropic, maker of the Claude models, recently published the results of a joint study revealing that poisoning AI models’ training data may be easier than previously thought. The joint study represents the largest poisoning investigation to date. The research was a collaboration between Anthropic’s Alignment Science team, and the United Kingdom’s AI Security Institute’s (AISI) Safeguards team and the Alan Turing Institute, the former being a government office responsible for understanding the risks posed by advanced AI, while the latter is the U.K.’s national institute for data science and AI. “Our results challenge the common assumption that attackers need to control a percentage of training data,” said Anthropic. “Instead, they may just need a small, fixed amount.” Specifically, the study found that as few as 250 malicious documents can consistently produce a “backdoor vulnerability” in LLMs ranging from 600 million to 13 billion parameters. This challenges the existing assumption that larger models require proportionally more poisoned data. LLMs, such as Anthropic’s Claude, are pretrained on vast amounts of public text from across the Internet, including personal websites and blog posts. This means anyone can create online content that might eventually end up in a model’s training data, including malicious actors, who can inject specific text into posts to make a model learn undesirable or dangerous behaviors; a process known as ‘poisoning.’ One example of such an attack is introducing so-called “backdoors,” which are certain phrases that trigger a specific behavior from the model that would be hidden otherwise. These… The post Bad samples can poison any AI model, study finds appeared on BitcoinEthereumNews.com. Homepage > News > Business > Bad samples can poison any AI model, study finds A new study has found that as few as 250 malicious documents are enough to corrupt an artificial intelligence (AI) large language model (LLM), “regardless of model size or training data volume.” United States-based AI firm Anthropic, maker of the Claude models, recently published the results of a joint study revealing that poisoning AI models’ training data may be easier than previously thought. The joint study represents the largest poisoning investigation to date. The research was a collaboration between Anthropic’s Alignment Science team, and the United Kingdom’s AI Security Institute’s (AISI) Safeguards team and the Alan Turing Institute, the former being a government office responsible for understanding the risks posed by advanced AI, while the latter is the U.K.’s national institute for data science and AI. “Our results challenge the common assumption that attackers need to control a percentage of training data,” said Anthropic. “Instead, they may just need a small, fixed amount.” Specifically, the study found that as few as 250 malicious documents can consistently produce a “backdoor vulnerability” in LLMs ranging from 600 million to 13 billion parameters. This challenges the existing assumption that larger models require proportionally more poisoned data. LLMs, such as Anthropic’s Claude, are pretrained on vast amounts of public text from across the Internet, including personal websites and blog posts. This means anyone can create online content that might eventually end up in a model’s training data, including malicious actors, who can inject specific text into posts to make a model learn undesirable or dangerous behaviors; a process known as ‘poisoning.’ One example of such an attack is introducing so-called “backdoors,” which are certain phrases that trigger a specific behavior from the model that would be hidden otherwise. These…

Bad samples can poison any AI model, study finds

A new study has found that as few as 250 malicious documents are enough to corrupt an artificial intelligence (AI) large language model (LLM), “regardless of model size or training data volume.”

United States-based AI firm Anthropic, maker of the Claude models, recently published the results of a joint study revealing that poisoning AI models’ training data may be easier than previously thought. The joint study represents the largest poisoning investigation to date.

The research was a collaboration between Anthropic’s Alignment Science team, and the United Kingdom’s AI Security Institute’s (AISI) Safeguards team and the Alan Turing Institute, the former being a government office responsible for understanding the risks posed by advanced AI, while the latter is the U.K.’s national institute for data science and AI.

“Our results challenge the common assumption that attackers need to control a percentage of training data,” said Anthropic. “Instead, they may just need a small, fixed amount.”

Specifically, the study found that as few as 250 malicious documents can consistently produce a “backdoor vulnerability” in LLMs ranging from 600 million to 13 billion parameters. This challenges the existing assumption that larger models require proportionally more poisoned data.

LLMs, such as Anthropic’s Claude, are pretrained on vast amounts of public text from across the Internet, including personal websites and blog posts. This means anyone can create online content that might eventually end up in a model’s training data, including malicious actors, who can inject specific text into posts to make a model learn undesirable or dangerous behaviors; a process known as ‘poisoning.’

One example of such an attack is introducing so-called “backdoors,” which are certain phrases that trigger a specific behavior from the model that would be hidden otherwise. These vulnerabilities can pose significant risks to AI security.

“Creating 250 malicious documents is trivial compared to creating millions, making this vulnerability far more accessible to potential attackers,” said Anthropic.

Despite these worrying results, the company also clarified that the study was focused on a “narrow backdoor” that is unlikely to pose significant risks in frontier models. Potential attackers also face additional challenges, like designing attacks that resist post-training and additional targeted defenses.

“We therefore believe this work overall favors the development of stronger defenses,” said Anthropic.

Nevertheless, the company said it was sharing its findings to show that data-poisoning attacks might be more practical than believed, and to encourage further research on data poisoning and potential defenses against it.

Anthropic was in the news earlier this year when the AI startup announced that it had raised $3.5 billion at a $61.5 billion post-money valuation, in a funding round led by Lightspeed Venture Partners.

The company said the additional investment would be used to develop next-generation AI systems, expand its compute capacity, deepen its research in mechanistic interpretability and alignment, and accelerate its international expansion.

In order for artificial intelligence (AI) to work right within the law and thrive in the face of growing challenges, it needs to integrate an enterprise blockchain system that ensures data input quality and ownership—allowing it to keep data safe while also guaranteeing the immutability of data. Check out CoinGeek’s coverage on this emerging tech to learn more why Enterprise blockchain will be the backbone of AI.

Watch | Alex Ball on the future of tech: AI development and entrepreneurship

title=”YouTube video player” frameborder=”0″ allow=”accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share” referrerpolicy=”strict-origin-when-cross-origin” allowfullscreen=””>

Source: https://coingeek.com/bad-samples-can-poison-any-ai-model-study-finds/

Market Opportunity
Bad Idea AI Logo
Bad Idea AI Price(BAD)
$0.00000000142
$0.00000000142$0.00000000142
-0.69%
USD
Bad Idea AI (BAD) Live Price Chart
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact [email protected] for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

The Channel Factories We’ve Been Waiting For

The Channel Factories We’ve Been Waiting For

The post The Channel Factories We’ve Been Waiting For appeared on BitcoinEthereumNews.com. Visions of future technology are often prescient about the broad strokes while flubbing the details. The tablets in “2001: A Space Odyssey” do indeed look like iPads, but you never see the astronauts paying for subscriptions or wasting hours on Candy Crush.  Channel factories are one vision that arose early in the history of the Lightning Network to address some challenges that Lightning has faced from the beginning. Despite having grown to become Bitcoin’s most successful layer-2 scaling solution, with instant and low-fee payments, Lightning’s scale is limited by its reliance on payment channels. Although Lightning shifts most transactions off-chain, each payment channel still requires an on-chain transaction to open and (usually) another to close. As adoption grows, pressure on the blockchain grows with it. The need for a more scalable approach to managing channels is clear. Channel factories were supposed to meet this need, but where are they? In 2025, subnetworks are emerging that revive the impetus of channel factories with some new details that vastly increase their potential. They are natively interoperable with Lightning and achieve greater scale by allowing a group of participants to open a shared multisig UTXO and create multiple bilateral channels, which reduces the number of on-chain transactions and improves capital efficiency. Achieving greater scale by reducing complexity, Ark and Spark perform the same function as traditional channel factories with new designs and additional capabilities based on shared UTXOs.  Channel Factories 101 Channel factories have been around since the inception of Lightning. A factory is a multiparty contract where multiple users (not just two, as in a Dryja-Poon channel) cooperatively lock funds in a single multisig UTXO. They can open, close and update channels off-chain without updating the blockchain for each operation. Only when participants leave or the factory dissolves is an on-chain transaction…
Share
BitcoinEthereumNews2025/09/18 00:09
How ZKP’s Daily Presale Auction Is Creating a New Standard for 1,000x Returns

How ZKP’s Daily Presale Auction Is Creating a New Standard for 1,000x Returns

The post How ZKP’s Daily Presale Auction Is Creating a New Standard for 1,000x Returns appeared on BitcoinEthereumNews.com. Disclaimer: This article is a sponsored
Share
BitcoinEthereumNews2026/01/16 09:02
Lighter drops 14% after losing $2 support – More pain ahead for LIT?

Lighter drops 14% after losing $2 support – More pain ahead for LIT?

The post Lighter drops 14% after losing $2 support – More pain ahead for LIT? appeared on BitcoinEthereumNews.com. Since it touched a high of $4.5, Lighter has
Share
BitcoinEthereumNews2026/01/16 08:46