The post Reddit Sues Perplexity AI, Alleging ‘Industrial-Scale’ Data Theft appeared on BitcoinEthereumNews.com.

Reddit Sues Perplexity AI, Alleging ‘Industrial-Scale’ Data Theft


In brief

  • Social media platform Reddit has sued Perplexity AI, accusing the firm of an “industrial-scale” scheme to scrape its user-generated content.
  • Reddit alleges billions of search pages were scraped through tools that bypassed its and Google’s protections.
  • The lawsuit names Perplexity, SerpApi, Oxylabs, and AWM Proxy as defendants.

Social media platform Reddit sued Perplexity AI in federal court on Wednesday, alleging that the artificial intelligence company and its data partners orchestrated an “industrial-scale” scheme to scrape the platform’s user-generated content.

Reddit alleges that the other defendants, SerpApi, Oxylabs, and AWM Proxy, developed and sold tools specifically designed to break security measures protecting its content, enabling the large-scale scraping of Reddit data from search results.

The tools were allegedly built with the intention of bypassing two layers of protection: first, by evading Reddit’s own anti-scraping systems, and second, by circumventing Google’s controls to extract Reddit content directly from its search engine results.

The data companies operated as “data-scraping service providers” and “circumvented Google’s technological control measures and automatedly accessed, without authorization, almost three billion search engine results pages,” a copy of the lawsuit reads.

Reddit claims Perplexity used data from the three firms for its answer engine even after receiving a cease-and-desist letter in May 2024.

A Perplexity representative shared the company’s full response, which was posted on Reddit.

Perplexity intentionally posted its response on Reddit “to illustrate a simple point: it’s a public Reddit link accessible to anyone, yet by the logic of Reddit’s lawsuit, if you refer to it in any way, they just might sue you too,” the representative told Decrypt.

Perplexity described the lawsuit as “a sad example of what happens when public data becomes a big part of a public company’s business model.”

“Reddit thinks that’s their right. But it is the opposite of an open internet,” Perplexity stated.

A representative from SerpApi told Decrypt they did not receive “any communication or service from Reddit” on the matter, adding that they “strongly disagree with Reddit’s allegations” and intend to seek legal recourse.

“No company should claim ownership of public data that does not belong to them. It is possible that it is just an attempt to sell the same public data at an inflated price,” Denas Grybauskas, chief governance and strategy officer at Oxylabs, told Decrypt in an emailed statement.

Reddit similarly “made no attempt to speak” with Oxylabs, Grybauskas said.

Decrypt has reached out to Reddit, Google, and AWM Proxy for comment and will update this article should they respond.

A legal tangle

In cases like this, courts would need to look first at whether the terms of service from platforms like Reddit “explicitly addresses AI training, data scraping, and commercial use,” Andrew Rossow, public affairs attorney and director of strategic partnerships at video search and content intelligence platform Oriane, told Decrypt.

If a user agreed to terms that “grant the platform a broad, perpetual, royalty-free license to their content,” that license “generally governs the relationship between the user and the platform,” Rossow explained.

But it doesn’t “automatically grant the AI company a license” to do the same, unless the terms permitted the platform “to sublicense or sell the data for that purpose,” he added.

Courts would then have to “distinguish between the user’s copyright in their expression (the text of the post) and the use of the content for data mining (extracting patterns, facts, and language models),” he explained.

Still, the supposed “knowledge” behind a large language model (LLM) “is the product of millions of users’ time, effort, and creative expression,” Rossow argued.

“Treating this human-generated content as a free, raw, undifferentiated resource is a form of labor exploitation that devalues online contributions,” Rossow opined, adding that AI companies need to “respect digital citizenship and community norms,” given how these are “the implicit and explicit rules of the digital public spaces they ingest.”


Source: https://decrypt.co/345613/reddit-sues-perplexity-ai-alleging-industrial-scale-data-theft

