{"id":36241,"date":"2025-10-23T20:51:40","date_gmt":"2025-10-23T20:51:40","guid":{"rendered":"https:\/\/agooka.com\/news\/technologies\/reddit-sues-perplexity-over-alleged-large-scale-data-scraping\/"},"modified":"2025-10-23T20:51:40","modified_gmt":"2025-10-23T20:51:40","slug":"reddit-sues-perplexity-over-alleged-large-scale-data-scraping","status":"publish","type":"post","link":"https:\/\/agooka.com\/news\/technologies\/reddit-sues-perplexity-over-alleged-large-scale-data-scraping\/","title":{"rendered":"Reddit sues Perplexity over alleged large-scale data scraping"},"content":{"rendered":"<p><img decoding=\"async\" src=\"https:\/\/dataconomy.com\/wp-content\/uploads\/2025\/10\/1172140.jpg\" alt=\"Reddit sues Perplexity over alleged large-scale data scraping\" title=\"Reddit sues Perplexity over alleged large-scale data scraping\"\/><\/p>\n<p>Reddit has filed a lawsuit against the answer-engine company Perplexity and three data-scraping service providers, SerpApi, Oxylabs, and AWMProxy. The legal action seeks to halt what Reddit\u2019s complaint describes as the unlawful, industrial-scale circumvention of its data protections.<\/p>\n<p>The complaint alleges that Perplexity is a customer of at least one of these data-scraping firms. Reddit uses a metaphor to describe the alleged activity, comparing the providers to \u201cwould-be bank robbers\u201d who, unable to access the company\u2019s data \u201cvault\u201d directly, instead target the \u201carmored truck\u201d carrying the information. This implies the defendants are accessing Reddit\u2019s content through indirect channels. The lawsuit asserts Perplexity is choosing to acquire data through these means rather than pursuing a direct licensing agreement, a path some of its competitors have taken.<\/p>\n<p>According to the court filing, Reddit issued a cease-and-desist letter to Perplexity in May 2024, demanding it stop scraping data from the platform. Following the delivery of this letter, the volume of citations from Reddit appearing on Perplexity\u2019s service reportedly increased. To further investigate, Reddit created a post on its platform that was configured to be crawlable only by Google. The company states that \u201cwithin hours,\u201d Perplexity\u2019s answer engine \u201cproduced the contents\u201d of this specific post. Reddit contends the only way Perplexity could have acquired this content was if it, or its co-defendants, scraped Google\u2019s search results for Reddit content and rapidly integrated it into its system.<\/p>\n<p><strong>Samsung launches Perplexity TV app with Vision AI<\/strong><\/p>\n<p>The platform\u2019s user-generated content, which consists of posts written and ranked by humans across a vast array of subjects, has become a valuable resource for training artificial intelligence models. In 2023, Reddit implemented API changes that led to user protests; the company positioned these changes as a way to ensure it was compensated for the use of its data by AI developers. Since then, Reddit has secured data-licensing deals with companies including OpenAI and Google and is reportedly seeking additional arrangements. This is not Reddit\u2019s first legal challenge in this area; it previously sued Anthropic, alleging that its bots continued to access the site after the company had stated otherwise.<\/p>\n<p>Ben Lee, Reddit\u2019s chief legal officer, described the situation as an \u201cindustrial-scale \u2018data laundering\u2019 economy\u201d fueled by an AI \u201carms race for quality human content.\u201d He stated, \u201cScrapers bypass technological protections to steal data, then sell it to clients hungry for training material. Reddit is a prime target because it\u2019s one of the largest and most dynamic collections of human conversation ever created.\u201d Lee identified the co-defendants Oxylabs UAB, AWM Proxy, and SerpAI as \u201ctextbook examples of this illegal behavior,\u201d describing them as an obscure Lithuanian scraper, a former Russian botnet, and a company that advertises questionable tactics. He added, \u201cUnable to scrape Reddit directly, they mask their identities, hide their locations, and disguise their web scrapers to steal Reddit content from Google Search.\u201d<\/p>\n<p>In response to the lawsuit, Perplexity\u2019s head of communication, Jesse Dwyer, stated that the company had not yet received the legal filing. Dwyer told The Verge, \u201cwe will always fight vigorously for users\u2019 rights to freely and fairly access public knowledge.\u201d He added, \u201cOur approach remains principled and responsible as we provide factual answers with accurate AI, and we will not tolerate threats against openness and the public interest.\u201d<\/p>\n<p><strong>Featured image credit<\/strong><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Reddit has filed a lawsuit against the answer-engine company Perplexity and three data-scraping service providers, SerpApi, Oxylabs, and AWMProxy. The legal action seeks to halt what Reddit\u2019s complaint describes as the unlawful, industrial-scale circumvention of its data protections. The complaint alleges that Perplexity is a customer of at least one of these data-scraping firms. Reddit [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":36242,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[37],"tags":[],"class_list":["post-36241","post","type-post","status-publish","format-standard","has-post-thumbnail","category-technologies"],"_links":{"self":[{"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/posts\/36241","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/comments?post=36241"}],"version-history":[{"count":0,"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/posts\/36241\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/media\/36242"}],"wp:attachment":[{"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/media?parent=36241"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/categories?post=36241"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/tags?post=36241"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}