Reddit Sues Perplexity AI for Unauthorized Data Scraping to Train AI Models

Thumbnail Image

The information displayed in the AIM should not be reported as representing the official views of the OECD or of its member countries.

Reddit has filed lawsuits against Perplexity AI and other companies, alleging they illegally scraped Reddit content—circumventing technical safeguards—to train AI models without authorization. Reddit claims this violated intellectual property rights and harmed its business, highlighting ongoing legal and ethical disputes over AI training data use.[AI generated]

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (Perplexity AI's chatbot) and the unlawful scraping of user-generated content to train these AI models. The scraping bypasses protections and uses stolen data, which is a breach of Reddit's rights and user rights, constituting a violation of intellectual property and possibly privacy rights. The harm is realized as the data has been scraped and used commercially without consent. Hence, this is an AI Incident involving violations of rights due to AI system development and use.[AI generated]

AI principles

AccountabilityPrivacy & data governanceRobustness & digital securityTransparency & explainability

Industries

Digital securityMedia, social platforms, and marketing

Affected stakeholders

Business

Harm types

Economic/Property

Severity

AI incident

Business function:

Citizen/customer serviceResearch and development

AI system task:

Content generationInteraction support/chatbots

Articles about this incident or hazard

Thumbnail Image

Reddit sues AI company Perplexity and others for 'industrial-scale' scraping of user comments

2025-10-22

Yahoo! Finance

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (Perplexity AI's chatbot) and the unlawful scraping of user-generated content to train these AI models. The scraping bypasses protections and uses stolen data, which is a breach of Reddit's rights and user rights, constituting a violation of intellectual property and possibly privacy rights. The harm is realized as the data has been scraped and used commercially without consent. Hence, this is an AI Incident involving violations of rights due to AI system development and use.

Thumbnail Image

Reddit sues AI company Perplexity and others for 'industrial-scale' scraping of user comments

2025-10-22

Yahoo! Finance

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (Perplexity AI chatbot) that use scraped Reddit user comments to train or operate. The scraping is alleged to be unlawful and done at an industrial scale, constituting a breach of intellectual property rights and possibly user privacy rights. This is a direct harm linked to the AI system's development and use. The lawsuit itself is a response to this harm. Hence, this qualifies as an AI Incident due to violations of rights caused by the AI system's use of unlawfully obtained data.

Thumbnail Image

Reddit sues Perplexity for scraping data to train AI system

2025-10-22

Reuters

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (Perplexity's AI-based search engine) whose development and use rely on data obtained through alleged unlawful scraping of copyrighted content from Reddit. This unauthorized use infringes on intellectual property rights, a recognized harm under the AI Incident definition. Since the lawsuit concerns actual past and ongoing use of the data for AI training, the harm is realized rather than potential. Hence, this qualifies as an AI Incident due to violation of intellectual property rights caused by the AI system's development and use.

Thumbnail Image

Reddit sues Perplexity for scraping data to train AI system

2025-10-22

Economic Times

Why's our monitor labelling this an incident or hazard?

The event centers on the development phase of an AI system involving unauthorized data scraping, which is a potential violation of intellectual property rights and data protection laws. Since the article does not report any actual harm caused by the AI system's deployment or use, but rather a legal challenge concerning its development, this fits the definition of Complementary Information. It provides context on governance and legal responses related to AI development practices but does not describe a realized AI Incident or a plausible future harm (AI Hazard) from the AI system itself.

Thumbnail Image

Reddit sues Perplexity; lawsuit says: Is stealing data that Perplexity "desperately needs" to ... - The Times of India

2025-10-22

The Times of India

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (Perplexity's AI-powered answer engine) whose development and use rely on data allegedly obtained unlawfully from Reddit. The lawsuit alleges that this use violates Reddit's intellectual property rights, which is a recognized harm under the AI Incident definition (violation of intellectual property rights). Since the harm (copyright infringement) has already occurred and is the subject of legal action, this qualifies as an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit sues Perplexity for scraping data to train AI system By Reuters

2025-10-22

Investing.com

Why's our monitor labelling this an incident or hazard?

The article centers on a legal complaint about unauthorized data scraping to train an AI system, which implicates intellectual property rights. While this is a significant issue in AI governance and ethics, the event does not describe realized harm caused by the AI system's outputs or malfunction, nor does it present a credible risk of future harm beyond the legal dispute. The AI system's involvement is in its development phase (training data acquisition), but the harm is legal and proprietary rather than direct physical, operational, or rights violations caused by AI outputs. Thus, it fits the definition of Complementary Information, as it informs about legal and governance challenges related to AI training data use.

Thumbnail Image

Perplexity AI Lawsuit: Reddit Accused of Google Data Theft - News Directory 3

2025-10-24

News Directory 3

Why's our monitor labelling this an incident or hazard?

The event involves an AI system (Perplexity AI) whose development and use allegedly involved unauthorized data scraping from Reddit, which is claimed to breach Reddit's technological safeguards and user agreements. This constitutes a violation of intellectual property rights and privacy rights, which are protected under applicable law. Since the lawsuit alleges harm caused by the AI system's development and use, this qualifies as an AI Incident under the framework, specifically a violation of rights (c).

Thumbnail Image

Reddit sues Perplexity AI and others over alleged data scraping By Investing.com

2025-10-22

Investing.com

Why's our monitor labelling this an incident or hazard?

The article details a lawsuit alleging unauthorized data scraping by AI-related companies, which implicates AI system development and use. However, it does not report actual harm or confirmed violations resulting from the AI systems' use, only allegations and ongoing legal action. The event primarily informs about governance and legal proceedings concerning AI data practices, fitting the definition of Complementary Information rather than an AI Incident or AI Hazard.

Thumbnail Image

Reddit drags Perplexity in a new lawsuit, accusing it of building up a $20 billion company off stolen data

2025-10-22

Business Insider

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems trained on data scraped without authorization, constituting a breach of intellectual property rights and proprietary data protections. The lawsuit alleges direct misuse of Reddit's data in AI model training, which is a violation of legal rights and harms Reddit's property interests. The involvement of AI in generating answers based on stolen data and the circumvention of digital guardrails confirms AI system involvement and misuse. This meets the criteria for an AI Incident as the AI system's development and use have directly led to a breach of obligations intended to protect intellectual property rights, a recognized harm category.

Thumbnail Image

Inside the trap Reddit set for AI startup Perplexity to test whether it was stealing data

2025-10-25

Business Insider

Why's our monitor labelling this an incident or hazard?

The event describes Perplexity's AI system allegedly bypassing Reddit's technological protections to scrape and use Reddit's content without a license, which is a violation of intellectual property rights. The AI system's use of this data directly led to harm in the form of unauthorized use of protected content. The presence of AI is explicit, and the harm is realized, not just potential. Hence, this is an AI Incident under the framework.

Thumbnail Image

Reddit sues AI company Perplexity and others for 'industrial-scale'...

2025-10-22

Daily Mail Online

Why's our monitor labelling this an incident or hazard?

The article explicitly involves AI systems (Perplexity AI chatbot) that use scraped data from Reddit to train their models. The lawsuit alleges unlawful scraping and use of Reddit's content, which is a violation of intellectual property rights, a recognized harm under the AI Incident definition. The harm is realized as the scraping has already occurred and is the basis of the legal action. The involvement of AI in the development and use of these systems is clear, and the harm is directly linked to the AI system's development and use. Thus, this event meets the criteria for an AI Incident rather than a hazard or complementary information.

Thumbnail Image

'Would-Be Bank Robbers': Reddit Escalates AI Data Wars With Perplexity Lawsuit

2025-10-23

Forbes

Why's our monitor labelling this an incident or hazard?

The event involves an AI system (Perplexity AI) whose development and use allegedly rely on illegally scraped data from Reddit, constituting a breach of intellectual property rights. The lawsuit claims direct unauthorized use of Reddit's content to train AI models, which is a violation of legal protections for data and content creators. This fits the definition of an AI Incident as it involves harm through breach of intellectual property rights caused by the AI system's use. Although the harm is legal and commercial rather than physical, it is clearly articulated and pivotal to the event. The article does not merely discuss potential or future harm but an ongoing legal dispute over realized unauthorized data use, confirming the classification as an AI Incident.

Thumbnail Image

Reddit sues Perplexity for scraping of posts, expanding user data battle with AI industry

2025-10-23

CNBC

Why's our monitor labelling this an incident or hazard?

The article explicitly involves AI systems (large language models trained on scraped Reddit data) and discusses alleged violations of intellectual property rights, which is a recognized harm category. However, the event is a lawsuit alleging past unauthorized data scraping and use, with no report of direct harm caused by the AI outputs or malfunction. The main focus is on the legal and governance dispute over data rights and licensing, which fits the definition of Complementary Information. It updates on ongoing societal and legal responses to AI data use conflicts rather than reporting a new AI Incident or AI Hazard.

Thumbnail Image

Analysis | The fight between AI companies and the websites that hate them

2025-10-24

Washington Post

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (Perplexity's AI answer engine) that uses scraped data from Reddit without authorization, leading to a legal claim of intellectual property rights violations. The harm is realized, as Reddit alleges improper use of its content, which can disrupt the economic model of content creation and hosting. The AI system's development and use directly led to this harm. This fits the definition of an AI Incident because it involves a violation of intellectual property rights caused by the AI system's use. The event is not merely a potential risk or a complementary update but a concrete legal dispute over harm caused by AI use.

Thumbnail Image

Reddit launches copyright suit against AI search engine Perplexity

2025-10-22

Financial Times News

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (Perplexity's AI search engine) and alleges unauthorized scraping of copyrighted content to train these AI models. This constitutes a violation of intellectual property rights, which is one of the defined harms under AI Incidents. The lawsuit and the described actions indicate that the AI system's development and use have directly led to this harm. Hence, the event meets the criteria for an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit Launches Legal Action To Block AI Companies From Scraping Its Data

2025-10-24

Yahoo! Finance

Why's our monitor labelling this an incident or hazard?

The article involves AI systems indirectly, as the scraped data is used by AI companies, but the event is about legal action to prevent unauthorized data scraping and protect intellectual property rights. No realized harm or plausible future harm from AI misuse is described; rather, it is a governance/legal response to protect data rights and revenue. This fits the definition of Complementary Information, as it updates on societal and governance responses to AI-related data use issues without reporting an AI Incident or AI Hazard.

Thumbnail Image

Reddit sues Perplexity for allegedly stealing user data to train its AI 'answer engine'

2025-10-24

MoneyControl

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (Perplexity's generative AI) whose development and use allegedly involved unauthorized scraping of copyrighted user data from Reddit. This constitutes a violation of intellectual property rights and data protection laws, which falls under harm category (c) in the framework. The harm is realized, as Reddit has filed a lawsuit citing unauthorized use and seeking damages and injunctions. Therefore, this is not merely a potential risk but an actual incident involving AI misuse leading to legal and rights violations. Hence, the classification is AI Incident.

Thumbnail Image

'Would-Be Bank Robbers': Reddit Sues Perplexity, Data Firms Over AI Scraping

2025-10-23

CNET

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (Perplexity's AI search and training data usage) and alleges that the development and use of these AI systems involved unauthorized scraping of copyrighted content from Reddit, violating intellectual property rights. This is a direct harm under the AI Incident definition (c) concerning violations of intellectual property rights. The lawsuit and the described actions indicate that the AI system's development and use have directly led to this harm. Hence, the event qualifies as an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Why Reddit Filed Lawsuit Against Perplexity And Data Scrapers

2025-10-25

NDTV

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (chatbots trained on scraped data) and the unauthorized scraping of Reddit content to train these AI systems. This use of AI has led to a legal claim of violation of intellectual property rights, which is a breach of obligations under applicable law protecting such rights. The harm is realized in the form of unauthorized use of data and potential economic loss to Reddit and its content creators. Hence, it meets the criteria for an AI Incident due to the direct involvement of AI system use causing a rights violation.

Thumbnail Image

Reddit Alleges User Data Theft, Sues Perplexity AI And 'Scraping' Partners

2025-10-22

NDTV

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (Perplexity AI's chatbot) and the unlawful scraping of data to train these AI systems. The harm is a violation of intellectual property rights and unauthorized use of user-generated content, which is a breach of legal protections. The lawsuit alleges direct involvement of AI companies in this harm. Hence, it meets the criteria for an AI Incident rather than a hazard or complementary information, as the harm is ongoing and the AI system's role is pivotal.

Thumbnail Image

Reddit Sues Perplexity, Others Over Alleged Data Scraping

2025-10-22

Bloomberg Business

Why's our monitor labelling this an incident or hazard?

The presence of AI systems is inferred as Perplexity AI is an AI company using scraped data likely for AI model training or services. The lawsuit concerns the development phase of AI systems (data sourcing) and alleges violation of intellectual property rights. However, no actual harm or incident caused by AI system use or malfunction is described. The event is a legal proceeding addressing potential rights violations and data use practices, fitting the definition of Complementary Information as it provides context and governance response to AI ecosystem issues without reporting a realized AI Incident or AI Hazard.

Thumbnail Image

Reddit sues over 'industrial-scale' scraping of user comments

2025-10-22

ABC News

Why's our monitor labelling this an incident or hazard?

The article explicitly involves AI systems (AI chatbots trained on scraped data) and their development/use through data scraping. The alleged scraping is unauthorized and described as unlawful, which relates to intellectual property rights violations. However, the article focuses on the lawsuit and legal actions taken by Reddit rather than describing a confirmed AI Incident where harm has already occurred or an AI Hazard where harm is plausible but not realized. The event is about governance and legal proceedings addressing AI-related data scraping, making it Complementary Information according to the framework.

Thumbnail Image

Reddit sues Perplexity AI over 'industrial-scale' data scraping

2025-10-23

New York Post

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (Perplexity AI's chatbot) that uses scraped data from Reddit without authorization, which is alleged to violate copyright laws and cause unfair competition. This fits the definition of an AI system's use potentially leading to harm (violation of intellectual property rights). However, the article focuses on the filing of a lawsuit and allegations rather than confirmed or realized harm. There is no clear evidence that the AI system's use has directly caused harm yet, only that it could plausibly lead to such harm if the allegations are true. Therefore, the event is best classified as an AI Hazard rather than an AI Incident. It is not Complementary Information because it is not an update or response to a previously known incident, and it is not Unrelated because it clearly involves AI and potential harm.

Thumbnail Image

Reddit vs Perplexity, and the scourge of 'data laundering' in an AI slop world

2025-10-23

Hindustan Times

Why's our monitor labelling this an incident or hazard?

The event involves AI systems trained on unlawfully obtained data, which constitutes a violation of intellectual property rights and possibly other legal protections. The illegal scraping and resale of data for AI training directly contributes to these harms. The article details actual lawsuits and complaints, indicating that harm has occurred rather than just a potential risk. Therefore, this qualifies as an AI Incident due to violations of rights caused by AI system development and use.

Thumbnail Image

Reddit sues Perplexity for allegedly ripping its content to feed AI

2025-10-22

The Verge

Why's our monitor labelling this an incident or hazard?

The lawsuit centers on the alleged unauthorized scraping of Reddit's copyrighted content to train or support an AI system, which is a direct violation of intellectual property rights. The AI system (Perplexity's answer engine) is explicitly mentioned and is central to the dispute. The harm (copyright infringement) has already occurred or is ongoing, as Reddit is seeking to stop the unlawful data scraping. This fits the definition of an AI Incident because the AI system's development and use have directly led to a breach of intellectual property rights.

Reddit sues Perplexity and three other companies for allegedly using its content without paying

2025-10-22

engadget

Why's our monitor labelling this an incident or hazard?

The event describes how AI companies, including Perplexity, have used scraped Reddit content without permission to train or power their AI systems, leading to legal action by Reddit for copyright infringement. The AI system's use of unlicensed data directly causes a breach of intellectual property rights, which is a recognized harm under the AI Incident definition. The involvement of AI systems in generating answers based on scraped content is explicit, and the harm (violation of rights) is realized and central to the event. Hence, this is an AI Incident.

Thumbnail Image

Perplexity fires back at Reddit's lawsuit, denies 'data theft' allegations amid scrutiny of AI scraping practices | Company Business News

2025-10-23

mint

Why's our monitor labelling this an incident or hazard?

The article describes a lawsuit alleging unlawful data scraping by an AI company, which could plausibly lead to violations of intellectual property rights or other legal harms if proven. However, the article does not report any actual harm or incident resulting from the AI system's use of the data, only allegations and legal actions. Therefore, this situation fits the definition of an AI Hazard, as the development and use of AI systems with potentially unauthorized data access could plausibly lead to an AI Incident if the claims are substantiated or if such practices continue unchecked. It is not Complementary Information because the main focus is the legal dispute and potential harm, not a response or update to a past incident. It is not an AI Incident because no harm has been confirmed or realized yet.

Thumbnail Image

Reddit accuses Perplexity of stealing its data, calls AI content race industrial-scale laundering

2025-10-23

India Today

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (Perplexity AI's AI-driven answer engine) trained on data scraped from Reddit without authorization. The lawsuit alleges that this unauthorized use of copyrighted content directly violates intellectual property rights, a recognized harm under the AI Incident definition. The involvement of AI in the development and use of the system that caused this harm is clear. Hence, this is an AI Incident due to the direct link between AI system use and violation of intellectual property rights.

Thumbnail Image

Reddit sues Perplexity for scraping data to train AI system

2025-10-23

The Hindu

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (Perplexity's AI-based search engine) whose development and use rely on data allegedly scraped without permission from Reddit, constituting a breach of intellectual property rights. The harm is realized as Reddit has filed a lawsuit, indicating the unauthorized use has already taken place. This meets the criteria for an AI Incident because the AI system's development and use have directly led to a violation of intellectual property rights.

Thumbnail Image

Reddit accuses Perplexity of stealing content to train AI

2025-10-23

Mashable SEA

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (Perplexity's AI answer engine) that allegedly used unauthorized data scraping to obtain content from Reddit, violating Reddit's rights. This is a direct consequence of the AI system's development and use. The harm is a violation of intellectual property rights and legal obligations, which is a recognized category of AI harm. Since the harm is realized (lawsuit filed due to alleged unauthorized use), this qualifies as an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit Sues a Collection of Startups It Says Are Wrongly Scraping It for AI Training Data

2025-10-23

Gizmodo

Why's our monitor labelling this an incident or hazard?

The article centers on Reddit's lawsuit against companies scraping data for AI training, which involves AI system development but does not describe a direct or indirect harm caused by AI outputs or use. The legal dispute highlights issues of intellectual property and data rights in AI training but does not report an AI Incident (harm realized) or an AI Hazard (plausible future harm). Instead, it documents a governance and legal response to AI-related data practices, fitting the definition of Complementary Information.

Thumbnail Image

Reddit Sues Perplexity AI, Calling Its Data Collection 'Industrial-Scale Theft'

2025-10-23

Republic World

Why's our monitor labelling this an incident or hazard?

The article explicitly involves AI systems (Perplexity AI's AI-powered search tool) and their development/use through data scraping practices. The alleged unauthorized scraping of Reddit's content for AI training implicates violations of intellectual property rights and data protection laws, which are harms under the AI Incident definition. However, since the article describes an ongoing lawsuit and no confirmed harm or court decision has been reported, the harm is potential rather than realized. This fits the definition of an AI Hazard, where the AI system's use could plausibly lead to a breach of legal rights. The event is not Complementary Information because it is not an update or response to a prior incident but a new legal challenge. It is not Unrelated because it clearly involves AI systems and potential legal harm.

Thumbnail Image

Reddit Sues Perplexity for Scraping User Data to Train its AI

2025-10-23

Windows Report | Error-free Tech Life

Why's our monitor labelling this an incident or hazard?

The event describes a lawsuit against an AI company for scraping user data without permission to train AI models. This directly relates to the development and use of AI systems and involves a breach of intellectual property rights and possibly user rights. The harm is realized as Reddit is seeking damages and an injunction, indicating the unauthorized use has already happened. Therefore, this is an AI Incident due to violations of rights caused by the AI system's development and use.

Thumbnail Image

Reddit sues AI company over alleged 'industrial-scale' scraping of its users' comments

2025-10-22

PBS.org

Why's our monitor labelling this an incident or hazard?

The event involves AI systems (AI chatbots and answer engines) that rely on large-scale data scraping to train their models. The scraping is alleged to be unlawful and to violate copyright laws, which are legal protections of intellectual property rights. Since the scraping has already occurred and is the basis for the AI systems' training and operation, this constitutes a realized harm (violation of intellectual property rights). Hence, the event meets the criteria for an AI Incident as the AI system's development and use have directly led to a breach of applicable law protecting intellectual property rights.

Thumbnail Image

Aravind Srinivas' Perplexity AI sued by Reddit for data scraping, says, 'We will play fair but won't...

2025-10-23

The Financial Express

Why's our monitor labelling this an incident or hazard?

The article explicitly mentions that Perplexity AI uses AI systems that generate answers by summarizing and citing Reddit content. Reddit alleges that Perplexity and other firms engaged in unauthorized data scraping, violating copyright laws. This is a direct violation of intellectual property rights due to the AI system's use of scraped data for training or generating outputs. The harm is realized as Reddit has filed a lawsuit seeking damages and injunctions. The AI system's development and use are central to the incident, fulfilling the criteria for an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit sues Perplexity for scraping data to train AI system

2025-10-22

CNA

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (Perplexity's AI-based search engine) and concerns the development phase of the AI system, specifically the data sourcing for training. The alleged unlawful scraping of Reddit's data constitutes a violation of intellectual property rights and possibly other legal protections. Since the complaint indicates that the AI system was trained using this data, and this training is the basis of the legal claim, this qualifies as an AI Incident under the category of violations of intellectual property rights and breach of applicable law protecting such rights.

Thumbnail Image

Reddit Sues Perplexity For Unlawfully Scraping Data To Train AI Search Engine: Report

2025-10-22

Asianet News Network Pvt Ltd

Why's our monitor labelling this an incident or hazard?

The event explicitly involves the use of an AI system (Perplexity's AI search engine) trained on data scraped without authorization from Reddit, which is a violation of intellectual property rights and legal agreements. The harm is realized as Reddit has filed lawsuits, indicating that the unauthorized use has already taken place. This fits the definition of an AI Incident because the AI system's development and use directly led to a breach of legal obligations protecting intellectual property rights.

Thumbnail Image

Reddit sues AI company Perplexity and others for 'industrial-scale' scraping of user comments

2025-10-23

The Star

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (chatbots and AI training) and the unlawful scraping of Reddit user comments to train these AI systems without consent or licensing. This constitutes a violation of intellectual property rights and possibly user rights, which are harms covered under the AI Incident definition. The lawsuit indicates that the harm has already occurred through unauthorized data acquisition and use, not merely a potential future risk. Hence, it is an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit sues Perplexity and others for allegedly scraping millions of user comments

2025-10-22

Fast Company

Why's our monitor labelling this an incident or hazard?

The article involves AI systems (Perplexity's AI chatbot) and data scraping for AI development, which relates to intellectual property rights and legal frameworks. However, it does not report any direct or indirect harm caused by the AI system's use or malfunction, nor does it describe a credible risk of future harm. Instead, it focuses on a legal dispute addressing potential misuse of data for AI training. This aligns with the definition of Complementary Information, as it details a governance/legal response to AI-related practices without reporting a new AI Incident or AI Hazard.

Thumbnail Image

Reddit sues to block Perplexity from scraping Google search results

2025-10-23

Ars Technica

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (Perplexity's answer engine using a large language model) that is alleged to have illegally scraped and used Reddit content, causing harm to Reddit's intellectual property rights and business interests. The lawsuit details how the AI system's use and the associated scraping activities have directly led to these harms. This fits the definition of an AI Incident as it involves the use of an AI system leading to violations of intellectual property rights and economic harm. The involvement of anti-scraping circumvention and the use of AI to parse and generate answers from scraped content further supports this classification. The event is not merely a potential risk or a complementary update but a concrete legal action based on realized harm.

Thumbnail Image

Reddit sues Perplexity over data scraping

2025-10-22

Axios

Why's our monitor labelling this an incident or hazard?

The article involves AI systems indirectly, as the scraped data is used to train AI models. The lawsuit alleges violations of intellectual property rights due to unauthorized data scraping, which is a breach of legal protections. However, the article does not describe actual harm caused by the AI systems' outputs or use, only the alleged unlawful data acquisition. This situation represents a potential risk and legal challenge but not a realized AI Incident or immediate hazard. It is best classified as Complementary Information providing context on AI ecosystem challenges and legal responses.

Thumbnail Image

Reddit Launches Legal Action to Block AI Companies from Scraping its Data

2025-10-22

Social Media Today | A business community for the web's best thinkers on Social Media

Why's our monitor labelling this an incident or hazard?

The article details a legal dispute over unauthorized data scraping for AI training, involving AI systems indirectly as data consumers. However, it does not report any actual harm caused by AI systems, nor does it describe a credible risk of harm stemming from the AI systems' use or malfunction. The focus is on Reddit's efforts to protect its data rights and establish legal precedent, which is a governance and societal response to AI ecosystem challenges. This fits the definition of Complementary Information, as it enhances understanding of AI-related legal and business dynamics without reporting a new AI Incident or AI Hazard.

Thumbnail Image

Reddit strikes new battle with Perplexity AI in ongoing war with illegal data scrapers

2025-10-22

The A.V. Club

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (Perplexity's chatbot) trained on illegally scraped data from Reddit, which is copyrighted content. The unauthorized data scraping and use for AI training directly violate intellectual property rights, fulfilling the harm criterion (c) under AI Incident. The lawsuit and the described misuse indicate that harm has already occurred, not just a potential risk. Therefore, this qualifies as an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit Sues Perplexity and Others Over Data Scraping to Train AI System

2025-10-22

Adweek

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems trained on data scraped without permission, which is a breach of intellectual property rights and legal obligations. The unauthorized use of Reddit's content for AI training has directly led to legal harm and a dispute, fulfilling the criteria for an AI Incident under violations of human rights or breach of applicable law protecting intellectual property rights. The involvement of AI systems in the use of scraped data for training is clear, and the harm is realized through the infringement and legal consequences.

Thumbnail Image

Reddit sues AI company Perplexity and others for 'industrial-scale' scraping of user comments - CNBC TV18

2025-10-23

cnbctv18.com

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (Perplexity AI's chatbot and AI training data acquisition) and alleges unlawful scraping of user content for AI training, which is a violation of copyright laws and unfair competition. The harm is realized as Reddit claims its content was used without authorization, infringing on intellectual property rights. This fits the definition of an AI Incident because the AI system's development and use directly led to a breach of legal obligations protecting intellectual property rights, causing harm to Reddit as a content owner.

Thumbnail Image

Reddit Sues Perplexity Over Alleged Illegal Data Scraping

2025-10-23

Gizbot

Why's our monitor labelling this an incident or hazard?

The event involves AI system development and use (training an AI search engine) and alleges unauthorized data scraping, which is a violation of intellectual property rights. However, the article does not indicate that this has yet resulted in a breach of rights or other harms; it is a legal action initiated to address potential or ongoing misuse. Therefore, this situation represents a plausible risk of harm related to AI development and use, fitting the definition of an AI Hazard rather than an AI Incident. It is not merely complementary information because the legal action itself is a direct response to alleged AI-related misuse with potential for harm.

Thumbnail Image

Reddit Sues Perplexity AI for Alleged Data Theft, Calls AI Content Race 'Industrial-Scale Laundering'

2025-10-23

The Hans India

Why's our monitor labelling this an incident or hazard?

The article explicitly describes Perplexity AI's alleged unauthorized scraping of Reddit's copyrighted user-generated content to train its AI system, which is a violation of intellectual property rights. This use of AI development and training data directly leads to harm in the form of legal rights violations. The involvement of AI in the development and use of the system is clear, and the harm is realized through the alleged infringement and resulting lawsuit. Therefore, this qualifies as an AI Incident under the framework's definition of harm to intellectual property rights caused by AI system development and use.

Thumbnail Image

Reddit Sues Perplexity For Scraping Data to Train AI System

2025-10-23

Deccan Chronicle

Why's our monitor labelling this an incident or hazard?

The event involves AI system development (training an AI search engine) and the alleged unauthorized data scraping, which is a violation of intellectual property rights. However, the article does not report any direct or indirect harm caused by the AI system's outputs or use, such as harm to persons, communities, or property. The focus is on the legal action and dispute over data use, which is a governance and societal response to AI development practices. Hence, it fits the definition of Complementary Information rather than an AI Incident or AI Hazard.

Thumbnail Image

Reddit sues Perplexity over alleged illegal data scraping to train its AI engine

2025-10-23

Digit

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (Perplexity's AI-based search engine) that was allegedly trained using illegally scraped data from Reddit, a violation of intellectual property rights. This harm has already occurred as Reddit has filed a lawsuit claiming unauthorized use of its content. The AI system's development and use are central to the incident, fulfilling the criteria for an AI Incident under the framework.

Reddit sues AI company Perplexity and others for 'industrial-scale' scraping of user comments

2025-10-22

Market Beat

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (Perplexity AI's chatbot) and the use of scraped data to train these AI models. The lawsuit alleges unlawful scraping and use of Reddit's content without permission, which constitutes a violation of intellectual property rights and possibly user rights. This is a direct harm related to the AI system's development and use, as the AI company is accused of using stolen data to train its models. Hence, this qualifies as an AI Incident under the category of violations of human rights or breach of intellectual property rights caused by the AI system's development and use.

Thumbnail Image

Reddit sues Perplexity, three other firms, for AI scraping

2025-10-23

Computerworld

Why's our monitor labelling this an incident or hazard?

The article explicitly involves AI systems because the scraped data is used for AI training. The event stems from the development and use of AI systems with unauthorized data collection. However, the article does not report direct or indirect harm caused by the AI systems themselves, only the legal dispute over data scraping practices. The potential violation of intellectual property rights is alleged but not confirmed as harm caused by AI system deployment. The main focus is on the lawsuit and the governance/legal response to AI data practices, making it Complementary Information rather than an Incident or Hazard.

Thumbnail Image

Perplexity Just Got Caught Breaking the Rules Red-Handed

2025-10-24

Futurism

Why's our monitor labelling this an incident or hazard?

The event involves an AI system (Perplexity's AI-powered search engine) that uses scraped data from Reddit without authorization, constituting a violation of intellectual property rights. This is a direct harm caused by the AI system's use, fulfilling the criteria for an AI Incident under violations of human rights or breach of legal obligations protecting intellectual property. The lawsuit and the described data scraping and use of AI-generated search results demonstrate realized harm, not just potential harm.

Thumbnail Image

Reddit Sues Perplexity, Others Over Copyrighted Posts

2025-10-22

MediaPost

Why's our monitor labelling this an incident or hazard?

The article explicitly describes AI systems (Perplexity's 'answer engine') using scraped Reddit content without authorization, violating copyright protections. The involvement of AI in processing and generating answers from this data is clear. The harm is a breach of intellectual property rights due to unauthorized data scraping and use in AI training or operation. This meets the definition of an AI Incident as the AI system's use has directly led to a violation of legal rights and harm to Reddit's property rights. The complaint and legal action confirm the harm has materialized, not just a potential risk.

Thumbnail Image

Reddit Sues Perplexity AI, Alleging 'Industrial-Scale' Data Theft - Decrypt

2025-10-23

Decrypt

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (large language models) trained or powered by data scraped from Reddit without authorization, which is a direct violation of intellectual property rights and terms of service. The alleged 'industrial-scale' data theft and continued use after cease-and-desist demonstrate realized harm. This fits the AI Incident category under violations of human rights or breach of obligations protecting intellectual property rights. The involvement of AI in the development and use of these systems is clear, and the harm is direct and ongoing, not merely potential. Therefore, the classification as an AI Incident is appropriate.

Thumbnail Image

The fight between AI companies and the websites that hate them

2025-10-25

The Philadelphia Inquirer

Why's our monitor labelling this an incident or hazard?

The event involves AI systems that use large-scale data scraping to train models, which is central to the dispute. The lawsuit alleges improper use of data, which could lead to violations of intellectual property rights and economic harm to content creators and platforms. However, the article does not describe actual harm having occurred, only the legal challenge and potential future consequences. This fits the definition of an AI Hazard, as the development and use of AI systems in this manner could plausibly lead to significant harms, but no incident has yet materialized.

Thumbnail Image

Reddit Sues Perplexity, Other AI Companies for Scraping User Comments

2025-10-22

TheWrap

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI companies using scraped Reddit comments to train AI models without permission, constituting a violation of intellectual property rights. This is a direct legal harm linked to the development and use of AI systems. The lawsuit alleges actual harm (copyright infringement and unfair competition), meeting the criteria for an AI Incident under violations of intellectual property rights. The involvement of AI systems in training chatbots on scraped data is clear, and the harm is realized through the legal claims.

Thumbnail Image

Reddit Sues Perplexity AI and Data Firms Over Alleged Unauthorized Scraping | PYMNTS.com

2025-10-22

PYMNTS.com

Why's our monitor labelling this an incident or hazard?

The article involves AI systems indirectly because the scraped data is used to train AI models, but the event is a lawsuit over unauthorized data scraping and copyright infringement, not an incident where AI caused harm or a hazard where AI could plausibly cause harm. The harm alleged is legal and intellectual property rights violation, but it is framed as a dispute rather than a realized violation caused by AI system deployment or malfunction. The event focuses on legal action and industry practices, which fits the definition of Complementary Information as it informs about governance and legal responses to AI-related data use issues.

Thumbnail Image

Reddit Sues Perplexity Over Alleged Data Scraping | PYMNTS.com

2025-10-22

PYMNTS.com

Why's our monitor labelling this an incident or hazard?

The article explicitly involves AI systems (generative AI models trained on scraped Reddit data) and describes a legal claim of unauthorized data scraping and use, which constitutes a violation of intellectual property rights. This is a direct harm linked to the AI system's development and use. The lawsuit highlights actual harm (legal rights violations) rather than potential harm, making it an AI Incident rather than a hazard or complementary information. The focus is on the misuse of data for AI training causing legal and rights harm, fitting the AI Incident definition.

Thumbnail Image

Meta Cuts 600 Jobs in Major AI Push | AIM

2025-10-23

Analytics India Magazine

Why's our monitor labelling this an incident or hazard?

The event involves an AI system (Perplexity's AI-powered search engine) whose development involved unauthorized scraping of copyrighted content, leading to a legal claim of rights violations. This fits the definition of an AI Incident because the AI system's development and use have directly led to a breach of intellectual property rights, a harm explicitly covered under the framework. The article does not merely discuss potential or future harm but reports an ongoing lawsuit based on alleged past actions, indicating realized harm or at least a formal claim of harm.

Thumbnail Image

Reddit accuses Perplexity AI of data theft

2025-10-23

Bizcommunity.com

Why's our monitor labelling this an incident or hazard?

The article explicitly involves an AI system (Perplexity AI's chatbot and answer engine) and alleges that its development or use involved unauthorized data scraping from Reddit, constituting a violation of intellectual property rights. This fits the definition of an AI Incident because the AI system's development/use has directly or indirectly led to a breach of obligations under applicable law protecting intellectual property rights. The presence of a lawsuit and statements from Reddit's legal officer further confirm the harm and legal implications. There is no indication that this is merely a potential risk or a complementary update; the event centers on an alleged realized harm involving AI.

Thumbnail Image

Reddit sues Perplexity, accusing the AI lab of using scraped content for training

2025-10-23

Neowin

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (large language models) trained on scraped Reddit content without authorization, which Reddit claims is a violation of its intellectual property rights. The unauthorized scraping and use of data for AI training is a direct cause of harm to Reddit's rights and business interests. The involvement of AI in the development and use of these models is clear, and the harm (copyright infringement and breach of terms) is realized and central to the lawsuit. Hence, this is an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit Battles Perplexity in AI Data Scraping Dispute | Technology

2025-10-22

Devdiscourse

Why's our monitor labelling this an incident or hazard?

The article explicitly describes an AI system (Perplexity's AI search engine) whose development involved unauthorized scraping of copyrighted content from Reddit. This unauthorized use of data infringes on intellectual property rights, constituting a harm under the AI Incident definition. The lawsuit indicates that the AI system's development and use have directly led to this violation, making it an AI Incident rather than a hazard or complementary information. The event is not merely a potential risk but an actual legal dispute over realized harm.

Thumbnail Image

Reddit Battles AI Startup in Court Over Data Scraping Allegations | Technology

2025-10-22

Devdiscourse

Why's our monitor labelling this an incident or hazard?

The event involves AI systems being trained on data scraped without permission, which constitutes a violation of intellectual property rights, a breach of applicable law protecting such rights. Since the lawsuit concerns actual unauthorized data scraping and use for AI training, this is a realized harm related to AI system development and use. Therefore, this qualifies as an AI Incident due to the direct link between AI system development and violation of intellectual property rights.

Thumbnail Image

Reddit Takes Legal Action Against Data Scrapers | Technology

2025-10-22

Devdiscourse

Why's our monitor labelling this an incident or hazard?

The event involves an AI system (an AI-based search engine) trained on data scraped illicitly, which is a violation of data use and privacy laws, thus relating to potential harm to rights. However, the article does not report actual harm caused by the AI system's outputs or use, nor does it indicate plausible future harm beyond the legal dispute. The main focus is on the legal action taken by Reddit, which is a governance response to AI-related data use issues. Hence, it fits the definition of Complementary Information rather than an AI Incident or AI Hazard.

Thumbnail Image

Reddit vs. Perplexity: Data Scraping Showdown in the AI Arena | Technology

2025-10-22

Devdiscourse

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (AI search engines) trained on data scraped without authorization, leading to a legal claim of violation of intellectual property rights. This fits the definition of an AI Incident because the AI system's development and use have directly led to a breach of obligations under applicable law protecting intellectual property rights. The lawsuit and allegations confirm that harm has occurred, not just a potential risk, so this is not merely a hazard or complementary information.

Thumbnail Image

Reddit Sues Perplexity Over Alleged Data Scraping To Train Its AI-Based Search Engine

2025-10-23

LatestLY

Why's our monitor labelling this an incident or hazard?

The event involves the use of an AI system (Perplexity's AI-based search engine) trained on data scraped without permission from Reddit, which is a violation of intellectual property rights and possibly other legal protections. This unauthorized data scraping and use directly led to legal action, indicating realized harm. Therefore, it meets the criteria for an AI Incident due to breach of obligations under applicable law protecting intellectual property rights.

Thumbnail Image

Reddit Sues Perplexity, Others Over Alleged Data Scraping (1)

2025-10-22

news.bloomberglaw.com

Why's our monitor labelling this an incident or hazard?

The article involves AI systems indirectly because the scraped data is used for AI training, but the event is about a lawsuit alleging unauthorized data scraping and copyright infringement. There is no direct or indirect harm caused by the AI systems themselves described here, only a legal dispute over data use. The event is a governance/legal response to concerns about AI data sourcing practices, fitting the definition of Complementary Information rather than an AI Incident or AI Hazard.

Thumbnail Image

AI Ethics Clash: Reddit Slaps Perplexity & Three Others with Lawsuit Over AI Data Theft Claims

2025-10-23

Analytics Insight

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (answer engines powered by scraped data) and alleges that these systems were trained using data obtained by circumventing Reddit's data protection measures, constituting a violation of copyright laws and unfair competition. These harms fall under violations of intellectual property rights and legal obligations protecting such rights, which qualifies as harm (c) under the AI Incident definition. The lawsuit indicates that the harm has already occurred (data scraping and use in AI training), not just a potential risk, so this is an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit sues AI company Perplexity and others for 'industrial-scale' scraping of user comments - WTOP News

2025-10-22

WTOP

Why's our monitor labelling this an incident or hazard?

The lawsuit explicitly involves AI systems (Perplexity AI's chatbot) that rely on scraped Reddit data for training. The scraping is alleged to be unlawful and bypasses protections, constituting a breach of intellectual property rights and user content rights. This is a direct harm linked to the development and use of AI systems, fulfilling the criteria for an AI Incident under violations of intellectual property rights and breach of legal obligations. The event is not merely a potential risk but an ongoing legal dispute over realized unauthorized use, thus not a hazard or complementary information.

Thumbnail Image

Reddit to Perplexity: Get your filthy hands off our forums

2025-10-22

TheRegister.com

Why's our monitor labelling this an incident or hazard?

The event involves the use of AI systems trained on unlawfully scraped data, leading to violations of intellectual property rights and unfair competition. The lawsuit claims direct harm caused by the AI system's development and use based on illegally obtained data. This fits the definition of an AI Incident because the AI system's use has directly led to a breach of intellectual property rights and related harms. The presence of AI is explicit (AI answer engine, AI models), and the harm is realized (lawsuit filed for damages and injunction).

Thumbnail Image

'Would-Be Bank Robbers': Reddit Says Perplexity Steals Data - Law360

2025-10-23

law360.com

Why's our monitor labelling this an incident or hazard?

The article explicitly involves AI systems (generative text products) developed using data scraped from Reddit without authorization. This constitutes a violation of intellectual property rights, a recognized harm under the AI Incident definition. Since the lawsuit alleges that the AI system's development involved unauthorized use of copyrighted material, this is a direct harm linked to AI system development and use.

Thumbnail Image

Reddit sues Perplexity, SerpApi over scraping Google Search data

2025-10-22

Search Engine Land

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (Perplexity is an AI search engine) and concerns the unauthorized scraping and use of Reddit data to train AI models. This unauthorized use of data infringes on intellectual property rights and is alleged to have caused harm to Reddit by bypassing licensing agreements and potentially undermining its business model. The lawsuit seeks damages and injunctions, indicating realized harm. Therefore, this qualifies as an AI Incident due to violations of intellectual property rights and harm caused by AI system development and use.

Thumbnail Image

Reddit sues Perplexity and 3 others for data scraping

2025-10-23

NewsBytes

Why's our monitor labelling this an incident or hazard?

The event describes unauthorized scraping of Reddit content to train AI chatbots, which is a violation of intellectual property rights and unfair competition. Since the AI systems were developed using this data, and harm (legal and economic) has occurred, this qualifies as an AI Incident under the framework, specifically under violations of intellectual property rights (c).

Thumbnail Image

Perplexity denies 'data theft' allegations in Reddit lawsuit

2025-10-23

NewsBytes

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems used by Perplexity to process Reddit data. The lawsuit alleges unauthorized data scraping, which constitutes a violation of intellectual property rights and possibly user privacy, both falling under harm category (c). However, the article describes allegations and ongoing legal action without confirming that harm has already occurred or been legally established. Therefore, this event represents a potential legal and ethical risk related to AI system use, but not a confirmed AI Incident. It is not merely general AI news, as it involves specific allegations of harm linked to AI system use. Hence, it is best classified as Complementary Information, providing context on legal and governance responses to AI data use issues.

Thumbnail Image

Reddit sues Perplexity for scraping data to train AI system

2025-10-22

Free Malaysia Today

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (Perplexity's AI-based search engine) and concerns the development and use of these systems based on data scraped without authorization. This unauthorized use of copyrighted content to train AI systems constitutes a violation of intellectual property rights, which is a breach of applicable law protecting such rights. Since the harm (copyright infringement) has already occurred and legal action is underway, this qualifies as an AI Incident under the framework, specifically under category (c) violations of intellectual property rights.

Thumbnail Image

Reddit sues Perplexity for scraping data to train AI system | Honolulu Star-Advertiser

2025-10-22

Honolulu Star Advertiser

Why's our monitor labelling this an incident or hazard?

The article explicitly describes an AI system (Perplexity's AI-based search engine) whose training involved unauthorized scraping of Reddit's data, which is copyrighted material. This unauthorized use constitutes a violation of intellectual property rights, a recognized harm under the AI Incident definition. The involvement of the AI system's development and use is clear, and the harm (legal rights violation) has occurred, as evidenced by the lawsuit. Hence, this is an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit's 'AI Scraping' Lawsuit Is An Attack On The Open Internet

2025-10-24

Techdirt

Why's our monitor labelling this an incident or hazard?

The article centers on a legal dispute involving AI-related data scraping and content access but does not report any realized harm or incident caused by AI systems. It discusses potential implications and critiques the lawsuit's approach, which could have future consequences for AI data use and the open internet, but these are speculative and not immediate harms. Therefore, the article is best classified as Complementary Information, as it provides context, analysis, and governance-related discussion about AI data use and legal challenges without describing a specific AI Incident or AI Hazard.

Thumbnail Image

Reddit sues AI company Perplexity and others for 'industrial-scale' scraping of user comments

2025-10-22

Spectrum News Bay News 9

Why's our monitor labelling this an incident or hazard?

The article details a lawsuit against AI companies for unauthorized data scraping used to train AI systems, which implicates AI system development and use. However, it does not report any direct or indirect harm caused by the AI systems themselves, such as injury, rights violations, or operational disruption. The focus is on legal and governance issues surrounding AI training data acquisition, which is a significant societal and governance response to AI development practices. This fits the definition of Complementary Information, as it enhances understanding of AI ecosystem challenges and responses without describing a new AI Incident or AI Hazard.

Thumbnail Image

Reddit is suing Perplexity and AI data scraping firms for using its data without permission - SiliconANGLE

2025-10-23

SiliconANGLE

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (large language models) trained on data scraped without permission, leading to a legal claim of copyright infringement. The unauthorized scraping and use of Reddit's data for AI training is a breach of intellectual property rights, which is a recognized harm under the AI Incident framework. The involvement of AI in the development and use of these models is clear, and the harm (legal violation) has occurred, making this an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit Sues Perplexity For Alleged Data Scraping To Train AI Models

2025-10-22

finanzen.ch

Why's our monitor labelling this an incident or hazard?

The article explicitly involves AI systems (Perplexity's AI models) trained using data scraped from Reddit without authorization. This unauthorized data scraping constitutes a breach of intellectual property rights, a recognized harm under the AI Incident definition. The lawsuit alleges that this infringement has already occurred, indicating realized harm rather than a potential risk. Hence, the event meets the criteria for an AI Incident due to the direct involvement of AI system development and use causing a violation of rights.

Reddit Sues Perplexity AI Over Alleged Data Scraping and Content Theft

2025-10-23

Android Headlines

Why's our monitor labelling this an incident or hazard?

The lawsuit alleges that Perplexity AI scraped Reddit content without authorization and used it to train its AI models, which is a direct violation of intellectual property rights. The involvement of the AI system is explicit, as the AI's training data allegedly includes the scraped content. The harm is realized, as Reddit claims its content was used without consent, constituting a breach of legal protections. This fits the definition of an AI Incident due to the direct link between the AI system's use and the violation of rights. The event is not merely a potential risk or a complementary update but a concrete legal dispute over harm caused by AI use.

Thumbnail Image

Reddit sues Perplexity for scraping data to train AI system

2025-10-23

The Manila times

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (Perplexity's AI-based search engine) whose development and use depend on data scraped without authorization from Reddit, a violation of copyright law. This constitutes a breach of intellectual property rights, one of the harms defined under AI Incidents. The lawsuit and the described circumstances indicate that the AI system's development and use have directly led to this harm. Hence, the event meets the criteria for an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Explained - Why Reddit Sued Perplexity AI & Other Web Scrapers?

2025-10-24

MediaNama

Why's our monitor labelling this an incident or hazard?

The event involves the use and alleged misuse of AI systems (Perplexity AI's chatbot) that rely on unauthorized scraping of Reddit's copyrighted content, leading to violations of intellectual property rights and breach of terms of service. The harm is realized, as Reddit claims its valuable content was accessed and used without permission, constituting a breach of rights and unauthorized data use. The AI system's role is pivotal because the scraped data is used to train or power the AI chatbot, directly linking the AI system to the harm. Hence, this is an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit catches Perplexity in clever data trap

2025-10-25

Rolling Out

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (Perplexity AI) that ingests and generates content based on scraped data. The misuse of the AI system to scrape data despite technological barriers and agreements not to do so constitutes a breach of legal obligations protecting intellectual property rights. The harm is realized as unauthorized use of protected content, which is a violation of rights under applicable law. The event is not merely a potential risk but an actual incident with evidence presented in a lawsuit. Hence, it meets the criteria for an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit sues AI company Perplexity and others for 'industrial-scale' scraping of user comments

2025-10-22

KTAR News

Why's our monitor labelling this an incident or hazard?

The event involves AI systems (Perplexity AI's chatbot) whose development and use rely on data scraped unlawfully from Reddit. The scraping and use of Reddit content without permission constitutes a violation of intellectual property rights and possibly user rights. This is a direct harm caused by the AI system's development and use. Therefore, this qualifies as an AI Incident due to the realized violation of rights through the AI system's training data acquisition practices.

Thumbnail Image

Reddit Files Lawsuit Against Perplexity Over Unlawful Data Scraping - Blockonomi

2025-10-22

Blockonomi

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (Perplexity's AI answer engine) that uses scraped data from Reddit without authorization. The lawsuit alleges that this unauthorized scraping and use of data violates Reddit's rights and terms of service, constituting a breach of intellectual property rights and legal obligations. This is a direct harm caused by the AI system's development and use, meeting the criteria for an AI Incident under violations of human rights or breach of legal obligations protecting intellectual property rights. The presence of the AI system, the direct link to harm, and the legal complaint confirm this classification.

Thumbnail Image

Reddit Files Lawsuit Against Perplexity AI Over Alleged Data Scraping - Blockonomi

2025-10-23

Blockonomi

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (Perplexity's AI-powered search engine) whose use of scraped data from Reddit's user content allegedly infringes on copyright and violates legal protections. This constitutes a violation of intellectual property rights, a recognized harm under the AI Incident definition. The lawsuit and the described harm are direct consequences of the AI system's use of data. Hence, the event qualifies as an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit Dubs Perplexity AI and Data Scraping Companies 'Would-Be Bank Robbers'

2025-10-23

IPWatchdog.com | Patents & Patent Law

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (large language models) trained on data scraped from Reddit without authorization, which is a violation of intellectual property rights and possibly other legal protections. The complaint details how Perplexity AI and others circumvented technical controls to access data, leading to harm to Reddit's rights and interests. This fits the definition of an AI Incident as the AI system's development and use have directly led to a breach of obligations under applicable law protecting intellectual property rights. The presence of AI systems and the resulting legal harm are clear and direct, distinguishing this from a mere hazard or complementary information.

Thumbnail Image

Reddit Sues Perplexity For Alleged Data Scraping To Train AI Models

2025-10-22

finanzen.at

Why's our monitor labelling this an incident or hazard?

The event involves the use of AI systems trained on data allegedly obtained without permission, leading to a legal claim of copyright infringement. This constitutes a violation of intellectual property rights, which fits the definition of an AI Incident. The harm is realized as Reddit has filed a lawsuit alleging infringement, indicating the harm has occurred or is ongoing. Therefore, this event qualifies as an AI Incident.

Thumbnail Image

Reddit Sues Perplexity for Data Scraping

2025-10-24

eWEEK

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (Perplexity's generative AI model) that allegedly uses scraped Reddit content without authorization, leading to a copyright infringement lawsuit. This is a direct violation of intellectual property rights, which is one of the harms defined under AI Incidents. The lawsuit and the described unauthorized data scraping indicate that harm has already occurred, not just a potential risk. Hence, the event meets the criteria for an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit sues Perplexity for scraping data to train its AI

2025-10-22

Cryptopolitan

Why's our monitor labelling this an incident or hazard?

The event involves an AI system (Perplexity's AI-based search engine) whose development and use rely on data scraped without authorization from Reddit and other content creators. This unauthorized use of copyrighted material constitutes a breach of intellectual property rights, which is a recognized harm under the AI Incident definition. The harm is realized as it affects the economic interests and rights of content creators and publishers. Therefore, this event qualifies as an AI Incident due to the direct link between the AI system's use and the violation of rights and harm to content creators.

Thumbnail Image

Reddit Sues Perplexity AI Over Unauthorized Data Scraping Trap

2025-10-25

WebProNews

Why's our monitor labelling this an incident or hazard?

The article describes Perplexity AI's unauthorized scraping of Reddit content using AI-powered systems, which directly violates Reddit's terms and potentially copyright laws, constituting a breach of intellectual property rights. The involvement of AI in data scraping and the resulting lawsuit demonstrate realized harm linked to the AI system's use. The presence of AI is explicit, and the harm is materialized through legal and ethical violations, fitting the criteria for an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit Sues Perplexity AI Over Illegal Scraping of User Data for AI

2025-10-24

WebProNews

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (Perplexity AI) that uses scraped data to train its models. The unauthorized scraping and use of Reddit's user-generated content directly infringes on intellectual property rights and terms of service, constituting a breach of legal obligations protecting such rights. This harm is realized, not merely potential, as the lawsuit alleges ongoing unauthorized data harvesting and use. Hence, the event meets the criteria for an AI Incident because the AI system's use has directly led to a violation of intellectual property rights.

Thumbnail Image

Reddit Sues Perplexity Over Illegal AI Data Scraping from Search Results

2025-10-23

WebProNews

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (Perplexity's AI search engine) that uses scraped data from Reddit to train and generate responses. The unauthorized scraping and use of Reddit's content without permission or compensation constitutes a violation of intellectual property rights, a recognized harm under the AI Incident definition. The lawsuit indicates that this harm has already occurred, not just a potential risk, making it an AI Incident rather than a hazard or complementary information. The focus is on the direct harm caused by the AI system's use of illegally obtained data, fulfilling the criteria for an AI Incident.

Thumbnail Image

Reddit sues Perplexity over alleged data scraping for AI training

2025-10-23

Verdict

Why's our monitor labelling this an incident or hazard?

The event explicitly involves the use of AI systems (Perplexity's AI-driven search service) trained on scraped data from Reddit without authorization. The harm is a violation of intellectual property rights and possibly user rights, as Reddit claims unauthorized use of its content for AI training. This is a direct consequence of the AI system's development process. The legal action and the description of the dispute confirm that harm has occurred or is ongoing, not merely a potential risk. Hence, it meets the criteria for an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit sues Perplexity for allegedly scraping millions of user posts to train its AI model - Tech Startups

2025-10-23

Tech News | Startups News

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (Perplexity's AI models) trained on scraped Reddit data without permission, which Reddit claims is a violation of intellectual property rights and harms its user community. The lawsuit alleges direct unauthorized use of data for AI training, constituting a breach of legal obligations protecting intellectual property and user rights. This fits the definition of an AI Incident because the AI system's development and use have directly led to a violation of rights and harm to the community. Although the harm is legal and community-based rather than physical, it is clearly articulated and pivotal to the case. Hence, the classification as AI Incident is appropriate.

Thumbnail Image

Reddit sues AI company Perplexity and others for 'industrial-scale' scraping of user comments

2025-10-22

NewsTimes

Why's our monitor labelling this an incident or hazard?

The event involves the use of AI systems (Perplexity's AI chatbot) trained on unlawfully scraped data from Reddit, which is a direct violation of intellectual property rights and unfair competition. The harm is realized as Reddit has filed a lawsuit alleging these violations and economic harm. The AI system's development and use rely on this scraped data, making the AI system's involvement pivotal to the harm. Hence, this is an AI Incident under the definitions provided, specifically under violations of intellectual property rights (c).

Thumbnail Image

Reddit Sues Perplexity in Shocking AI Data Scraping Scandal

2025-10-23

International Business Times UK

Why's our monitor labelling this an incident or hazard?

The event involves an AI system (Perplexity AI) whose development and use rely on data scraped without authorization from Reddit, violating intellectual property rights and user consent. The unauthorized data scraping and use for AI training directly implicate legal and ethical harms related to rights violations. Since the harm (violation of intellectual property and user rights) has already occurred through the unauthorized data use, this qualifies as an AI Incident rather than a hazard or complementary information. The lawsuit itself is a response to this realized harm, not merely a potential future risk or a general update.

Thumbnail Image

Reddit sues AI company Perplexity and others for 'industrial-scale' scraping of user comments

2025-10-22

Winnipeg Sun

Why's our monitor labelling this an incident or hazard?

The event involves AI systems (AI chatbots) trained on scraped data. The scraping is unauthorized and bypasses protections, leading to a breach of Reddit's rights over its content. This is a direct or indirect violation of intellectual property rights and possibly user privacy, which fits the definition of an AI Incident. The involvement of AI companies and data scrapers in acquiring data for AI training links the AI system development and use to the harm. Therefore, this is classified as an AI Incident.

Thumbnail Image

Reddit Sues Perplexity and Data Scrapers for 'Industrial-Scale' AI Content Theft - WinBuzzer

2025-10-23

WinBuzzer

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (Perplexity's AI search engine) and their use in scraping data unlawfully, which is directly linked to violations of intellectual property rights under copyright law. The harm is realized, as Reddit's content was scraped without permission, bypassing technological measures, and used to train AI models, constituting a breach of legal obligations protecting intellectual property. This meets the criteria for an AI Incident because the AI system's use has directly led to a violation of intellectual property rights, a recognized harm under the framework. The event is not merely a potential risk or a complementary update but a concrete legal action addressing actual harm caused by AI system misuse.

Reddit Sues Perplexity, Others Over Alleged Data Scraping

2025-10-22

Claims Journal

Why's our monitor labelling this an incident or hazard?

The event involves AI systems indirectly because the scraped data is used to train AI models, which is central to the dispute. However, the article does not describe any realized harm such as injury, rights violations, or operational disruption caused by the AI systems themselves. Instead, it focuses on alleged illegal data scraping and copyright infringement, which are legal and intellectual property issues. Since the harm is alleged and the event is a legal action concerning data use rather than a direct or indirect harm caused by AI system operation, this fits best as Complementary Information. It provides context on governance and legal responses to AI data practices but does not report an AI Incident or AI Hazard per the definitions.

Thumbnail Image

Reddit sues AI company Perplexity and others for 'industrial-scale' scraping of user comments

2025-10-22

Owensboro Messenger-Inquirer

Why's our monitor labelling this an incident or hazard?

The event centers on the development and use of an AI system that allegedly scraped data unlawfully, which constitutes a breach of intellectual property rights and possibly user rights. This fits the definition of an AI Incident because the AI system's use has directly led to a violation of intellectual property rights (harm category c). The lawsuit indicates that the harm has occurred or is ongoing, not just a potential risk. Therefore, this is classified as an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit sues AI company and data scraping company for allegedly training chatbot on user comments | FOX 28 Spokane

2025-10-22

FOX 28 Spokane

Why's our monitor labelling this an incident or hazard?

The lawsuit alleges unauthorized use of user-generated content to train AI systems, which constitutes a violation of intellectual property rights and possibly user privacy rights. This is a direct legal challenge related to the development and use of AI systems. Since the event involves alleged violations of rights due to AI system training, it qualifies as an AI Incident under the category of violations of human rights or breach of obligations under applicable law protecting intellectual property rights.

Thumbnail Image

Reddit sues AI company Perplexity and others for 'industrial-scale' scraping of user comments

2025-10-22

WCBI TV | Your News Leader

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (Perplexity AI chatbot) that use scraped Reddit user comments for training, which Reddit alleges was done unlawfully and at an industrial scale. This scraping bypasses Reddit's protections and involves evasion tactics, indicating misuse of AI-related data acquisition methods. The harm includes violation of intellectual property rights and user content rights, which fits the definition of an AI Incident. The lawsuit and the described harm are concrete and ongoing, not merely potential or speculative, thus it is not an AI Hazard or Complementary Information.

Thumbnail Image

Internet platform Reddit has filed a lawsuit against Perplexity

2025-10-23

Financial World

Why's our monitor labelling this an incident or hazard?

The event involves AI systems because the scraped data is allegedly used to train AI models, which fits the definition of AI system involvement. The lawsuit concerns the development and use of AI systems based on data obtained without authorization, which could constitute a violation of intellectual property rights if proven. However, the article does not describe any direct or indirect harm caused by the AI systems' outputs or use, only the alleged unauthorized data collection. Therefore, this event does not meet the criteria for an AI Incident, as no harm has yet materialized. It also does not describe a plausible future harm scenario beyond the ongoing legal dispute. The main focus is on the legal and governance response to the alleged unauthorized data scraping for AI training, which enhances understanding of AI ecosystem challenges and responses. Hence, this event is best classified as Complementary Information.

Thumbnail Image

Reddit's data becomes a battleground in the AI gold rush

2025-10-24

semafor.com

Why's our monitor labelling this an incident or hazard?

The article explicitly mentions AI systems (large language models, AI search engines) trained or operating on Reddit data without authorization, leading to lawsuits. This constitutes a violation of intellectual property rights due to unauthorized use of data for AI development and deployment. Since the unauthorized use has already occurred and legal action is underway, this is a realized harm under the framework's definition of AI Incident (violation of intellectual property rights).

Thumbnail Image

Reddit sues Perplexity and others for allegedly stealing data via Google

2025-10-25

Cybernews

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (Perplexity's AI answer engine) that use scraped data from Reddit to train their models. The unauthorized scraping and use of Reddit's copyrighted content constitute a violation of intellectual property rights, which is a recognized harm under the AI Incident definition. The harm is realized as the data scraping and use have already taken place, and Reddit is suing to stop this unlawful activity. Although the harm is not physical or health-related, it is a breach of legal protections for intellectual property rights caused by the AI system's development and use. Hence, this event meets the criteria for an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit Lawsuit Perplexity Data Scraping - News Directory 3

2025-10-25

News Directory 3

Why's our monitor labelling this an incident or hazard?

The event involves AI systems indirectly because the scraped data is used to train AI models, which is central to the dispute. However, the article does not describe any realized harm such as injury, rights violations, or operational disruption caused by the AI systems themselves. Instead, it focuses on the legal and ethical debate over data ownership and usage rights in AI development. Since no direct or indirect harm has yet occurred or been reported, but there is a plausible risk of harm related to unauthorized data use, this situation is best classified as Complementary Information. It provides important context and updates on governance and legal responses to AI data practices rather than reporting an AI Incident or AI Hazard.

Thumbnail Image

Reddit Sues Perplexity Over Data Scraping

2025-10-24

Buttercup

Why's our monitor labelling this an incident or hazard?

The event involves AI systems because the scraped data is used to train AI models, which fits the definition of AI system involvement. The lawsuit alleges violations of intellectual property rights and commercial harm, which are forms of harm under the AI Incident definition. However, since the article focuses on the filing of the lawsuit and the legal dispute rather than confirmed realized harm caused by the AI system's use, it is best classified as Complementary Information. It provides important context on societal and governance responses to AI data use and potential harms but does not document a confirmed AI Incident or an AI Hazard at this stage.

Thumbnail Image

Reddit Sues Perplexity Over AI Scraping - News Directory 3

2025-10-23

News Directory 3

Why's our monitor labelling this an incident or hazard?

While the event involves AI systems (AI training on scraped data), the main focus is on the legal dispute over unauthorized data collection rather than a direct or indirect harm caused by the AI systems. There is no indication that the AI systems have caused injury, rights violations, or other harms as defined for an AI Incident, nor is there a clear plausible future harm described that would qualify as an AI Hazard. Therefore, this event is best classified as Complementary Information, as it provides context on legal and governance responses related to AI data practices.

Thumbnail Image

'Would-Be Bank Robbers': Reddit Sues Perplexity, Data Firms Over AI Scraping

2025-10-23

News Flash

Why's our monitor labelling this an incident or hazard?

The lawsuit alleges that data firms illegally scraped Reddit's copyrighted content, which was then used by Perplexity to train AI systems. This unauthorized use of copyrighted material for AI training is a violation of intellectual property rights, a recognized harm under the AI Incident definition. Since the scraping and use of data has already occurred and led to legal action, this qualifies as an AI Incident rather than a potential hazard or complementary information.

Thumbnail Image

Reddit sets trap to catch Perplexity scraping its data from Google Search

2025-10-23

THE DECODER

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (Perplexity's AI search) and concerns the unauthorized use of data to train or operate that AI system, which constitutes a breach of intellectual property rights. This breach is a harm under the framework's category (c) violations of human rights or breach of obligations under applicable law intended to protect intellectual property rights. Since the harm (illegal data scraping and use) has occurred and is central to the event, this qualifies as an AI Incident.

Thumbnail Image

Why Reddit is suing Perplexity and others over industrial-scale content scraping?

2025-10-23

storyboard18.com

Why's our monitor labelling this an incident or hazard?

The event involves the use of AI systems (training AI models) and alleges unauthorized data scraping, which constitutes a violation of intellectual property rights. Since the lawsuit is filed based on past unauthorized scraping and use of content, this indicates realized harm related to breach of intellectual property rights. Therefore, this qualifies as an AI Incident because the AI system's development (training) has directly led to a breach of intellectual property rights protected by law.

Thumbnail Image

Reddit Sues AI Company Perplexity And Others For 'Industrial-Scale' Scraping Of User Comments

2025-10-23

ETV Bharat News

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (Perplexity AI's chatbot and answer engine) and their use of scraped data from Reddit without authorization. The lawsuit alleges direct harm through copyright violations and unfair competition, which are breaches of legal obligations protecting intellectual property rights. Since the harm has already occurred (the scraping and use of data), and the AI system's development and use are central to the incident, this qualifies as an AI Incident under the framework. The involvement of data-scraping companies supporting AI training further supports the classification as an incident rather than a hazard or complementary information.

Reddit Sues Perplexity, Others Over Alleged Data Scraping

2025-10-23

NDTV Profit

Why's our monitor labelling this an incident or hazard?

The lawsuit alleges that AI companies have been using Reddit's data without authorization to train AI models, which constitutes a breach of intellectual property rights. The involvement of AI systems in training on scraped data directly links the AI system's development and use to the harm (violation of rights). The event is not merely a potential risk but an ongoing legal action based on actual data scraping and use, indicating realized harm. Hence, it meets the criteria for an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit Sues Perplexity & AI Companies for Scraping Comments - News Directory 3

2025-10-22

News Directory 3

Why's our monitor labelling this an incident or hazard?

The event involves AI systems (large language models trained on scraped data) and concerns about unauthorized data use, which relates to intellectual property rights (a form of legal rights). However, the article does not report actual harm caused by the AI systems' outputs or use, only the legal action taken by Reddit to prevent unauthorized scraping and enforce its policies. The focus is on the legal and governance response to AI data practices, making it Complementary Information rather than an AI Incident or AI Hazard. There is no direct or indirect harm realized yet, nor a clear plausible future harm described beyond the legal dispute context.

Thumbnail Image

Reddit sues AI company Perplexity and others for 'industrial-scale' scraping of user comments

2025-10-22

2 News Nevada

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (Perplexity AI's chatbot) and the unauthorized scraping of data to train these systems, which is a direct violation of Reddit's rights. The harm is realized as Reddit has filed a lawsuit alleging unlawful scraping and commercial exploitation of user comments, which constitutes a breach of intellectual property rights and possibly user privacy. The involvement of AI in the development and use of these chatbots is central to the incident, fulfilling the criteria for an AI Incident under the OECD framework.

Thumbnail Image

Reddit sues Perplexity for allegedly ripping its content to feed AI

2025-10-22

News Flash

Why's our monitor labelling this an incident or hazard?

The event involves AI systems indirectly, as the data scraping is alleged to be used to train AI models, which fits the definition of AI system involvement. The harm alleged is a violation of intellectual property rights due to unauthorized data scraping and use, which is a recognized AI Incident category. However, since the event is about a lawsuit and allegations without confirmed realized harm or direct causation of harm by the AI system's outputs, it does not meet the threshold for an AI Incident. It also does not describe a plausible future harm scenario but rather a current legal dispute. Thus, it fits best as Complementary Information, detailing governance and legal responses to AI-related data use issues.

Thumbnail Image

Reddit 控告 Perplexity 涉嫌盜用內容供 AI 使用

2025-10-22

Yahoo News

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (Perplexity's AI answer engine) and their development/use relying on scraped copyrighted content from Reddit without authorization. This unauthorized use of copyrighted material for AI training is a breach of intellectual property rights, which is a recognized harm under the AI Incident definition. The lawsuit and allegations indicate that the harm has already occurred (copyright infringement), not just a potential risk. Therefore, this qualifies as an AI Incident due to violation of intellectual property rights caused by the AI system's development and use.

Thumbnail Image

《GTA6》电臀舞玩法被辟谣：网友瞎编的结果谷歌AI全信了

2025-10-22

驱动之家

Why's our monitor labelling this an incident or hazard?

An AI system (Google's AI) was involved in processing and summarizing online content, which included false claims about a game feature. The AI's malfunction in discerning credible sources directly led to the spread of misinformation, which can be considered harm to communities by misleading users. Therefore, this qualifies as an AI Incident due to the AI system's role in propagating false information that misinforms the public.

Thumbnail Image

Reddit起诉Perplexity，指控后者未经授权为AI抓取和使用数据

2025-10-23

凤凰网（凤凰新媒体）

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems since Perplexity is an AI company using scraped data to train AI models. The lawsuit alleges unauthorized use of data, which constitutes a violation of intellectual property rights if proven. However, the article focuses on the legal action and allegations rather than describing any direct or indirect harm caused by the AI system's outputs or use. Since no harm has yet materialized but there is a plausible risk of harm related to unauthorized data use in AI training, this event fits the definition of an AI Hazard rather than an AI Incident or Complementary Information.

Thumbnail Image

Reddit起诉AI搜索引擎Perplexity非法抓取其数据 - FT中文网

2025-10-23

英国金融时报中文版

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (Perplexity's AI search engine) whose development involved scraping data from Reddit without authorization. This unauthorized data use is a breach of intellectual property rights, which is a recognized harm under the AI Incident framework. The lawsuit indicates that the harm has already occurred (the data scraping for training), not just a potential future risk. Hence, it qualifies as an AI Incident rather than a hazard or complementary information.

Thumbnail Image

曾提付費合作遭拒！Reddit控告Perplexity偷數據訓練自家AI | 鉅亨網 - 美股雷達

2025-10-22

Anue鉅亨

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (AI search engines) trained on data scraped without permission from Reddit, a content owner. The unauthorized use of copyrighted content for AI training is a breach of intellectual property rights, which is a recognized harm under the AI Incident definition. The lawsuit and accusations indicate that the harm has already occurred, not just a potential risk. Therefore, this qualifies as an AI Incident due to violations of intellectual property rights caused by the AI system's development and use.

Thumbnail Image

Reddit控告Perplexity AI與3家資料爬梳公司

2025-10-23

iThome Online

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems, specifically AI search/answer engines that rely on data scraped from Reddit to train or support their models. The unauthorized scraping and use of Reddit content for AI training or support constitutes a violation of intellectual property rights, which is a recognized harm under the AI Incident definition (c). The lawsuit and claims indicate that harm has occurred due to the AI system's development and use involving unauthorized data. Therefore, this qualifies as an AI Incident because the AI system's development and use have directly or indirectly led to a breach of intellectual property rights and commercial harm to Reddit.

Thumbnail Image

智通财经APP获悉，社交媒体平台Reddit(RDDT.US)于周三在纽约联邦法院对人工智能初创公司Perplexity提起诉讼，指控该公司及其他三家企业非法抓取其数据用于训练Perplexity基于AI的搜索引擎。Reddit在诉状中称，这些数......

2025-10-23

证券之星

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (Perplexity's AI search engine) trained using data scraped without authorization from Reddit, a violation of intellectual property rights. The harm is realized, as the unauthorized data scraping and use have already happened, and Reddit is pursuing legal action for damages and to stop further use. This fits the definition of an AI Incident because the AI system's use has directly led to a breach of intellectual property rights, a recognized harm category. The event is not merely a potential risk or a complementary update but a concrete incident involving harm.

Thumbnail Image

Reddit 的 AI 聊天機器人建議用戶嘗試海洛因

2025-10-22

Gamereactor China

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (Reddit Answers) that generated harmful medical advice, including recommending heroin use, which poses direct health risks to users. The AI's development and use led to the dissemination of unsafe health information, fulfilling the criteria for harm to health (a). The AI system's malfunction or inadequate safeguards caused this harm, and the platform's mitigation efforts indicate acknowledgment of the incident. Hence, this is an AI Incident rather than a hazard or complementary information.

Thumbnail Image

社媒平台Reddit起诉Perplexity，指控后者非法窃取数据用于训练AI

2025-10-23

新浪财经

Why's our monitor labelling this an incident or hazard?

The event involves AI systems (AI-based search engines) trained on data allegedly obtained illegally from Reddit. The unauthorized data scraping and use for AI training is a breach of intellectual property rights, which falls under harm category (c) in the AI Incident definition. Since the lawsuit is filed due to this unauthorized use, the harm is realized and ongoing, making this an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit(RDDT.US)状告AI独角兽Perplexity：指控其非法抓取数据训练搜索引擎

2025-10-23

k.sina.com.cn

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (Perplexity's AI-based search engine) whose development and use rely on data allegedly obtained illegally from Reddit. The unauthorized scraping and use of copyrighted content for AI training is a violation of intellectual property rights, which is a form of harm recognized under the AI Incident definition. The lawsuit and the described circumstances indicate that the harm has already occurred (the use of data without permission), not just a potential future risk. Hence, this is an AI Incident rather than a hazard or complementary information.

Thumbnail Image

GTA6电臀舞按钮系网络实验，AI误读致谣言扩散

2025-10-23

ai.zol.com.cn

Why's our monitor labelling this an incident or hazard?

An AI system (AI-driven search and summarization) was involved in processing and amplifying inaccurate information originating from a social media experiment. This led to the indirect harm of misinformation spreading among the public, affecting community understanding and trust. Since the AI system's malfunction or misinterpretation directly contributed to the dissemination of false information, which is a harm to communities, this qualifies as an AI Incident.

Thumbnail Image

Reddit宣布起诉Perplexity等未经授权抓取数据训练AI模型 - cnBeta.COM 移动版

2025-10-23

cnBeta.COM

Why's our monitor labelling this an incident or hazard?

The event involves AI systems because the scraped data is used to train AI models, which fits the definition of AI system development and use. The unauthorized data scraping and use without permission constitute a violation of intellectual property rights and legal obligations, which is a form of harm under the framework. However, the article does not report actual harm caused by the AI systems' outputs or their deployment, only the legal action and allegations regarding data acquisition. Since the main focus is on the legal dispute and the companies' responses, and no direct or indirect harm from AI system use or malfunction is described, this fits the category of Complementary Information, which covers legal proceedings and governance responses related to AI.

Thumbnail Image

Reddit起诉Perplexity非法抓取数据

2025-10-24

ai.zol.com.cn

Why's our monitor labelling this an incident or hazard?

The event involves AI system development and use, specifically the training of AI models on scraped data from Reddit without authorization. This implicates intellectual property rights and legal compliance issues. However, the article does not describe any direct or indirect harm that has materialized from this activity, only the legal action taken by Reddit. Therefore, it does not meet the criteria for an AI Incident, which requires realized harm. Nor does it describe a plausible future harm scenario beyond the legal dispute, so it is not an AI Hazard. The main focus is on the legal and governance response to AI data usage practices, making it Complementary Information.

Thumbnail Image

Reddit起诉多家公司非法抓取数据用于AI训练

2025-10-26

ai.zol.com.cn

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems being trained on data scraped without authorization, which is a breach of intellectual property and platform rights, thus constituting harm under category (c) of AI Incidents. The involvement of AI systems in the development and use stages is clear, and the harm is realized through unauthorized data use. Therefore, this is classified as an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit demanda a Perplexity y otras empresas de IA por extracción masiva de comentarios de usuarios

2025-10-22

Yahoo!

Why's our monitor labelling this an incident or hazard?

The event involves AI systems (chatbots and AI models) trained on data extracted from Reddit without permission, which Reddit alleges is illegal and violates intellectual property rights. The unauthorized extraction and use of user comments for commercial AI training purposes directly breaches legal protections and user rights. This constitutes a violation of intellectual property rights (a category of harm under AI Incident definition). The involvement of AI systems in the development and use stages is explicit, and the harm (legal violation) is realized and central to the event. Hence, it is classified as an AI Incident.

Thumbnail Image

Reddit demanda a Perplexity AI y otros por presunta extracción de datos Por Investing.com

2025-10-22

Investing.com Español

Why's our monitor labelling this an incident or hazard?

The event involves AI systems (Perplexity AI) using data scraped without authorization, which constitutes a violation of intellectual property rights under applicable law. The unauthorized data extraction and use directly relate to the development and use of AI systems, leading to a breach of legal protections for content creators and platform users. This fits the definition of an AI Incident because the AI system's use has directly led to a violation of intellectual property rights, a recognized harm under the framework. Although the harm is legal and rights-based rather than physical, it is clearly articulated and pivotal to the incident. Therefore, this event qualifies as an AI Incident.

Thumbnail Image

Reddit demanda a Perplexity por usar los comentarios de sus usuarios para entrenar a su IA sin autorización

2025-10-24

La Nacion

Why's our monitor labelling this an incident or hazard?

The article explicitly involves AI systems (chatbots and AI engines trained on user-generated content) and their development/use through data extraction. The harm described is a violation of intellectual property rights and unauthorized use of user data, which is a recognized form of harm under the framework. However, the article reports on a lawsuit alleging these actions rather than confirmed harm or consequences resulting from the AI system's use. Since the harm is not yet realized but plausibly could occur if the unauthorized use continues, the event fits the definition of an AI Hazard. It is not Complementary Information because the main focus is the legal claim of unauthorized data use, not a response or update to a prior incident. It is not Unrelated because AI systems and their training data are central to the dispute.

Thumbnail Image

Reddit acaba de demandar a Perplexity: el mensaje es claro, si usas mis datos sin pagar, prepara tus abogados

2025-10-23

Xataka

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (Perplexity AI) that uses scraped data to power its AI model. The lawsuit alleges unauthorized use of copyrighted content, which is a breach of intellectual property rights, a recognized harm under the AI Incident definition. The harm is realized, not just potential, as Reddit has identified actual unauthorized data scraping and use. The involvement of AI in the use of the scraped data for training or powering the AI system is central to the incident. Hence, this is an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Tu conversación vale millones y podría ser tratada como propiedad intelectual

2025-10-23

El Financiero

Why's our monitor labelling this an incident or hazard?

The event involves the use of AI systems trained on data scraped without permission, which directly breaches intellectual property rights and licensing agreements. This unauthorized use of data for AI training is a violation of applicable law protecting intellectual property rights, fitting the definition of an AI Incident. Although the article does not describe physical harm or operational disruption, the legal and rights violation harm is clear and realized. Therefore, the event qualifies as an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit demanda a Perplexity y otras empresas de IA por extracción masiva de comentarios

2025-10-22

Cadena 3 Argentina

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (chatbots trained on Reddit data) and alleges that the companies extracted data illegally to train these AI systems, violating Reddit's rights. This constitutes a breach of intellectual property rights, which is one of the harms defined under AI Incidents. The involvement of AI is central, and the harm is realized (legal violation and unauthorized data use). Hence, the event qualifies as an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit demanda a Perplexity y otras empresas de IA por extracciÃ³n masiva de comentarios de usuarios

2025-10-23

Revista Proceso

Why's our monitor labelling this an incident or hazard?

The event involves AI systems (chatbots and AI models) trained on data scraped from Reddit without permission, which Reddit claims is illegal and violates intellectual property and user rights. The use of AI systems in this unauthorized manner has directly led to a breach of rights, fulfilling the criteria for an AI Incident under violations of intellectual property and fundamental rights. The lawsuit and the described unauthorized data extraction indicate that harm has already occurred, not just a potential risk, so it is not merely a hazard or complementary information. Hence, the classification is AI Incident.

Thumbnail Image

Reddit demanda a Perplexity y a otras tres compañías por presunto robo de datos

2025-10-23

Diario La República

Why's our monitor labelling this an incident or hazard?

The article involves AI systems indirectly through the use of scraped data for AI training, which is central to AI development. The legal action concerns alleged copyright infringement, a violation of intellectual property rights, which is a recognized harm category. However, the article does not report that the AI systems have directly or indirectly caused harm yet, only that unauthorized data use is alleged and being contested legally. This makes it a governance and legal response update rather than a direct AI Incident or a plausible future hazard. Hence, it fits the definition of Complementary Information.

Thumbnail Image

Reddit demanda a Perplexity por extraer contenido sin permiso - PasionMóvil

2025-10-23

PasionMovil

Why's our monitor labelling this an incident or hazard?

The event involves AI systems (Perplexity AI's response engine) that use scraped data from Reddit, implicating AI system development and use. The alleged harm is violation of intellectual property rights and unauthorized data extraction, which fits the definition of harm under AI Incident criteria. However, the article reports a legal complaint and dispute without confirmed or adjudicated harm or direct consequences yet. The focus is on the legal challenge and industry implications, making it a governance and societal response to AI-related data use issues. Therefore, it is Complementary Information rather than an AI Incident or AI Hazard.

Thumbnail Image

Reddit acaba de demandar a Perplexity: el mensaje es claro, si usas mis datos sin pagar, prepara tus abogados

2025-10-24

Prensa Mercosur - Imprensa Mercosul El diario online del MERCOSUR

Why's our monitor labelling this an incident or hazard?

The article explicitly involves AI systems (Perplexity AI's model trained on scraped Reddit data) and describes a legal dispute over unauthorized data scraping and copyright infringement. However, the event does not describe any direct or indirect harm caused by the AI system's outputs or malfunction, nor does it describe a plausible future harm scenario from the AI system's use. Instead, it details a legal and governance response to the use of data in AI training. This fits the definition of Complementary Information, as it informs about societal and governance responses to AI-related challenges without reporting a new AI Incident or AI Hazard.

Thumbnail Image

Reddit processa empresas por 'raspagem ilegal de dados' usados por OpenAI e Meta; entenda

2025-10-23

O Globo

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (OpenAI, Meta, Perplexity) trained on data scraped from Reddit without permission. The scraping and subsequent use of this data for AI training constitutes a breach of intellectual property rights and unauthorized use of data, which are harms under the AI Incident definition. The lawsuit confirms that harm has occurred, not just potential harm. Hence, this is an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit processa Perplexity por roubo de dados para IA * Tecnoblog

2025-10-23

Tecnoblog

Why's our monitor labelling this an incident or hazard?

The article explicitly involves AI systems (Perplexity's AI models) that use scraped data from Reddit without authorization, leading to a violation of intellectual property rights and terms of service. The harm is realized and ongoing, as Reddit is seeking damages and injunctions. The AI system's use of illegally obtained data directly causes the harm. This fits the definition of an AI Incident due to breach of intellectual property rights caused by AI system use.

Thumbnail Image

Briga judicial: Reddit acusa scrapers de vender dados para treinar IA

2025-10-22

Olhar Digital - O futuro passa primeiro aqui

Why's our monitor labelling this an incident or hazard?

The article explicitly mentions AI systems being trained on scraped Reddit data without license, constituting unauthorized use of intellectual property. This use has led Reddit to seek legal redress for damages and to stop the scraping. The AI systems' development and use are directly linked to the harm (violation of rights and financial loss). Hence, this qualifies as an AI Incident under the framework definitions.

Thumbnail Image

Reddit processa Perplexity e outras startups por compra e venda de dados roubados para treinar IAs

2025-10-23

Estadão

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (AI chatbots and models trained on scraped data) and describes a direct harm: violation of intellectual property and data rights by unauthorized scraping and resale of data for AI training. The legal action confirms the harm has occurred, not just a potential risk. Hence, it meets the criteria for an AI Incident due to the direct link between AI system development/use and breach of rights.

Thumbnail Image

Reddit processa a Perplexity por usar publicações de utilizadores para treinar o seu sistema de IA

2025-10-23

Jornal Expresso

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (Perplexity's AI-powered search engine) whose development and use depend on data scraped from Reddit without authorization. This unauthorized data collection constitutes a violation of intellectual property rights, a recognized harm under the AI Incident definition. The legal action and accusations confirm that harm has occurred, not just a potential risk. Hence, this is an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit processa Perplexity e acusa a IA de roubar dados através da Google | TugaTech

2025-10-22

TugaTech

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems trained on data scraped without authorization from Reddit, which is a violation of intellectual property rights. The use of AI models trained on such data directly leads to a breach of legal protections. The lawsuit indicates that the scraping and AI training have already taken place, so the harm is realized, not just potential. Hence, this is an AI Incident due to violation of intellectual property rights caused by the development and use of AI systems trained on illegally obtained data.

Thumbnail Image

Reddit processa a Perplexity por supostamente copiar conteúdo para alimentar IA

2025-10-22

Portal Tela

Why's our monitor labelling this an incident or hazard?

The article explicitly involves an AI system (Perplexity's AI models) that allegedly used Reddit's content without authorization to train its models, constituting a violation of intellectual property rights. This is a direct harm caused by the AI system's use, meeting the criteria for an AI Incident. The legal action and the dispute over data usage confirm that harm has occurred, not just a potential risk, distinguishing this from a hazard or complementary information.

Thumbnail Image

Reddit, gata să se "ia la trântă" prin sălile de judecată cu Perplexity. Cum s-a ajuns la această situație?

2025-10-23

PLAYTECH.ro

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (Perplexity's AI-powered response engine and similar systems) that have used Reddit's content without license, leading to a legal claim of intellectual property rights violation. The unauthorized scraping and use of data for AI training and response generation constitute a breach of rights under applicable law. This is a direct harm caused by the AI system's use, fulfilling the criteria for an AI Incident. The event is not merely a potential risk or a complementary update but a concrete legal action based on realized harm.

Thumbnail Image

Reddit dă în judecată Perplexity pentru extragerea de date în scopul antrenării inteligenței artificiale

2025-10-22

Mediafax.ro

Why's our monitor labelling this an incident or hazard?

The event involves an AI system (Perplexity's AI response system) that uses scraped data from Reddit without authorization to train its model. This unauthorized data extraction and use infringe on Reddit's intellectual property rights, which is a recognized form of harm under the AI Incident definition (violation of intellectual property rights). The harm is realized as Reddit has filed a lawsuit alleging these violations, indicating the misuse has already occurred. Hence, this is not merely a potential risk but an actual incident involving AI misuse leading to legal harm.

Thumbnail Image

Reddit dă în judecată Perplexity pentru extragerea de date în scopul antrenării IA

2025-10-22

Stiri pe surse

Why's our monitor labelling this an incident or hazard?

The event involves the use of AI systems (Perplexity's AI search engine) trained on data scraped from Reddit without authorization, which is a violation of intellectual property rights. The lawsuit claims that this unauthorized data extraction has already occurred and is being used to train AI, constituting realized harm. This fits the definition of an AI Incident because the AI system's development and use have directly led to a breach of legal protections for intellectual property. The presence of AI is explicit, the harm is realized, and the legal action confirms the seriousness of the violation.

Thumbnail Image

Reddit dă în judecată Perplexity şi alte companii AI pentru preluarea neautorizată a conţinutului

2025-10-24

Monitorul de Galaţi

Why's our monitor labelling this an incident or hazard?

The event involves AI systems used by Perplexity and others to extract and use Reddit content without authorization, which is a breach of intellectual property rights. The use of AI for data scraping and training without consent directly violates Reddit's licensing terms and legal rights. The harm is realized as Reddit has filed a lawsuit seeking damages and injunctions, indicating actual infringement and harm. Therefore, this qualifies as an AI Incident due to the direct involvement of AI systems in causing a violation of intellectual property rights.

Thumbnail Image

Reddit dă în judecată Perplexity pentru extragerea de date în scopul antrenării IA - Stiripesurse.md

2025-10-23

Stiripesurse.md

Why's our monitor labelling this an incident or hazard?

The event involves an AI system (Perplexity's AI search engine) whose development and use rely on illegally scraped data from Reddit, a violation of intellectual property rights. The lawsuit indicates that the AI system's training involved unauthorized use of protected content, directly leading to a breach of legal rights. This fits the definition of an AI Incident as it involves harm through violation of intellectual property rights caused by the AI system's development and use. The presence of the AI system is explicit, the harm is realized (legal violation), and the connection to the AI system is direct and central to the incident.

Thumbnail Image

Reddit подала до суду на Perplexity через нібито збір дописів користувачів для навчання ШІ

2025-10-23

LIGA

Why's our monitor labelling this an incident or hazard?

The event involves the development and use of an AI system trained on data allegedly collected without authorization, infringing Reddit's intellectual property rights. The AI system's training process is directly linked to the harm (copyright violation). The presence of AI is explicit (AI model training), and the harm is a breach of intellectual property rights, which is one of the defined harms for an AI Incident. Therefore, this event qualifies as an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit подає до суду на Perplexity та ще три компанії за "промислове" збирання коментарів користувачів -- Delo.ua

2025-10-23

delo.ua

Why's our monitor labelling this an incident or hazard?

The event involves AI systems (chatbots trained on scraped data) and describes a direct violation of intellectual property rights and unauthorized data use, which are harms under the AI Incident definition (c). The scraping and data use are integral to the AI system's development and operation. The harm is realized, as Reddit has filed a lawsuit alleging these violations. Thus, this is an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit подала до суду на Perplexity

2025-10-23

ZN.UA

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems trained on data scraped without permission, leading to a violation of intellectual property rights, which is a recognized harm under the AI Incident definition (c). The lawsuit alleges direct involvement of AI system development using illegally obtained content, which is a direct cause of harm. The presence of AI systems is clear as the content is used for AI training. The harm is realized, not just potential, as the unauthorized data use has already occurred. Hence, this is classified as an AI Incident.

Thumbnail Image

Reddit подала позов проти Perplexity через використання її контенту для навчання ШІ

2025-10-23

ms.detector.media

Why's our monitor labelling this an incident or hazard?

The event involves the use of AI systems trained on data scraped without authorization, leading to a violation of intellectual property rights, which is a recognized form of harm under the AI Incident definition. The lawsuit alleges direct unauthorized use of Reddit's content for AI training, constituting a breach of legal protections. Therefore, this qualifies as an AI Incident due to realized harm linked to AI system development and use.

Thumbnail Image

Reddit подає в суд на Perplexity за нібито крадіжку контенту

2025-10-23

HiTech.Expert

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (Perplexity's AI response system) that uses data scraped without authorization from Reddit, leading to a violation of intellectual property rights. The lawsuit alleges direct use of this data in the AI system's operation, constituting harm under the framework's category (c) violations of intellectual property rights. The involvement of AI in the development and use of the system is clear, and the harm is realized, not just potential. Hence, this is an AI Incident rather than a hazard or complementary information.

Thumbnail Image

Reddit подає до суду на Perplexity за незаконне використання її контенту для навчання ШІ

2025-10-22

Mezha.Media

Why's our monitor labelling this an incident or hazard?

The event explicitly involves AI systems (Perplexity's AI response mechanism) trained on data scraped without authorization from Reddit, which is copyrighted content. This unauthorized use constitutes a violation of intellectual property rights, a recognized harm under the AI Incident definition. The lawsuit alleges direct use of Reddit content in AI training, indicating realized harm rather than potential harm. Therefore, this qualifies as an AI Incident due to the direct link between AI system development/use and violation of rights.

Thumbnail Image

Reddit CEO on data scraping lawsuits against AI companies: 'We see both sides of this'

2025-10-30

CNBC

Why's our monitor labelling this an incident or hazard?

The event involves AI companies and data scraping allegations, but it does not describe any realized harm or plausible future harm caused by AI systems. The discussion centers on legal disputes and business relationships, which fits the definition of Complementary Information as it provides context and updates related to AI ecosystem developments without reporting an AI Incident or Hazard.

Thumbnail Image

Reddit CEO Addresses Lawsuits Against AI Firms: 'Our Duty Is to Protect Our Data'

2025-10-31

IVCPOST

Why's our monitor labelling this an incident or hazard?

The article explicitly involves AI systems (AI firms training models on scraped data) and alleges violations of intellectual property rights, which is a recognized harm category. However, the article does not describe a concrete AI Incident where harm has already occurred due to AI system outputs or malfunction, nor does it describe a plausible future harm scenario independent of the legal dispute. Instead, it focuses on the ongoing lawsuits and the company's efforts to protect its data, which is a governance and legal response to AI-related challenges. Hence, the event is best classified as Complementary Information.