Apple Sued for Scraping YouTube Videos to Train AI Models

Thumbnail Image

The information displayed in the AIM should not be reported as representing the official views of the OECD or of its member countries.

Apple faces a class action lawsuit in the United States after YouTube creators accused the company of scraping millions of copyrighted YouTube videos, bypassing anti-scraping protections, to train its AI models using the Panda-70M dataset. Plaintiffs allege this violates copyright law and seek damages and an injunction.[AI generated]

Why's our monitor labelling this an incident or hazard?

The lawsuit explicitly alleges that Apple's AI system was trained using copyrighted videos scraped from YouTube without authorization, violating copyright protections and the DMCA. This is a direct violation of intellectual property rights, which is a recognized harm under the AI Incident definition. The involvement of AI in the development and use of the system is clear, and the harm is realized as the plaintiffs claim substantial profit by Apple without compensation. Hence, this is an AI Incident rather than a hazard or complementary information.[AI generated]
AI principles
Privacy & data governanceTransparency & explainability

Industries
Media, social platforms, and marketing

Affected stakeholders
Workers

Harm types
Economic/Property

Severity
AI incident

Business function:
Research and development

AI system task:
Other


Articles about this incident or hazard

Thumbnail Image

YouTube channels including H3H3 sue Apple over use of videos to train AI

2026-04-06
Notebookcheck
Why's our monitor labelling this an incident or hazard?
The lawsuit explicitly alleges that Apple's AI system was trained using copyrighted videos scraped from YouTube without authorization, violating copyright protections and the DMCA. This is a direct violation of intellectual property rights, which is a recognized harm under the AI Incident definition. The involvement of AI in the development and use of the system is clear, and the harm is realized as the plaintiffs claim substantial profit by Apple without compensation. Hence, this is an AI Incident rather than a hazard or complementary information.
Thumbnail Image

Three YouTubers accuse Apple of illegal scraping to train its AI models

2026-04-06
engadget
Why's our monitor labelling this an incident or hazard?
The event explicitly involves AI systems being trained using scraped copyrighted content without permission, which is a violation of intellectual property rights. This fits the definition of an AI Incident because the AI system's development has directly led to a breach of obligations under applicable law protecting intellectual property rights. The harm is realized in the form of alleged copyright infringement, and the involvement of AI in the training process is central to the incident. Hence, the classification is AI Incident.
Thumbnail Image

Apple Sued by Three YouTube Channels

2026-04-06
MacRumors
Why's our monitor labelling this an incident or hazard?
The event explicitly involves AI systems (Apple's AI models) trained on copyrighted content obtained through alleged unlawful scraping. This constitutes a violation of intellectual property rights, a recognized harm under the AI Incident definition. The lawsuit indicates that the harm has already occurred, not just a potential risk, making this an AI Incident rather than a hazard or complementary information. The involvement of AI in the harm is direct, as the AI models were trained on the copyrighted videos, leading to the legal claim.
Thumbnail Image

Apple accused of scraping millions of YouTube videos for AI training - 9to5Mac

2026-04-06
9to5Mac
Why's our monitor labelling this an incident or hazard?
The event involves the development and use of AI systems trained on a large dataset of YouTube videos allegedly obtained by circumventing protections, which constitutes a breach of intellectual property rights and applicable law. This meets the definition of an AI Incident under category (c) for violations of intellectual property rights. The harm is realized as the lawsuit claims actual use of the data in AI training, not just potential misuse. Therefore, this is an AI Incident rather than a hazard or complementary information.
Thumbnail Image

AppleInsider.com

2026-04-06
AppleInsider
Why's our monitor labelling this an incident or hazard?
The event explicitly involves an AI system ('Apple AI Video') trained on a dataset (Panda-70M) composed of scraped YouTube videos, which were obtained by circumventing protection measures. This constitutes the development and use of an AI system that directly led to a violation of intellectual property rights, a recognized harm under the AI Incident definition. The lawsuit details actual harm to content creators and legal claims against Apple, confirming the realized harm rather than a potential risk. Hence, the classification as an AI Incident is appropriate.
Thumbnail Image

Apple Faces Lawsuit for Allegedly Using YouTube Videos to Train AI

2026-04-06
The Mac Observer
Why's our monitor labelling this an incident or hazard?
The event explicitly involves the use of AI systems trained on copyrighted content without consent, leading to alleged violations of intellectual property rights. This harm has already occurred as the lawsuit claims direct unauthorized use and commercial benefit from creators' content. Therefore, this qualifies as an AI Incident due to the realized violation of rights caused by the AI system's development and use.
Thumbnail Image

Is the Apple AI lawsuit tied to Panda-70M YouTube scraping for AI training?

2026-04-07
The Cryptonomist
Why's our monitor labelling this an incident or hazard?
The event explicitly involves AI systems trained on a dataset created by scraping YouTube videos, which is alleged to have been done without authorization, violating copyright protections. The harm is a breach of intellectual property rights, a recognized category of AI Incident harm. The complaint details the use of AI in training video generation models using this data, linking the AI system's development and use directly to the alleged harm. Thus, this is an AI Incident rather than a hazard or complementary information, as the harm is realized and the AI system's role is pivotal.