
The information displayed in the AIM should not be reported as representing the official views of the OECD or of its member countries.
French company Mistral AI is accused of training its large language model, Mistral Large 3-2512, on thousands of copyrighted books, songs, and articles without permission. Investigations revealed the AI reproduces substantial portions of protected works, violating intellectual property rights and disregarding opt-out requests from content owners.[AI generated]
Why's our monitor labelling this an incident or hazard?
The article explicitly documents that Mistral AI's generative models have been trained on copyrighted materials without authorization, reproducing large portions of protected works and song lyrics, which is a direct violation of intellectual property rights. Additionally, the company has scraped website content despite explicit opt-out requests, further breaching legal rights. These infringements are directly caused by the development and use of Mistral AI's systems, fulfilling the criteria for an AI Incident under violations of intellectual property rights. The harms are actual and ongoing, not merely potential, and the AI system's role is pivotal in causing these harms.[AI generated]