Mistral AI Source Code Stolen in Major Data Breach

Thumbnail Image

The information displayed in the AIM should not be reported as representing the official views of the OECD or of its member countries.

Hackers from TeamPCP stole 450 repositories containing Mistral AI's proprietary source code and training data, threatening to leak them online if not sold. Mistral AI confirmed a breach linked to SDK contamination but stated core systems and user data were unaffected. The incident involves significant intellectual property harm.[AI generated]

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system's development environment, with hackers stealing AI-related source code and internal repositories. The breach and threat to leak proprietary AI code represent a violation of intellectual property rights, which is a recognized harm under the AI Incident definition. The involvement of AI systems is clear, as the stolen data relates to AI model training and deployment. The harm is realized (data theft) and ongoing (threat of leak), thus qualifying as an AI Incident rather than a hazard or complementary information. The company's confirmation of SDK contamination further supports the direct involvement of AI systems in the incident.[AI generated]
AI principles
Robustness & digital security

Industries
Digital securityIT infrastructure and hosting

Affected stakeholders
Business

Harm types
Economic/PropertyReputational

Severity
AI incident

Business function:
Research and development

AI system task:
Content generation


Articles about this incident or hazard

Thumbnail Image

Hackers threaten to leak Mistral files online -- AI giant confirms breach, but not what data is involved

2026-05-15
TechRadar
Why's our monitor labelling this an incident or hazard?
The event explicitly involves an AI system's development environment, with hackers stealing AI-related source code and internal repositories. The breach and threat to leak proprietary AI code represent a violation of intellectual property rights, which is a recognized harm under the AI Incident definition. The involvement of AI systems is clear, as the stolen data relates to AI model training and deployment. The harm is realized (data theft) and ongoing (threat of leak), thus qualifying as an AI Incident rather than a hazard or complementary information. The company's confirmation of SDK contamination further supports the direct involvement of AI systems in the incident.
Thumbnail Image

TeamPCP Claims Theft of Mistral AI Source Code

2026-05-14
TechNadu
Why's our monitor labelling this an incident or hazard?
The event involves the alleged theft of proprietary AI source code and training repositories, which directly relates to the development of an AI system (Mistral AI's language models). The theft of such data constitutes a violation of intellectual property rights, a recognized form of harm under the AI Incident definition. Although Mistral AI has not confirmed the breach and attributes issues to a different attack, the detailed claims and threat actor's demands indicate a realized harm scenario if true. The involvement of AI systems is explicit, and the harm is direct and significant. Therefore, this event fits the AI Incident category rather than a hazard or complementary information, as the harm is materialized or ongoing (theft and extortion).
Thumbnail Image

Mistral AI revs up expansion in Singapore with local partners | Frontier Enterprise

2026-05-12
Frontier Enterprise
Why's our monitor labelling this an incident or hazard?
The article does not report any direct or indirect harm caused by AI systems, nor does it indicate a credible risk of harm from the described activities. Instead, it highlights collaborative efforts to responsibly develop and deploy AI technologies in Singapore. Therefore, it fits the definition of Complementary Information, as it provides context and updates on AI development and governance without describing an AI Incident or AI Hazard.
Thumbnail Image

Mistral AI pitches sovereign security model to European banks

2026-05-15
Retail Banker International
Why's our monitor labelling this an incident or hazard?
The article explicitly mentions AI systems designed to identify cybersecurity weaknesses and potentially carry out autonomous attacks. Although no actual harm or incident has been reported, the AI system's capabilities and the concerns raised (including the White House's consideration of safety checks) indicate a credible risk of harm. Since the event concerns ongoing development and potential future deployment without realized harm, it fits the definition of an AI Hazard rather than an AI Incident or Complementary Information.
Thumbnail Image

La pépite française de l'intelligence artificielle Mistral AI visée par une cyberattaque

2026-05-15
Le Figaro.fr
Why's our monitor labelling this an incident or hazard?
The event involves an AI system because the stolen code samples are used to train AI models, indicating AI system development involvement. The cyberattack is a security breach affecting the AI company's development environment. However, the company states that no critical or user data was accessed and that the attack was neutralized quickly. There is no indication that the AI system malfunctioned or was misused to cause harm, nor that any harm has materialized. The potential for future harm exists if the stolen code is exploited maliciously, but this is not confirmed or realized in the article. Hence, the event fits the definition of an AI Hazard, as it plausibly could lead to an AI Incident if the stolen code is weaponized or causes downstream harm, but currently no harm has occurred.
Thumbnail Image

" Ils ont contaminé certains de nos kits de développement " : le Français Mistral AI victime d'une cyberattaque

2026-05-15
Ouest France
Why's our monitor labelling this an incident or hazard?
The event involves an AI system (AI model code) and a cyberattack that exfiltrated AI-related code, which is relevant to AI development. However, the article states no sensitive user data or essential services were compromised, and no direct or indirect harm to persons, infrastructure, or rights is reported. The company took prompt remediation measures. The event does not describe realized harm or a credible plausible future harm scenario caused by the AI system's use or malfunction. Therefore, it does not qualify as an AI Incident or AI Hazard. Instead, it provides supporting information about an AI-related security breach and the company's response, fitting the definition of Complementary Information.
Thumbnail Image

Mistral AI : la start-up française d'intelligence artificielle victime d'une cyberattaque

2026-05-14
Franceinfo
Why's our monitor labelling this an incident or hazard?
An AI system is clearly involved as Mistral AI develops AI models and the stolen files relate to code used to train their AI. However, the incident did not result in realized harm such as injury, rights violations, or disruption to critical infrastructure. The attack targeted the development environment and codebase, but the company reports no compromise of user data or operational systems. Therefore, while the event involves an AI system and a security breach, it does not meet the threshold for an AI Incident since no direct or indirect harm has occurred. It also does not qualify as an AI Hazard because the article does not indicate plausible future harm resulting from this breach. Instead, it is best classified as Complementary Information, providing an update on a security incident affecting an AI company without materialized harm.
Thumbnail Image

Mistral AI victime d'une cyberattaque

2026-05-15
linformaticien.com
Why's our monitor labelling this an incident or hazard?
The event involves an AI system because the stolen code repositories are related to AI inference, fine-tuning, and other AI tools, which are integral to AI system development and deployment. The cyberattack is a malfunction or misuse event affecting the AI system's development environment. Although Mistral AI denies that critical or essential code or user data was compromised, the hackers' claims and threats to publish the data create a plausible risk of future harm, including intellectual property violations and potential misuse of AI tools. Since no direct or indirect harm has been confirmed or reported as having occurred, the event does not meet the threshold for an AI Incident but does meet the criteria for an AI Hazard due to the credible potential for harm.
Thumbnail Image

Mistral AI touchée par une cyberattaque visant son code source - Siècle Digital

2026-05-18
Siècle Digital
Why's our monitor labelling this an incident or hazard?
The event involves a cyberattack on Mistral AI, an AI system developer, through supply chain compromise of open-source software components integral to AI system development. The attackers accessed and published corrupted development tools and stole internal code repositories related to AI inference and chatbot security, indicating a breach of intellectual property rights and security. The attack directly affected the AI system's development environment and could have led to further harm. Although no physical harm or critical infrastructure disruption is reported, the theft and compromise of AI development code and credentials constitute a violation of intellectual property rights and security obligations, fitting the definition of an AI Incident. The AI system's development was directly impacted, and harm to property (intellectual property) and potential breach of security occurred.