Google Blocks AI-Driven Cyberattack Exploiting Zero-Day Vulnerability

Thumbnail Image

The information displayed in the AIM should not be reported as representing the official views of the OECD or of its member countries.

Google successfully blocked a major cyberattack in which criminals used Anthropic's AI model, Mythos, to discover and attempt to exploit a previously unknown software vulnerability. The incident highlights the growing threat of AI-powered cyberattacks, particularly against critical infrastructure like banking systems.[AI generated]

Why's our monitor labelling this an incident or hazard?

The event involves the use of an AI system ('Mythos') that can identify software vulnerabilities at an unprecedented scale and speed, which could be exploited for cyberattacks. The article does not describe any realized harm but emphasizes the plausible future risk to critical financial infrastructure and consumer services. This fits the definition of an AI Hazard, as the development and use of the AI system could plausibly lead to an AI Incident involving disruption of critical infrastructure and harm to consumers. The article also mentions increased supervisory measures as a response, but the main focus is on the potential threat rather than a realized incident or complementary information about responses.[AI generated]
AI principles
Robustness & digital securitySafety

Industries
Financial and insurance servicesDigital security

Affected stakeholders
BusinessConsumers

Harm types
Economic/PropertyPublic interestHuman or fundamental rights

Severity
AI hazard

AI system task:
Content generationReasoning with knowledge structures/planning


Articles about this incident or hazard

Thumbnail Image

IA et sécurité bancaire : le superviseur financier allemand sonne l'alarme face à l'accélération de la menace

2026-05-12
Boursorama
Why's our monitor labelling this an incident or hazard?
The event involves the use of an AI system ('Mythos') that can identify software vulnerabilities at an unprecedented scale and speed, which could be exploited for cyberattacks. The article does not describe any realized harm but emphasizes the plausible future risk to critical financial infrastructure and consumer services. This fits the definition of an AI Hazard, as the development and use of the AI system could plausibly lead to an AI Incident involving disruption of critical infrastructure and harm to consumers. The article also mentions increased supervisory measures as a response, but the main focus is on the potential threat rather than a realized incident or complementary information about responses.
Thumbnail Image

Faut-il vraiment craindre Mythos, l'IA capable de détecter et d'exploiter des failles de cybersécurité ?

2026-05-12
The Conversation
Why's our monitor labelling this an incident or hazard?
Mythos is an AI system explicitly described as capable of detecting and exploiting cybersecurity vulnerabilities autonomously. The article explains that while the system has discovered and demonstrated exploits for serious vulnerabilities, these have been responsibly reported and mitigated, so no direct harm has yet occurred from its use. However, the capabilities of Mythos clearly pose a credible risk of future harm if misused, such as enabling cyberattacks that could disrupt systems or compromise data. This fits the definition of an AI Hazard: an event or circumstance where the AI system's development or use could plausibly lead to harm. Since no actual harm has been reported, it is not an AI Incident. The article also goes beyond general AI news by focusing on the risks and implications of this AI system's capabilities, so it is not merely Complementary Information or Unrelated.
Thumbnail Image

Google a confirmé les craintes liées à Mythos en révélant avoir réussi à bloquer une cyberattaque de grande envergure au cours de laquelle des cybercriminels ont utilisé l'IA pour découvrir une faille inconnue

2026-05-13
Developpez.com
Why's our monitor labelling this an incident or hazard?
The article explicitly states that criminals used an AI system (a large language model) to discover and plan to exploit a zero-day vulnerability, which is a direct cause of a cyberattack attempt. This fits the definition of an AI Incident because the AI system's use directly led to a significant harm scenario (planned exploitation of a critical security flaw). Although the attack was blocked, the AI's role in enabling the attack attempt and the potential for harm to critical infrastructure and property is clear. The event is not merely a potential hazard or complementary information but a concrete incident involving AI misuse in cybercrime.
Thumbnail Image

Préparez-vous : comment les banques se défendent pour éviter que l'IA ne vide votre compte ! | LesNews

2026-05-10
LesNews
Why's our monitor labelling this an incident or hazard?
The article explicitly mentions an AI system (Mythos) with advanced capabilities that could be exploited maliciously to cause harm, particularly in the financial sector. Although no direct harm has occurred yet, the potential for significant cybersecurity breaches and financial harm is clearly articulated and recognized by high-level officials and experts. This fits the definition of an AI Hazard, as the development and potential misuse of Mythos plausibly could lead to an AI Incident involving harm to critical infrastructure and property. The article also describes ongoing efforts to mitigate these risks, but the primary focus is on the credible threat posed by the AI system rather than on realized harm or responses to past incidents.
Thumbnail Image

0

2026-05-13
developpez.net
Why's our monitor labelling this an incident or hazard?
The article explicitly states that cybercriminals used an AI system (a large language model) to identify a zero-day vulnerability that could bypass two-factor authentication, a critical security measure for banks and enterprises. This use of AI directly led to a cyberattack attempt, which was blocked but demonstrates realized harm or at least a direct threat to critical infrastructure security. The involvement of AI in discovering and exploiting the vulnerability, the malicious intent, and the potential for significant harm to property, communities, and infrastructure clearly classify this as an AI Incident rather than a hazard or complementary information. The event is not merely a potential risk but a realized attack attempt involving AI.
Thumbnail Image

Mistral pourrait proposer sa réponse à Mythos aux banques européennes

2026-05-13
MacGeneration
Why's our monitor labelling this an incident or hazard?
The article centers on the potential security risks and strategic competition around AI tools for code auditing but does not describe any actual harm or incidents resulting from the use or malfunction of these AI systems. The concerns about misuse by hackers and government restrictions indicate plausible future risks, but no direct or indirect harm has occurred yet. Therefore, this event fits the definition of an AI Hazard, as it plausibly could lead to harm (e.g., security breaches if malicious actors gain access to such AI), but no incident has materialized at this time.
Thumbnail Image

Face à Anthropic qui ne permet toujours pas à l'UE de tester son puissant modèle d'IA, Mythos, le Français Mistral travaille sur sa propre version

2026-05-14
BFMTV
Why's our monitor labelling this an incident or hazard?
The article centers on the development and planned deployment of AI systems for cybersecurity vulnerability detection, which could plausibly lead to significant harms if misused or if vulnerabilities are exploited. However, no actual harm, malfunction, or misuse has been reported. The discussion about access limitations and strategic control over these AI tools highlights potential risks but remains in the realm of plausible future harm. Hence, this qualifies as an AI Hazard rather than an AI Incident or Complementary Information. It is not unrelated because AI systems are explicitly involved and the potential for harm is credible.