Google Gemini AI Vulnerability Exploited via Malicious Notifications on Android

Thumbnail Image

The information displayed in the AIM should not be reported as representing the official views of the OECD or of its member countries.

Security researchers from SafeBreach discovered a vulnerability in Google Gemini's Android voice assistant, where crafted notifications from apps like WhatsApp, Slack, and SMS could inject malicious prompts. This allowed attackers to manipulate Gemini, potentially leading to unauthorized actions, privacy breaches, and data exfiltration. Google has since patched the flaw.[AI generated]

Why's our monitor labelling this an incident or hazard?

The event involves an AI system (Google Gemini's voice assistant) whose malfunction or exploitation (prompt injection vulnerability) could directly lead to significant harms including unauthorized physical control of devices, privacy violations, and impersonation attacks. These harms fall under injury or harm to persons (privacy and security breaches) and harm to property or communities (unauthorized control of smart home devices). Since the vulnerability was exploited in demonstrations and poses a direct risk of harm, this qualifies as an AI Incident. The patching and responsible disclosure are complementary information but do not negate the incident classification.[AI generated]
AI principles
Privacy & data governanceRobustness & digital security

Industries
Digital security

Affected stakeholders
Consumers

Harm types
Human or fundamental rights

Severity
AI incident

AI system task:
Interaction support/chatbots


Articles about this incident or hazard

Thumbnail Image

Google Gemini security flaw lets hackers hijack your Android phone via WhatsApp -- what you need to know

2026-06-04
Tom's Guide
Why's our monitor labelling this an incident or hazard?
The event involves an AI system (Google Gemini's voice assistant) whose malfunction or exploitation (prompt injection vulnerability) could directly lead to significant harms including unauthorized physical control of devices, privacy violations, and impersonation attacks. These harms fall under injury or harm to persons (privacy and security breaches) and harm to property or communities (unauthorized control of smart home devices). Since the vulnerability was exploited in demonstrations and poses a direct risk of harm, this qualifies as an AI Incident. The patching and responsible disclosure are complementary information but do not negate the incident classification.
Thumbnail Image

Hackers could use poisoned WhatsApp and Slack notifications to take over your Google Gemini - and make it work on their behalf

2026-06-04
TechRadar
Why's our monitor labelling this an incident or hazard?
The event involves an AI system (Google Gemini) and a prompt injection attack that could directly lead to harm by enabling unauthorized access to user data and control over the AI system. The vulnerability was exploited through malicious notifications that trick the AI into executing harmful commands. This constitutes an AI Incident because the AI system's malfunction (its inability to distinguish instructions from data) directly led to a security risk that could harm users' privacy and data security. The fact that the issue was patched after disclosure does not negate the incident classification, as the harm or risk was realized or imminent before the fix.
Thumbnail Image

Malicious WhatsApp, Slack Alerts Could Have Exposed Millions of Android Users

2026-06-04
TechRepublic
Why's our monitor labelling this an incident or hazard?
The event involves an AI system (Google Gemini) that processes notifications and generates responses. The described flaw allowed malicious notifications to manipulate the AI's behavior, which could have led to harms such as unauthorized control of smart home devices, social engineering attacks, and misinformation. However, the issue was mitigated before any exploitation occurred, and no evidence of real-world harm was found. Thus, the event represents a credible risk of harm (AI Hazard) rather than an actual harm (AI Incident).
Thumbnail Image

Malicious Notifications Could Trick Google Gemini Users

2026-06-03
Dark Reading
Why's our monitor labelling this an incident or hazard?
The event involves an AI system (Google Gemini voice assistant) and a novel prompt injection attack that could plausibly lead to harm such as unauthorized control of devices, social engineering, and poisoning of AI memory. Although the attack was demonstrated in research and not observed in the wild, the credible risk and the need for vendor mitigation classify this as an AI Hazard. There is no indication that harm has already occurred, so it is not an AI Incident. The article focuses on the risk and mitigation rather than a societal or governance response, so it is not Complementary Information. Hence, AI Hazard is the appropriate classification.
Thumbnail Image

WhatsApp, Slack, SMS Notifications Could Hijack Google Gemini on Android

2026-06-04
TechNadu
Why's our monitor labelling this an incident or hazard?
The event explicitly involves an AI system (Google Gemini AI) and details how its malfunction and exploitation via crafted notifications lead to direct harms including unauthorized control of devices, social engineering, and data exfiltration risks. These harms affect user security, privacy, and potentially broader community trust, fitting the definition of an AI Incident. The involvement of the AI system's use and malfunction in causing these harms confirms this classification.
Thumbnail Image

Researchers Show How Android Notifications Could Be Used to Manipulate Google Gemini - IT Security News

2026-06-04
IT Security News - cybersecurity, infosecurity news
Why's our monitor labelling this an incident or hazard?
The event involves an AI system (Google Gemini) that processes notifications and generates outputs influencing user interactions. The researchers demonstrated that maliciously crafted notifications could manipulate the AI assistant to produce false information and potentially unauthorized actions, which constitutes harm to users (misinformation and possible unauthorized commands). The harm is direct and linked to the AI system's malfunction and misuse. The flaw has been remediated, but the incident itself involved realized harm and risk. Hence, it meets the criteria for an AI Incident rather than a hazard or complementary information.
Thumbnail Image

New Google Gemini Vulnerability Exploited via Prompt Injections from WhatsApp, Slack, and SMS

2026-06-03
Cyber Security News
Why's our monitor labelling this an incident or hazard?
The event involves an AI system (Google Gemini voice assistant) whose malfunction or exploitation (via indirect prompt injection attacks) has directly led to significant harms including privacy violations, unauthorized control of devices, and social engineering attacks. These harms fall under violations of human rights (privacy) and harm to communities (security breaches). Since the harm is realized and the AI system's exploitation is central to the incident, this qualifies as an AI Incident.
Thumbnail Image

Android Gemini prompt injection flaw patched by Google

2026-06-04
SC Media
Why's our monitor labelling this an incident or hazard?
The event involves an AI system (Gemini AI assistant) whose malfunction (inability to distinguish legitimate data from malicious instructions) directly led to a security vulnerability that could cause harm by leaking user contacts to attackers. Although the patch has been applied, the event describes a realized security flaw that could lead to harm, thus qualifying as an AI Incident due to the direct link between the AI system's malfunction and potential harm to users' privacy and security.