Researchers Demonstrate Inaudible Audio Attacks That Hijack AI Voice Assistants

Thumbnail Image

The information displayed in the AIM should not be reported as representing the official views of the OECD or of its member countries.

Researchers from China and Singapore have demonstrated that inaudible, adversarial audio signals embedded in podcasts, videos, or music can hijack AI voice assistants and smart speakers. These attacks can trigger unauthorized actions, such as accessing sensitive data or sending information to attackers, highlighting serious security vulnerabilities in AI-powered audio systems.[AI generated]

Why's our monitor labelling this an incident or hazard?

The event involves AI systems explicitly—voice AI models used in smart speakers and phones. The hackers' adversarial audio exploits the AI system's malfunction or vulnerability to cause unauthorized actions, directly leading to harm in terms of privacy breaches and potential access to sensitive information. The harm is realized or highly plausible given the demonstrated attack method. Therefore, this qualifies as an AI Incident due to direct harm caused by the AI system's exploitation.[AI generated]
AI principles
Privacy & data governanceRobustness & digital security

Industries
Consumer productsDigital security

Affected stakeholders
Consumers

Harm types
Human or fundamental rights

Severity
AI incident

AI system task:
Interaction support/chatbots


Articles about this incident or hazard

Thumbnail Image

Inaudible audio files may become a new tool for AI and computer Hijacking: Report

2026-05-24
Firstpost
Why's our monitor labelling this an incident or hazard?
The event involves AI systems explicitly—open-source audio AI models—and their use and potential malfunction due to adversarial inaudible audio inputs. The researchers demonstrated that these AI systems can be manipulated to perform forbidden actions, which could indirectly lead to harm such as unauthorized access to devices and data breaches, constituting violations of user rights and security. Since the attacks have been demonstrated but no actual incidents of harm have been reported, this qualifies as an AI Hazard: a plausible future risk of AI-related harm. The article also includes responses from Microsoft, indicating ongoing efforts to mitigate the risk, but the main focus is on the demonstrated vulnerability and its potential consequences rather than on realized harm or remediation, supporting the classification as an AI Hazard rather than an Incident or Complementary Information.
Thumbnail Image

Hackers Find That Inaudible Sounds Hidden in Podcasts or Random Videos Can Hijack Your AI Voice Chatbot

2026-05-24
Futurism
Why's our monitor labelling this an incident or hazard?
The event involves AI systems explicitly—voice AI models used in smart speakers and phones. The hackers' adversarial audio exploits the AI system's malfunction or vulnerability to cause unauthorized actions, directly leading to harm in terms of privacy breaches and potential access to sensitive information. The harm is realized or highly plausible given the demonstrated attack method. Therefore, this qualifies as an AI Incident due to direct harm caused by the AI system's exploitation.
Thumbnail Image

Hidden Audio Attacks Could Turn AI Assistants Into Security Risks

2026-05-26
ChannelNews
Why's our monitor labelling this an incident or hazard?
The article explicitly involves AI systems—specifically AI voice assistants and audio-based AI models—and describes how adversarial audio inputs can manipulate these systems to cause harm. The harms include unauthorized actions by the AI (downloading malicious files, exposing user data), which are direct harms to users' security and privacy, fitting the definition of injury or harm to persons or groups. The AI system's malfunction or misuse is central to the incident. Therefore, this is an AI Incident rather than a hazard or complementary information.
Thumbnail Image

AI voice bots hijacked by 'hidden' sounds in podcasts, MP3 files and YouTube clips

2026-05-24
Cybernews
Why's our monitor labelling this an incident or hazard?
The event involves AI systems explicitly—voice assistants and AI meeting transcribers capable of interpreting audio and interacting with external tools. The demonstrated attack manipulates these AI systems to carry out unauthorized actions, which can lead to harm such as data breaches and privacy violations. The harm is realized in the sense that the AI systems are manipulated to perform harmful actions, even if the attack is currently a proof-of-concept. This fits the definition of an AI Incident as the AI system's use has directly led to harm or unauthorized actions. The event is not merely a potential hazard or complementary information but a concrete demonstration of an exploit causing harm through AI misuse.
Thumbnail Image

Inaudible background sounds in videos could be used to hack smart speakers

2026-05-25
The Independent
Why's our monitor labelling this an incident or hazard?
The event involves AI systems explicitly (AI-powered smart speakers and assistants using large language models with audio-text integration). The described adversarial audio attacks represent a use and potential misuse of AI systems that could plausibly lead to harm, such as unauthorized access to personal information and malicious actions by hijacked AI agents. Since the study reveals vulnerabilities and demonstrates successful hijacking in controlled tests but does not report actual incidents of harm, this qualifies as an AI Hazard rather than an AI Incident. The event is not merely complementary information because it focuses on the newly identified threat and its implications rather than updates or responses to past incidents.