Researchers Demonstrate Inaudible Audio Attacks That Hijack AI Voice Assistants

Thumbnail Image

The information displayed in the AIM should not be reported as representing the official views of the OECD or of its member countries.

Researchers from China and Singapore have demonstrated that inaudible, adversarial audio signals embedded in podcasts, videos, or music can hijack AI voice assistants and smart speakers. These attacks can trigger unauthorized actions, such as accessing sensitive data or sending information to attackers, highlighting serious security vulnerabilities in AI-powered audio systems.[AI generated]

Why's our monitor labelling this an incident or hazard?

The event involves AI systems explicitly—voice AI models used in smart speakers and phones. The hackers' adversarial audio exploits the AI system's malfunction or vulnerability to cause unauthorized actions, directly leading to harm in terms of privacy breaches and potential access to sensitive information. The harm is realized or highly plausible given the demonstrated attack method. Therefore, this qualifies as an AI Incident due to direct harm caused by the AI system's exploitation.[AI generated]

AI principles

Privacy & data governanceRobustness & digital security

Industries

Consumer productsDigital security

Affected stakeholders

Consumers

Harm types

Human or fundamental rights

Severity

AI incident

AI system task:

Interaction support/chatbots

Articles about this incident or hazard

Thumbnail Image

Inaudible audio files may become a new tool for AI and computer Hijacking: Report

2026-05-24

Firstpost

Why's our monitor labelling this an incident or hazard?

The event involves AI systems explicitly—open-source audio AI models—and their use and potential malfunction due to adversarial inaudible audio inputs. The researchers demonstrated that these AI systems can be manipulated to perform forbidden actions, which could indirectly lead to harm such as unauthorized access to devices and data breaches, constituting violations of user rights and security. Since the attacks have been demonstrated but no actual incidents of harm have been reported, this qualifies as an AI Hazard: a plausible future risk of AI-related harm. The article also includes responses from Microsoft, indicating ongoing efforts to mitigate the risk, but the main focus is on the demonstrated vulnerability and its potential consequences rather than on realized harm or remediation, supporting the classification as an AI Hazard rather than an Incident or Complementary Information.

Thumbnail Image

Hackers Find That Inaudible Sounds Hidden in Podcasts or Random Videos Can Hijack Your AI Voice Chatbot

2026-05-24

Futurism

Why's our monitor labelling this an incident or hazard?

The event involves AI systems explicitly—voice AI models used in smart speakers and phones. The hackers' adversarial audio exploits the AI system's malfunction or vulnerability to cause unauthorized actions, directly leading to harm in terms of privacy breaches and potential access to sensitive information. The harm is realized or highly plausible given the demonstrated attack method. Therefore, this qualifies as an AI Incident due to direct harm caused by the AI system's exploitation.

Thumbnail Image

Hidden Audio Attacks Could Turn AI Assistants Into Security Risks

2026-05-26

ChannelNews

Why's our monitor labelling this an incident or hazard?

The article explicitly involves AI systems—specifically AI voice assistants and audio-based AI models—and describes how adversarial audio inputs can manipulate these systems to cause harm. The harms include unauthorized actions by the AI (downloading malicious files, exposing user data), which are direct harms to users' security and privacy, fitting the definition of injury or harm to persons or groups. The AI system's malfunction or misuse is central to the incident. Therefore, this is an AI Incident rather than a hazard or complementary information.

Thumbnail Image

AI voice bots hijacked by 'hidden' sounds in podcasts, MP3 files and YouTube clips

2026-05-24

Cybernews

Why's our monitor labelling this an incident or hazard?

The event involves AI systems explicitly—voice assistants and AI meeting transcribers capable of interpreting audio and interacting with external tools. The demonstrated attack manipulates these AI systems to carry out unauthorized actions, which can lead to harm such as data breaches and privacy violations. The harm is realized in the sense that the AI systems are manipulated to perform harmful actions, even if the attack is currently a proof-of-concept. This fits the definition of an AI Incident as the AI system's use has directly led to harm or unauthorized actions. The event is not merely a potential hazard or complementary information but a concrete demonstration of an exploit causing harm through AI misuse.

Thumbnail Image

Inaudible background sounds in videos could be used to hack smart speakers

2026-05-25

The Independent

Why's our monitor labelling this an incident or hazard?

The event involves AI systems explicitly (AI-powered smart speakers and assistants using large language models with audio-text integration). The described adversarial audio attacks represent a use and potential misuse of AI systems that could plausibly lead to harm, such as unauthorized access to personal information and malicious actions by hijacked AI agents. Since the study reveals vulnerabilities and demonstrates successful hijacking in controlled tests but does not report actual incidents of harm, this qualifies as an AI Hazard rather than an AI Incident. The event is not merely complementary information because it focuses on the newly identified threat and its implications rather than updates or responses to past incidents.

Thumbnail Image

Inaudible Audio Attacks Can Hijack AI Voice Models, Study Finds - Decrypt

2026-05-26

Decrypt

Why's our monitor labelling this an incident or hazard?

The event involves AI systems explicitly—large audio-language models (LALMs) that process spoken commands. The attack manipulates the AI system's input to cause harmful outputs, including misinformation dissemination, unauthorized actions, and privacy violations. These harms fall under (a) injury or harm to persons (e.g., privacy breaches), and (d) harm to communities (e.g., spreading false information). Since the attack has been demonstrated successfully and is ongoing, it is a realized harm, not just a potential risk. Therefore, this qualifies as an AI Incident because the AI system's use and malfunction (due to adversarial manipulation) have directly led to significant harms.

Thumbnail Image

How Inaudible Sounds in Videos and Podcasts Can Hack AI Assistants - News Directory 3

2026-05-27

News Directory 3

Why's our monitor labelling this an incident or hazard?

The event involves AI systems (voice assistants and chatbots) being compromised through malicious inaudible audio inputs crafted with advanced signal processing. The attacks have directly led or could lead to harms including unauthorized access to sensitive data, manipulation of AI-driven decisions, and breaches of security, which fall under violations of rights and harm to property or systems. Since the article discusses actual exploitation and the resulting security threats, this qualifies as an AI Incident rather than a mere hazard or complementary information.