Security Risks of Autonomous AI Agents Using MCP in Enterprises

Thumbnail Image

The information displayed in the AIM should not be reported as representing the official views of the OECD or of its member countries.

Enterprises rapidly adopting AI agents via Model Context Protocol (MCP) face significant security risks. These autonomous agents, with high-level access to sensitive systems, outpace existing security controls, creating vulnerabilities such as indirect prompt injection, potential data breaches, and operational disruptions. No actual incident reported, but credible hazards exist.[AI generated]

Why's our monitor labelling this an incident or hazard?

The event involves an AI system component—the Model Context Protocol used by AI assistants to connect to external tools—and details how its exploitation leads to direct harm by enabling attackers to intercept and manipulate AI interactions. The described attack can cause violations of data privacy, unauthorized control over AI-driven processes, and harm to enterprise operations, fitting the definition of an AI Incident. The vulnerability has been demonstrated and is not merely a theoretical risk, indicating realized or ongoing harm potential. Therefore, this event qualifies as an AI Incident rather than a hazard or complementary information.[AI generated]
AI principles
Robustness & digital securityPrivacy & data governance

Industries
Digital securityIT infrastructure and hosting

Affected stakeholders
Business

Harm types
Economic/PropertyHuman or fundamental rights

Severity
AI incident

Business function:
ICT management and information security

AI system task:
Goal-driven organisation


Articles about this incident or hazard

Thumbnail Image

Enterprise MCP adoption is outpacing security controls

2026-02-27
VentureBeat
Why's our monitor labelling this an incident or hazard?
The event involves AI systems (autonomous AI agents) whose development and use in enterprises are outpacing existing security controls, creating a permissive environment that could plausibly lead to significant harms such as data breaches or unauthorized access. Although no actual incident of harm is described, the article presents a credible and foreseeable risk of harm due to the current lack of frameworks and controls for these AI agents. Therefore, this situation fits the definition of an AI Hazard, as it plausibly could lead to an AI Incident if not addressed.
Thumbnail Image

The MCP Trojan Horse: AI's Hidden Security Risk

2026-02-26
JFrog
Why's our monitor labelling this an incident or hazard?
The article explicitly involves AI systems (AI agents using MCP servers) that have autonomous capabilities and high-level access to sensitive systems. It details how these AI systems can be manipulated to perform harmful actions such as leaking proprietary code or deleting critical data. Although no specific incident of realized harm is reported, the described vulnerabilities and attack vectors present a credible risk of significant harm to enterprises, including breaches of intellectual property rights and operational disruptions. The focus is on the plausible future harm and systemic security risks inherent in the current use and governance of MCP servers with AI agents, fitting the definition of an AI Hazard rather than an AI Incident or Complementary Information.
Thumbnail Image

ClawJacked: How a Single Browser Flaw Could Let Attackers Hijack Your AI Assistant's Every Move

2026-02-28
WebProNews
Why's our monitor labelling this an incident or hazard?
The event involves an AI system component—the Model Context Protocol used by AI assistants to connect to external tools—and details how its exploitation leads to direct harm by enabling attackers to intercept and manipulate AI interactions. The described attack can cause violations of data privacy, unauthorized control over AI-driven processes, and harm to enterprise operations, fitting the definition of an AI Incident. The vulnerability has been demonstrated and is not merely a theoretical risk, indicating realized or ongoing harm potential. Therefore, this event qualifies as an AI Incident rather than a hazard or complementary information.
Thumbnail Image

What threat do MCP servers pose to AI? [Q&A]

2026-03-02
BetaNews
Why's our monitor labelling this an incident or hazard?
The event involves AI systems (LLMs and agentic AI) using MCP servers to access data and tools. The described security flaws in MCP servers have directly led to harms including unauthorized data access, credential theft, and potential misuse of AI agents, which constitute violations of data privacy and security (harm to property and communities). Therefore, this qualifies as an AI Incident because the development and use of MCP servers have directly led to realized harms. The article also discusses responses and mitigation but the primary focus is on the realized security incidents and their implications.