AI Code Assistants Exploited for Backdoor Injection and Malware Generation

Thumbnail Image

The information displayed in the AIM should not be reported as representing the official views of the OECD or of its member countries.

Security researchers warn that AI-powered code assistants, such as those integrated with IDEs, are vulnerable to exploitation by threat actors. Attackers can contaminate external data sources with malicious prompts, leading these assistants to inject backdoors, leak sensitive data, and generate harmful code, compromising software security and developer trust.[AI generated]

Why's our monitor labelling this an incident or hazard?

The article explicitly discusses how the development and use of AI code assistants have led to security vulnerabilities that can be exploited to insert malicious code (backdoors) into software projects, leak sensitive information, and enable unauthorized access to AI models. These outcomes constitute direct or indirect harm to users and their property through compromised systems and data breaches. The presence of actual attacks or demonstrated exploits (even if simulated) and the discussion of real-world implications and mitigations indicate that these are AI Incidents rather than mere potential hazards or complementary information. Therefore, the event qualifies as an AI Incident due to realized or imminent harm caused by AI system misuse and vulnerabilities.[AI generated]

AI principles

AccountabilityPrivacy & data governanceRespect of human rightsRobustness & digital securitySafetyTransparency & explainability

Industries

Digital security

Affected stakeholders

WorkersBusiness

Harm types

Economic/PropertyReputationalHuman or fundamental rights

Severity

AI incident

Business function:

Research and development

AI system task:

Content generation

Articles about this incident or hazard

Thumbnail Image

The Risks of Code Assistant LLMs: Harmful Content, Misuse and Deception

2025-09-15

Unit42

Why's our monitor labelling this an incident or hazard?

The article explicitly discusses how the development and use of AI code assistants have led to security vulnerabilities that can be exploited to insert malicious code (backdoors) into software projects, leak sensitive information, and enable unauthorized access to AI models. These outcomes constitute direct or indirect harm to users and their property through compromised systems and data breaches. The presence of actual attacks or demonstrated exploits (even if simulated) and the discussion of real-world implications and mitigations indicate that these are AI Incidents rather than mere potential hazards or complementary information. Therefore, the event qualifies as an AI Incident due to realized or imminent harm caused by AI system misuse and vulnerabilities.

Thumbnail Image

Threat Actors and Code Assistants: The Hidden Risks of Backdoor Injections - IT Security News

2025-09-16

IT Security News - cybersecurity, infosecurity news

Why's our monitor labelling this an incident or hazard?

The event involves the use and misuse of AI systems (AI code assistants) leading to potential or actual harm, including security breaches and production of harmful code. Since the article describes active exploitation and resulting risks, this constitutes an AI Incident due to the direct or indirect harm caused by the AI system's misuse or vulnerabilities.

Thumbnail Image

Cyber pros warn of vibe coding dangers: chatbots fetching malicious instructions for attackers

2025-09-16

Cybernews

Why's our monitor labelling this an incident or hazard?

The article explicitly involves AI systems (AI code assistants using large language models) and details how their use can lead to direct harms such as malware generation and system compromise. The harms described include unauthorized access, data leakage, and potential system infection, which qualify as harm to property and security. The researchers provide concrete examples and warnings, indicating that these harms are not merely theoretical but have been demonstrated or are actively plausible. Hence, this qualifies as an AI Incident due to realized or ongoing harm caused by the AI system's misuse or exploitation.

Thumbnail Image

Threat Actors Could Misuse Code Assistant To Inject Backdoors and Generating Harmful Content

2025-09-16

Cyber Security News

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (AI-driven coding assistants) whose misuse leads to the generation and execution of malicious backdoor code, causing harm to software security and potentially to users relying on the compromised software. The harm is realized, not just potential, as the malicious code executes automatically when accepted by developers. This fits the definition of an AI Incident because the AI system's use directly leads to harm (security breach and unauthorized access).

Thumbnail Image

Palo Alto warns of AI coding assistant backdoors

2025-09-18

SC Media

Why's our monitor labelling this an incident or hazard?

The article explicitly involves AI systems (AI-powered coding assistants) and highlights a vulnerability that could plausibly lead to harm (injection of backdoors and malicious content). Although no direct harm has been reported, the potential for such harm is credible and significant, fitting the definition of an AI Hazard rather than an Incident or Complementary Information. The focus is on the plausible risk arising from the AI system's use and its exploitation by adversaries.