Instagram's AI Fails to Block Child Abuse Ads in India, Prompting Government Action

Thumbnail Image

The information displayed in the AIM should not be reported as representing the official views of the OECD or of its member countries.

A BBC investigation revealed that Instagram's AI-driven ad moderation failed to block paid advertisements promoting child sexual abuse material in India. The Indian government has summoned Meta to explain how these ads bypassed AI safeguards, raising concerns about the effectiveness of Instagram's content moderation systems.[AI generated]

Why's our monitor labelling this an incident or hazard?

The event explicitly involves an AI system (Instagram's moderation technology) that approved harmful ads promoting child sexual abuse material, which is illegal and causes significant harm to children and society. The AI system's malfunction or failure to block these ads directly led to the dissemination of harmful content, fulfilling the criteria for an AI Incident under violations of human rights and harm to communities. The harm is realized, not just potential, and the AI system's role is pivotal in the incident.[AI generated]
AI principles
SafetyRespect of human rights

Industries
Media, social platforms, and marketing

Affected stakeholders
Children

Harm types
Human or fundamental rightsPsychological

Severity
AI incident

Business function:
Marketing and advertisement

AI system task:
Recognition/object detection


Articles about this incident or hazard