Roblox Deploys AI System Sentinel to Detect and Prevent Child Predatory Behavior

The information displayed in the AIM should not be reported as representing the official views of the OECD or of its member countries.

Roblox has launched Sentinel, an AI system that analyzes billions of chat messages in real time to detect and report potential child predatory behavior and exploitation. In 2025, Sentinel facilitated 1,200 reports of possible child exploitation, aiming to enhance safety for its predominantly young user base.[AI generated]

Why's our monitor labelling this an incident or hazard?

Roblox Sentinel is an AI system that analyzes chat messages in real time to identify potentially harmful content related to child exploitation. The system's use has directly led to the detection and reporting of numerous cases of potential exploitation, which is a form of harm to children (harm to health and safety). Therefore, this qualifies as an AI Incident because the AI system's use has directly led to harm prevention and intervention in cases of child exploitation. The article also mentions ongoing challenges but does not negate the realized impact of the AI system in detecting harm.[AI generated]

Industries

Media, social platforms, and marketingDigital security

Affected stakeholders

Children

Severity

AI incident

Business function:

Monitoring and quality control

AI system task:

Event/anomaly detection

Articles about this incident or hazard

Roblox utiliza una IA para detectar mensajes de chat que ponen en peligro a los niños

2025-08-08

infobae

Why's our monitor labelling this an incident or hazard?

¿Qué es Sentinel? La inteligencia artificial de Roblox, que protegerá a los niños de los depredadores

2025-08-08

SDPnoticias.com

Why's our monitor labelling this an incident or hazard?

The event describes the use of an AI system (Sentinel) that analyzes chat messages to detect and prevent harm to children, including exploitation and harassment. The AI's role in identifying and reporting these threats has directly led to the prevention and mitigation of harm to a vulnerable group (children), fulfilling the criteria for an AI Incident under harm to health and safety of persons. The system's deployment and its impact on reporting potential exploitation cases confirm realized harm prevention, not just potential risk.

Roblox lanza sistema IA de código abierto para proteger niños de depredadores

2025-08-07

El Vocero de Puerto Rico

Why's our monitor labelling this an incident or hazard?

The event involves the use of an AI system (Sentinel) that analyzes chat data to detect potential child predatory behavior, which directly relates to preventing harm to children (harm to health and safety). The AI system's use is explicitly described as helping to identify and report possible exploitation cases, thus directly contributing to harm prevention. Therefore, this qualifies as an AI Incident because the AI system's use is directly linked to addressing and mitigating harm to a vulnerable group (children) from predatory behavior.