Twitter's Automated Moderation Linked to Surge in Harmful Content

The information displayed in the AIM should not be reported as representing the official views of the OECD or of its member countries.

Following Elon Musk's acquisition, Twitter shifted to AI-driven automated moderation, reducing manual reviews and favoring content visibility restrictions over removals. This approach coincided with a reported surge in hate speech and child exploitation material, raising concerns about the effectiveness and unintended consequences of AI moderation on user safety and benign content.[AI generated]

Why's our monitor labelling this an incident or hazard?

The article explicitly mentions that Twitter replaced much of its human moderation staff with automated systems for content moderation. This AI system's malfunction or inadequate performance has directly led to increased hate speech and harmful content on the platform, causing harm to communities and violating rights. The harm is realized and ongoing, meeting the criteria for an AI Incident. The AI system's role is pivotal as it automates moderation decisions that have resulted in increased harmful content.[AI generated]

AI principles

SafetyRespect of human rightsAccountabilityTransparency & explainabilityRobustness & digital securityFairnessDemocracy & human autonomyHuman wellbeing

Industries

Media, social platforms, and marketingDigital security

Affected stakeholders

General publicChildren

Harm types

PsychologicalHuman or fundamental rightsPublic interestReputational

Business function:

Monitoring and quality controlICT management and information security

AI system task:

Recognition/object detectionOrganisation/recommendersEvent/anomaly detectionOther

Twitter is leaning too much on machine-based content moderation. Here's why this is problematic- Technology News, Firstpost

2022-12-05

Firstpost

Just one person remains on Twitter's Asia child safety team, report says, despite Elon Musk saying dealing with child abuse is his biggest priority

2022-11-29

Business Insider

Twitter's Automated Moderation Linked to Surge in Harmful Content

Why's our monitor labelling this an incident or hazard?

Articles about this incident or hazard

Twitter si affida all'intelligenza artificiale per non diventare il social dell'odio e delle fake news

Twitter, più intelligenza artificiale contro i post vietati - Hi-tech

Twitter, con Musk triplica il discorso d'odio. Lo staff di moderazione: "Ha automatizzato il processo"

Twitter: più IA per bloccare i contenuti vietati

Twitter, più automazione e meno "umani" per moderare i contenuti

Twitter executive says moving fast on moderation, as harmful content surges

Exclusive-Twitter exec says moving fast on moderation, as harmful content surges

Twitter executive says moving fast on moderation, as harmful content surges

Twitter moderators turn to automation amid a reported surge in hate speech

Twitter is now relying more on AI to identify harmful content, says its new trust and safety chief

Exclusive-Twitter exec says moving fast on moderation, as harmful content surges

Twitter executive says moving fast on moderation, as harmful content surges

Twitter Under Elon Musk "Fast, Aggressive" With Harmful Content: Top Executive

Twitter's Top Executive Says Moving Fast on Moderation, as Harmful Content Increases

Twitter moves to automated moderation as hate speech surges

Twitter to rely more on AI than staff to detect hate speech as racism reports grow

Twitter exec says it's moving fast on moderation as harmful content surges

Twitter under Elon Musk fast, aggressive with harmful content, says top executive

Twitter Under Elon Musk More Empowered, Aggressive Against Hateful Content: Top Executive

Twitter exec says moving fast on moderation, as harmful content surges

Twitter is leaning too much on machine-based content moderation. Here's why this is problematic- Technology News, Firstpost

Content moderation: Twitter turns to AI to combat hate speech, racism

Twitter exec says moving fast on moderation, as harmful content surges

Twitter Moves To Automate Its Moderation Systems - SlashGear

Elon Musk's Twitter Banks on Automation to Moderate Content, Combat Hate Speech

Twitter exec says moving fast on moderation, as harmful content surges

Twitter is now relying more on AI to identify harmful content, says its new trust and safety chief | Business Insider

Automated detection will be integral to Twitter content moderation

Exclusive-Twitter exec says moving fast on moderation, as harmful content surges

Twitter exec says moving fast on moderation, as harmful content surges&nbsp;

Twitter is now relying more on AI to identify harmful content, says its new trust and safety chief

Twitter moves to automated moderation as hate speech surges - WSTale.com

El efecto Elon Musk: Discursos de odio se dispararon en Twitter tras la compra de la red social

Los discursos de odio se disparan en Twitter con Elon Musk, según expertos

Los discursos de odio se disparan en Twitter con Elon Musk, según expertos

Discursos racistas y homofóbicos se dispararon en Twitter con Elon Musk: The New York Times

Discurso de odio Un fenómeno que se dispara desde la llegada de Musk

NY Times revela que se han disparado los discursos de odio en Twitter con nueva gestión de Musk

Los discursos de odio se disparan en Twitter tras su adquisición por Elon Musk

Aseguran que los discursos de odio se dispararon en Twitter tras la compra de Elon Musk | Mundo

Los discursos de odio se han disparado en Twitter desde la llegada de Elon Musk

El discurso de odio se dispara en Twitter con Elon Musk, según expertos

Los discursos de odio se disparan en Twitter con Elon Musk, según expertos

Los discursos de odio se disparan en Twitter con Elon Musk, según expertos - Revista Summa

Expertos: discursos de odio se disparan con Elon Musk al frente de Twitter

Just one person remains on Twitter's Asia child safety team, report says, despite Elon Musk saying dealing with child abuse is his biggest priority

Elon Musk's job cuts decimated Twitter team tackling child sexual abuse

Elon Musk fans claim he's already eliminated child abuse material on Twitter -- experts say otherwise

Layoffs Have Gutted Twitter's Child Safety Team

Elon Musk's job cuts decimated Twitter team tackling child sexual abuse

Musk's job cuts decimated Twitter team tackling child sexual abuse - Portland Press Herald

Musk's job cuts decimated Twitter team tackling child sexual abuse - Lewiston Sun Journal

Twitter exec says moving fast on moderation, as harmful content surges