Twitter's Automated Moderation Linked to Surge in Harmful Content

The information displayed in the AIM should not be reported as representing the official views of the OECD or of its member countries.

Following Elon Musk's acquisition, Twitter shifted to AI-driven automated moderation, reducing manual reviews and favoring content visibility restrictions over removals. This approach coincided with a reported surge in hate speech and child exploitation material, raising concerns about the effectiveness and unintended consequences of AI moderation on user safety and benign content. [AI generated]

Why's our monitor labelling this an incident or hazard?

The article explicitly mentions that Twitter replaced much of its human moderation staff with automated systems for content moderation. This AI system's malfunction or inadequate performance has directly led to increased hate speech and harmful content on the platform, causing harm to communities and violating rights. The harm is realized and ongoing, meeting the criteria for an AI Incident. The AI system's role is pivotal as it automates moderation decisions that have resulted in increased harmful content. [AI generated]

AI principles
Safety; Respect of human rights; Accountability; Transparency & explainability; Robustness & digital security; Fairness; Democracy & human autonomy; Human wellbeing

Industries
Media, social platforms, and marketing; Digital security

Affected stakeholders
General public; Children

Harm types
Psychological; Human or fundamental rights; Public interest; Reputational

Severity
AI incident

Business function
Monitoring and quality control; ICT management and information security

AI system task
Recognition/object detection; Organisation/recommenders; Event/anomaly detection; Other


Articles about this incident or hazard

Twitter relies on artificial intelligence to avoid becoming the social network of hate and fake news

2022-12-05
Tiscali Notizie
Why's our monitor labelling this an incident or hazard?
The event involves the use of AI systems in content moderation. However, the article does not report any direct or indirect harm caused by the AI systems themselves, nor does it describe a plausible future harm stemming from AI malfunction or misuse. Instead, it highlights AI's role in mitigating harm by improving content moderation speed and effectiveness. Therefore, this is complementary information about AI's role in addressing social media harms rather than an incident or hazard.

Twitter: more artificial intelligence against banned posts - Hi-tech

2022-12-05
ANSA.it
Why's our monitor labelling this an incident or hazard?
The event involves the use of AI systems for content moderation (an AI system is explicitly mentioned). The AI is used in the operation of the platform to detect and remove harmful content, which relates to violations of rights and harm to communities. However, the article does not report any direct or indirect harm caused by the AI system itself, nor does it describe any malfunction or misuse leading to harm. Instead, it focuses on the deployment and effectiveness of AI tools to reduce harm. Therefore, this is complementary information about AI use and its impact on content moderation, not an incident or hazard.

Twitter: hate speech triples under Musk. Moderation staff: "He has automated the process"

2022-12-03
Open
Why's our monitor labelling this an incident or hazard?
The article explicitly mentions that Twitter replaced much of its human moderation staff with automated systems for content moderation. This AI system's malfunction or inadequate performance has directly led to increased hate speech and harmful content on the platform, causing harm to communities and violating rights. The harm is realized and ongoing, meeting the criteria for an AI Incident. The AI system's role is pivotal as it automates moderation decisions that have resulted in increased harmful content.

Twitter: more AI to block banned content

2022-12-04
Punto Informatico
Why's our monitor labelling this an incident or hazard?
The event involves the use of AI systems for automated content moderation on Twitter, which is explicitly mentioned. The AI system's use is part of the content moderation process aimed at preventing harm (hate speech, incitement to violence, child exploitation content). However, the reported increase in hate speech and harmful content after Musk's acquisition suggests that the AI system's deployment has indirectly led to harm to communities by failing to adequately control such content. This fits the definition of an AI Incident, as the AI system's use is directly linked to harm occurring (harm to communities through increased hate speech and harmful content). The article does not merely discuss potential future harm or governance responses but reports realized harm associated with AI system use.

Twitter: more automation and fewer "humans" to moderate content

2022-12-05
Prima Comunicazione
Why's our monitor labelling this an incident or hazard?
The article explicitly mentions AI involvement in content moderation and changes in moderation strategy. However, it does not report any harm caused by the AI system, nor does it describe a plausible future harm directly linked to the AI system's use. The increased hate speech is noted but not attributed to AI malfunction or misuse. The main focus is on describing the shift towards more automated moderation and its context, which fits the definition of Complementary Information rather than an Incident or Hazard.

Twitter executive says moving fast on moderation, as harmful content surges

2022-12-03
The Japan Times
Why's our monitor labelling this an incident or hazard?
The article explicitly mentions automation in content moderation, from which the use of AI systems for tasks like detecting and restricting harmful content can reasonably be inferred. The event involves the use of AI systems in content moderation, which can lead to violations of rights such as freedom of expression or informational harm if benign uses are restricted. However, the article does not report any specific realized harm or incident resulting from this automation, only the current operational approach and policy changes. Therefore, this is best classified as Complementary Information, providing context on governance and operational responses to AI use in content moderation rather than reporting a specific AI Incident or Hazard.

Exclusive-Twitter exec says moving fast on moderation, as harmful content surges

2022-12-03
MSN International Edition
Why's our monitor labelling this an incident or hazard?
The article explicitly discusses Twitter's use of automation (AI systems) for content moderation, including automated takedown of harmful posts and restrictions on abusive hashtags. The surge in hate speech and harmful content on the platform constitutes harm to communities. The AI system's role in moderating content is central to the event, as it both reflects the challenges of managing harmful content and the company's approach to mitigating it. Since harm is occurring and AI systems are involved in the use phase, this qualifies as an AI Incident.

Twitter executive says moving fast on moderation, as harmful content surges

2022-12-03
Economic Times
Why's our monitor labelling this an incident or hazard?
Twitter is explicitly using automation, which can reasonably be assumed to involve AI systems, for content moderation tasks including detection and restriction of harmful content. The article reports a surge in hate speech and harmful content despite these measures, indicating that the AI system's use has directly or indirectly led to harm to communities by allowing increased hateful content dissemination. The moderation approach, including automated takedowns and visibility filtering, is central to the event. Hence, the event meets the criteria for an AI Incident as the AI system's use has contributed to realized harm.

Twitter moderators turn to automation amid a reported surge in hate speech

2022-12-03
The Guardian
Why's our monitor labelling this an incident or hazard?
The article explicitly mentions the use of automation (AI systems) for content moderation on Twitter, which is a clear AI system involvement. The surge in hate speech and harmful content on the platform constitutes harm to communities, fulfilling the harm criteria. The AI system's use in moderating content is directly linked to this harm, as the automation affects how harmful content is managed and potentially contributes to the dynamics of content visibility and removal. Hence, this is an AI Incident due to realized harm involving AI system use in content moderation.

Twitter is now relying more on AI to identify harmful content, says its new trust and safety chief

2022-12-03
Business Insider
Why's our monitor labelling this an incident or hazard?
The article explicitly mentions AI being used to identify harmful content and enforce safety measures on Twitter, which involves an AI system. The harms described (hate speech, child safety violations) are ongoing and significant, but the AI system is being used to mitigate these harms rather than causing them. There is no indication that the AI system malfunctioned or caused harm; instead, it is part of the response to existing issues. The article focuses on the strategic use of AI and policy changes rather than a specific incident or hazard caused by AI. Thus, it fits the definition of Complementary Information, providing updates on AI deployment and governance in content moderation.

Exclusive-Twitter exec says moving fast on moderation, as harmful content surges

2022-12-03
Yahoo Sports Canada
Why's our monitor labelling this an incident or hazard?
The event involves the use of AI systems for automated content moderation on Twitter, which is explicitly mentioned as a key part of the platform's approach. The surge in hate speech and harmful content following changes in moderation policies and staffing indicates that the AI system's use has directly or indirectly led to harm to communities by allowing increased harmful content visibility. The article details realized harm (increased hateful content) linked to the AI system's deployment and policy changes, meeting the criteria for an AI Incident. The involvement is through the use of AI systems in content moderation, and the harm is to communities via increased hate speech and harmful content dissemination.

Twitter executive says moving fast on moderation, as harmful content surges

2022-12-03
MoneyControl
Why's our monitor labelling this an incident or hazard?
The article explicitly mentions the use of automation (AI systems) for content moderation on Twitter, which is directly involved in managing harmful content. The surge in hate speech and harmful content following changes in moderation policies and staffing indicates realized harm to communities and platform users. The AI system's role in restricting or allowing content, including the use of automated takedowns and visibility filtering, directly influences the presence and spread of harmful content. This meets the criteria for an AI Incident, as the AI system's use has directly led to harm to communities through increased exposure to hateful and abusive content.

Twitter Under Elon Musk "Fast, Aggressive" With Harmful Content: Top Executive

2022-12-03
NDTV
Why's our monitor labelling this an incident or hazard?
Twitter's content moderation relies heavily on AI automation to detect and restrict harmful content. The article highlights that despite these AI systems, there has been a reported increase in hate speech and harmful content, indicating that the AI moderation's use and possible malfunction or limitations have directly led to harm to communities through increased exposure to abusive content. This fits the definition of an AI Incident, as the AI system's use in moderation has directly influenced harm to communities by failing to adequately prevent the spread of harmful content and possibly enabling it through policy changes and automation reliance.

Twitter's Top Executive Says Moving Fast on Moderation, as Harmful Content Increases

2022-12-03
News18
Why's our monitor labelling this an incident or hazard?
Twitter's automated content moderation systems qualify as AI systems because they perform complex content analysis and decision-making to restrict or remove harmful content. The article reports a surge in hate speech and harmful content, indicating that the AI moderation system's use has directly or indirectly led to harm to communities by allowing increased dissemination of hateful and abusive content. The shift to automation and reduced human review, combined with increased harmful content, shows the AI system's role in the harm. Hence, this is an AI Incident involving harm to communities due to the AI system's use in content moderation.

Twitter moves to automated moderation as hate speech surges

2022-12-03
The Sydney Morning Herald
Why's our monitor labelling this an incident or hazard?
The event involves the use of AI systems for automated content moderation on Twitter, which is an AI system influencing virtual environments by controlling content visibility. The use of AI to restrict hate speech and abusive content is a direct application of AI in managing harmful online behavior. However, the article does not describe any specific harm caused by the AI system's malfunction or misuse, nor does it report any incident where the AI system led to injury, rights violations, or other harms. Instead, it describes a change in operational approach and policy using AI. Therefore, this is best classified as Complementary Information, as it provides context and updates on AI use in content moderation without reporting a new AI Incident or AI Hazard.

Twitter to rely more on AI than staff to detect hate speech as racism reports grow

2022-12-05
The Independent
Why's our monitor labelling this an incident or hazard?
The article explicitly mentions the use of AI systems for content moderation on Twitter, which is an AI system by definition. The harms described (hate speech, abusive content) are ongoing and have been reported to have increased, but the AI system is being deployed to address these harms rather than causing new harm. There is no indication that the AI system malfunctioned or caused harm; rather, the article discusses the operational shift to AI moderation following staff reductions and the challenges faced. This fits the definition of Complementary Information, as it updates on the use and governance of AI systems in response to existing harms, rather than describing a new AI Incident or AI Hazard.

Twitter exec says it's moving fast on moderation as harmful content surges

2022-12-03
Rappler
Why's our monitor labelling this an incident or hazard?
The event involves the use of AI systems for automated content moderation, which is explicitly mentioned as a key part of Twitter's approach. The AI system's use has directly led to the removal of harmful content and accounts, addressing child safety violations, which is a harm to persons that is being mitigated. However, the article also highlights a surge in hate speech, indicating ongoing harm to communities. Since the AI system's development and use have directly influenced the presence and moderation of harmful content, this qualifies as an AI Incident. The article does not merely discuss potential future harm or general AI developments but reports on realized harms and mitigation efforts involving AI systems.

Twitter under Elon Musk fast, aggressive with harmful content, says top executive

2022-12-03
India Today
Why's our monitor labelling this an incident or hazard?
The article explicitly mentions Twitter's increased reliance on automation (AI systems) for content moderation, which directly affects the presence and spread of harmful content such as hate speech and child exploitation material. This moderation approach influences harm to communities and child safety, fulfilling the criteria for an AI Incident because the AI system's use has directly led to harm (increased hate speech and challenges in content moderation). The event involves the use of AI systems in a way that has materialized harm, not just potential harm, and thus qualifies as an AI Incident rather than a hazard or complementary information.

Twitter Under Elon Musk More Empowered, Aggressive Against Hateful Content: Top Executive

2022-12-03
Jagran English
Why's our monitor labelling this an incident or hazard?
The event involves AI systems used for content moderation on Twitter, including automated takedown of harmful posts and restriction of abusive hashtags. The article reports a surge in hateful content, indicating that the AI systems' use and changes in moderation policies have directly influenced the presence and visibility of harmful content, which constitutes harm to communities. The AI systems' development and use are central to the event, and the harms described are realized, not just potential. Thus, this qualifies as an AI Incident due to the direct link between AI-driven moderation and the presence of harmful content affecting users and communities.

Twitter exec says moving fast on moderation, as harmful content surges

2022-12-03
The Straits Times
Why's our monitor labelling this an incident or hazard?
The event involves AI systems (automation in content moderation) used in the development and use phases to manage harmful content on Twitter. However, the article does not report a specific AI malfunction or misuse causing new harm, nor does it describe a plausible future harm scenario from AI use. Instead, it details the company's strategic shift and operational changes in AI moderation tools amid ongoing challenges with harmful content. This fits the definition of Complementary Information, as it provides context and updates on societal and governance responses to AI-related harms rather than reporting a new AI Incident or AI Hazard.

Twitter is leaning too much on machine-based content moderation. Here's why this is problematic- Technology News, Firstpost

2022-12-05
Firstpost
Why's our monitor labelling this an incident or hazard?
The article explicitly mentions Twitter's use of automated (machine-based) content moderation systems, which qualify as AI systems due to their role in analyzing and filtering content. The harms described include increased hateful and abusive speech, which constitutes harm to communities and a violation of rights. The AI system's use in content moderation has directly or indirectly led to these harms by failing to adequately moderate content and by enabling increased harmful speech. Hence, this qualifies as an AI Incident under the framework because the AI system's use has led to realized harm.

Content moderation: Twitter turns to AI to combat hate speech, racism

2022-12-05
Pulse Nigeria
Why's our monitor labelling this an incident or hazard?
The event involves the use of AI systems for content moderation, which is clearly an AI system involvement. The surge in hate speech and offensive content represents harm to communities (harm category d). However, the article does not attribute the increase in hate speech to the AI system itself, nor does it describe any malfunction or misuse of the AI system causing harm. Instead, the AI system is being deployed as a tool to mitigate harm. Therefore, this is not an AI Incident. There is no indication that the AI system could plausibly lead to harm in the future beyond the current situation, so it is not an AI Hazard. The article mainly provides information about the platform's governance and technical response to existing issues, which fits the definition of Complementary Information.

Twitter exec says moving fast on moderation, as harmful content surges

2022-12-03
St. Louis Post-Dispatch
Why's our monitor labelling this an incident or hazard?
The article explicitly mentions the use of automation (AI systems) for content moderation and the challenges in managing harmful content such as hate speech and child exploitation material. These harms fall under violations of rights and harm to communities. However, the article does not describe a specific AI Incident where the AI system directly or indirectly caused harm, nor does it describe a plausible future harm scenario (hazard). Instead, it focuses on the company's approach, challenges, and responses to harmful content moderation, making it Complementary Information that enhances understanding of AI's role in content moderation and its societal implications.

Twitter Moves To Automate Its Moderation Systems - SlashGear

2022-12-04
SlashGear
Why's our monitor labelling this an incident or hazard?
The event involves the use of AI systems for automated content moderation, which is a clear AI system involvement. The change is in the use of AI for moderation decisions. However, the article does not describe any direct or indirect harm that has occurred due to this shift, only potential concerns and intentions. Therefore, this qualifies as an AI Hazard, as the automated moderation could plausibly lead to harms such as misinformation spread, inadequate content policing, or rights violations in the future, but no incident has yet materialized.

Elon Musk's Twitter Banks on Automation to Moderate Content, Combat Hate Speech

2022-12-04
Tech Times
Why's our monitor labelling this an incident or hazard?
The article explicitly mentions automation for content moderation, from which the use of AI systems for detecting and limiting abusive content can reasonably be inferred. However, it does not report any direct or indirect harm resulting from the AI system's malfunction or misuse. The concerns raised are about increased hate speech and platform safety under new leadership, but these are not directly linked to AI system failures or misuse causing harm. The article mainly provides context on the evolving AI moderation approach and policy stance, without describing a realized incident or a plausible imminent hazard. Therefore, this is best classified as Complementary Information, as it provides supporting context on AI system use and governance responses rather than reporting a new AI Incident or AI Hazard.

Twitter exec says moving fast on moderation, as harmful content surges

2022-12-03
Malay Mail
Why's our monitor labelling this an incident or hazard?
The article explicitly mentions Twitter's use of automation (AI systems) for content moderation and the resulting surge in harmful content such as hate speech and child exploitation material. These harms affect communities and individuals, fulfilling the criteria for harm under the AI Incident definition. The AI systems' use in moderation is central to the event, and the harms are occurring, not just potential. Hence, this qualifies as an AI Incident rather than a hazard or complementary information.

Twitter is now relying more on AI to identify harmful content, says its new trust and safety chief | Business Insider

2022-12-04
BusinessInsider
Why's our monitor labelling this an incident or hazard?
The event involves the use of AI systems for content moderation, which constitutes AI system involvement in the use phase. While the AI system is used to identify harmful content and remove accounts violating child safety, the article does not describe any realized harm caused by the AI system itself or any malfunction leading to harm. The focus is on the ongoing use and strategic shift towards AI-driven moderation and related governance discussions. Therefore, this is not an AI Incident or AI Hazard but rather Complementary Information providing context on AI deployment and governance responses in the AI ecosystem.

Automated detection will be integral to Twitter content moderation

2022-12-05
Android Headlines
Why's our monitor labelling this an incident or hazard?
The event involves the use of AI systems (automated detection for content moderation) in the development and use phases. However, the article does not describe any direct or indirect harm resulting from these AI systems. It focuses on the company's strategy, challenges, and intentions regarding AI-driven moderation. Therefore, it does not meet the criteria for an AI Incident or AI Hazard. Instead, it provides contextual information about AI deployment and governance responses, fitting the definition of Complementary Information.

Exclusive-Twitter exec says moving fast on moderation, as harmful content surges

2022-12-03
Colorado Springs Gazette
Why's our monitor labelling this an incident or hazard?
Twitter's use of automated AI systems for content moderation is explicitly mentioned, including automated takedowns and visibility filtering. The surge in hateful content and child exploitation material represents realized harm to communities and individuals. The AI system's development and use in moderation directly influence these harms, making this an AI Incident. The article focuses on the ongoing harm and the company's response, not just potential future harm or general AI news.

Twitter exec says moving fast on moderation, as harmful content surges 

2022-12-03
The Standard
Why's our monitor labelling this an incident or hazard?
The article explicitly states that Twitter is using automation (AI systems) to moderate content, including automatic takedown of harmful posts and restricting abusive hashtags. The surge in hate speech and harmful content on the platform after changes in moderation policies and staffing indicates that the AI system's use and its operational decisions have directly contributed to harm to communities through increased exposure to hateful and abusive content. This meets the criteria for an AI Incident because the AI system's use has directly led to harm (harmful content proliferation) affecting community well-being and safety.

Twitter is now relying more on AI to identify harmful content, says its new trust and safety chief

2022-12-03
Business Insider Nederland
Why's our monitor labelling this an incident or hazard?
The article focuses on Twitter's strategic use of AI to identify and restrict harmful content, including hate speech and child safety violations, which are recognized harms. However, it does not describe a new incident where AI caused harm or malfunctioned. Instead, it reports on the company's approach and governance measures involving AI, including collaboration with cybersecurity groups and political discussions about content moderation. This aligns with Complementary Information, as it updates on responses and ongoing management of AI-related harms rather than describing a new AI Incident or AI Hazard.

Twitter moves to automated moderation as hate speech surges - WSTale.com

2022-12-03
WSTale.com
Why's our monitor labelling this an incident or hazard?
The article explicitly discusses the use of automated moderation tools, which are AI systems, to detect and restrict harmful content such as hate speech and child exploitation material. The surge in hate speech and the platform's moderation approach directly affect community harm. The AI system's role in moderating content and restricting visibility is central to the event, and the harm (hate speech proliferation and content restriction) is occurring. Hence, this is an AI Incident as the AI system's use has directly led to harm to communities through the spread and management of hateful content.

The Elon Musk effect: Hate speech soared on Twitter after the purchase of the social network

2022-12-02
El Mercurio de Santiago
Why's our monitor labelling this an incident or hazard?
Twitter employs AI systems to detect and moderate hate speech and harmful content. The article indicates that after policy changes, including amnesty for suspended accounts, there was a sharp rise in hate speech. This suggests that the AI systems' role in content moderation was affected, either by changes in their use or by policy decisions overriding AI moderation. The increase in hate speech constitutes harm to communities, fulfilling the criteria for an AI Incident where the AI system's use or malfunction (or its overridden function) has indirectly led to harm. Therefore, this event qualifies as an AI Incident.

Hate speech soars on Twitter under Elon Musk, experts say

2022-12-02
infobae
Why's our monitor labelling this an incident or hazard?
Twitter uses AI systems for content moderation, including detecting and removing hate speech and extremist content. The article describes a clear increase in harmful content after changes in moderation policies and staffing under Elon Musk's ownership. This suggests that the AI systems' use or malfunction (reduced moderation effectiveness) has indirectly led to harm to communities by allowing hate speech and extremist content to proliferate. Therefore, this event qualifies as an AI Incident due to the realized harm caused by the AI system's failure or reduced effectiveness in content moderation.

Hate speech soars on Twitter under Elon Musk, experts say

2022-12-02
infobae
Why's our monitor labelling this an incident or hazard?
Twitter uses AI systems for content moderation, including detecting hate speech and extremist content. The article describes a clear increase in harmful content and extremist accounts after changes in moderation policies and staff reductions under Musk's ownership. This indicates that the AI systems' use or malfunction (due to reduced oversight or policy changes) has directly led to harm to communities by allowing more hate speech and extremist content to spread. Therefore, this event qualifies as an AI Incident due to realized harm caused by the AI system's use or malfunction in content moderation.

Racist and homophobic speech soared on Twitter under Elon Musk: The New York Times

2022-12-03
El Financiero
Why's our monitor labelling this an incident or hazard?
The article explicitly discusses the increase in harmful content on Twitter linked to changes in moderation under Elon Musk's leadership. Content moderation on social media platforms typically involves AI systems that detect and filter hate speech and extremist content. The reduction in moderation staff and policy changes have impaired the effectiveness of these AI systems, leading to a tripling of racist insults and increases in homophobic and antisemitic messages. This constitutes a malfunction or failure in the AI system's use, directly causing harm to communities by enabling the spread of hate speech and extremist content. Hence, the event meets the criteria for an AI Incident as the AI system's use has directly led to harm.

Hate speech: a phenomenon that has surged since Musk's arrival

2022-12-03
Diario El Día
Why's our monitor labelling this an incident or hazard?
Twitter's content moderation and recommendation systems likely involve AI to detect, filter, and manage content. The article describes a surge in harmful content such as hate speech and extremist profiles, which suggests a failure or reduction in effective AI moderation after Musk's acquisition. This has led to harm to communities by enabling the spread of hateful and extremist content. Therefore, the event qualifies as an AI Incident due to the indirect harm caused by the AI system's use or malfunction in content moderation and management.

NY Times reveals that hate speech has soared on Twitter under Musk's new management

2022-12-03
Vanguardia
Why's our monitor labelling this an incident or hazard?
Twitter employs AI systems for content moderation to detect and limit hate speech and extremist content. The article reports a sharp increase in such harmful content after management changes that included firing moderation staff and policy shifts, which likely impaired the AI moderation system's effectiveness. This has led to realized harm to communities through increased hate speech and extremist content dissemination. Hence, the AI system's malfunction or reduced operation is indirectly causing harm, qualifying this as an AI Incident.

Hate speech soars on Twitter after its acquisition by Elon Musk

2022-12-02
Público.es
Why's our monitor labelling this an incident or hazard?
Twitter uses AI systems for content moderation to detect and remove hate speech and extremist content. The article reports a sharp increase in such harmful content after the acquisition, coinciding with layoffs of moderation staff and policy changes. The AI moderation system's malfunction or reduced effectiveness (due to changes in use or oversight) has indirectly led to harm to communities by allowing hate speech and extremist content to proliferate. This fits the definition of an AI Incident, as the AI system's use and malfunction have directly or indirectly led to harm.

Hate speech is said to have soared on Twitter after Elon Musk's purchase | World

2022-12-03
Los Andes
Why's our monitor labelling this an incident or hazard?
Twitter uses AI systems for content moderation, including detection and removal of hate speech and extremist content. The reported surge in hate speech and the reinstatement of suspended accounts suggest a failure or change in the AI moderation system's use or effectiveness after the ownership change. This has directly led to harm to communities by enabling the spread of hate speech and extremist content, fulfilling the criteria for an AI Incident due to the AI system's role in content management and its impact on harmful content dissemination.

Hate speech has soared on Twitter since Elon Musk's arrival

2022-12-02
Vozpópuli
Why's our monitor labelling this an incident or hazard?
Twitter employs AI systems for content moderation and detection of hate speech. The article highlights that after Musk's takeover, moderation efforts were reduced, leading to a surge in hate speech and extremist content. This indicates a failure or misuse of AI systems in content control, resulting in harm to communities through increased exposure to hate speech and extremist propaganda. The harm is realized and ongoing, meeting the criteria for an AI Incident. The AI system's malfunction or reduced effectiveness due to policy and staffing changes directly contributed to the harm.

Hate speech soars on Twitter under Elon Musk, experts say

2022-12-02
López-Dóriga Digital
Why's our monitor labelling this an incident or hazard?
Twitter employs AI systems for content moderation and detection of harmful content. The article reports a tripling of racist insults, increases in homophobic and antisemitic messages, and resurgence of extremist accounts after Musk's takeover and policy changes, including staff cuts in moderation. These changes have allowed harmful content to proliferate, causing harm to communities and violating rights. The AI systems' malfunction or reduced effectiveness (due to policy and staffing changes) has indirectly contributed to this harm. Hence, this qualifies as an AI Incident due to realized harm linked to AI system use and malfunction in content moderation.

Hate speech soars on Twitter under Elon Musk, experts say

2022-12-02
La Hora Noticias de Ecuador, sus provincias y el mundo
Why's our monitor labelling this an incident or hazard?
The event involves AI systems used for content moderation on Twitter, which are responsible for detecting and removing hate speech and extremist content. The article describes a clear increase in harmful content following changes in moderation policies and staff cuts, implying a malfunction or reduced effectiveness of these AI systems. This has directly or indirectly led to harm to communities through the spread of hate speech and extremist propaganda. Hence, it meets the criteria for an AI Incident as the AI system's malfunction or use has caused realized harm.

Hate speech soars on Twitter under Elon Musk, experts say - Revista Summa

2022-12-02
Revista Summa
Why's our monitor labelling this an incident or hazard?
Twitter uses AI systems for content moderation, recommendation, and verification processes. The increase in hate speech and extremist content suggests a failure or change in these AI systems' use or effectiveness, leading to harm to communities by enabling the spread of harmful content. Although the article does not explicitly detail AI malfunction, the platform's AI-driven moderation and recommendation systems are implicated in the rise of harmful content. Therefore, this qualifies as an AI Incident due to indirect harm caused by the AI systems' use and management.

Experts: hate speech soars with Elon Musk at the helm of Twitter

2022-12-02
Telemundo Washington DC (44)
Why's our monitor labelling this an incident or hazard?
Twitter uses AI systems for content moderation, detection, and management of harmful content. The article reports a clear increase in hate speech and extremist content after changes in management and moderation policies, which involve AI systems' use or malfunction (e.g., reduced moderation effectiveness). This has led to harm to communities through the spread of hate speech and extremist content. The AI systems' role in content filtering and verification is pivotal in this harm, making this an AI Incident rather than a hazard or complementary information.

Just one person remains on Twitter's Asia child safety team, report says, despite Elon Musk saying dealing with child abuse is his biggest priority

2022-11-29
Business Insider
Why's our monitor labelling this an incident or hazard?
The event involves the use of AI systems for content moderation on Twitter, specifically targeting child sexual abuse material. The drastic reduction in the human team responsible for overseeing and managing these AI systems' outputs in the Asia-Pacific region has directly led to a diminished capacity to remove harmful content, which is a violation of rights and causes harm to communities. The article indicates that despite claims of prioritizing child safety, the reduction in staff undermines this goal, implying ongoing or increased harm. Hence, this qualifies as an AI Incident because the AI system's use and its insufficient human support have directly contributed to harm related to child sexual exploitation content on the platform.

Elon Musk's job cuts decimated Twitter team tackling child sexual abuse

2022-11-29
The Indian Express
Why's our monitor labelling this an incident or hazard?
The article explicitly mentions the use of AI-based tools for identifying child sexual exploitation material, indicating the presence of AI systems in content moderation. The harm described is the increased risk and likely presence of illegal and harmful content due to the drastic reduction in human moderators who complement AI tools. This reduction has overwhelmed the remaining team, impairing the platform's ability to effectively detect and remove harmful content. Since child sexual exploitation material is illegal and causes significant harm to children and communities, and the AI system's role in moderation is pivotal but insufficient without human oversight, the event meets the criteria for an AI Incident. The harm is realized (not just potential), and the AI system's involvement in the moderation process is central to the incident.

Elon Musk fans claim he's already eliminated child abuse material on Twitter -- experts say otherwise

2022-11-28
The Daily Dot
Why's our monitor labelling this an incident or hazard?
While the article involves Twitter's content moderation, which likely uses AI systems for detecting harmful content, the main focus is on the effectiveness and organizational changes in moderation rather than a specific AI system causing harm or posing a plausible future harm. There is no direct or indirect link to an AI Incident or AI Hazard as defined, since the article does not describe an AI system malfunction, misuse, or credible risk leading to harm. Instead, it critiques the social and operational handling of CSAM on the platform, making it Complementary Information that provides context and expert opinion on ongoing challenges rather than reporting a new AI-related harm or hazard.

Layoffs Have Gutted Twitter's Child Safety Team

2022-11-28
WIRED UK
Why's our monitor labelling this an incident or hazard?
The article explicitly discusses the use of AI systems and human moderators to detect and remove CSAM on Twitter. The layoffs have reduced the human oversight critical to the effective functioning of these AI systems, leading to a direct and ongoing harm: increased risk of child sexual abuse content remaining on the platform. This constitutes a violation of human rights and harm to communities. The AI system's malfunction or reduced effectiveness due to lack of human support is a direct contributing factor to the harm. Therefore, this event qualifies as an AI Incident.

Elon Musk's job cuts decimated Twitter team tackling child sexual abuse

2022-11-29
The Spokesman Review
Why's our monitor labelling this an incident or hazard?
The article explicitly involves AI-based tools as part of the content moderation system, combined with human experts. The drastic cuts to the human team responsible for reviewing and escalating reports of child sexual exploitation have directly impaired the platform's ability to manage illegal content, which constitutes harm to communities and individuals (children). The AI system's role is pivotal as it supports detection, but the lack of sufficient human oversight and moderation has led to a failure in effectively preventing harm. This meets the criteria for an AI Incident because the development and use of AI-supported moderation systems, combined with human resource reductions, have directly led to harm through inadequate content policing and increased risk of child exploitation material spreading.

Musk's job cuts decimated Twitter team tackling child sexual abuse - Portland Press Herald

2022-11-29
Portland Press Herald
Why's our monitor labelling this an incident or hazard?
The article explicitly mentions the use of AI-based tools to identify child sexual exploitation material but emphasizes the critical role of human specialists in reviewing and escalating reports. The drastic cuts to the human moderation team have overwhelmed the remaining staff, reducing the platform's ability to effectively combat illegal content. This has led to a direct harm scenario where child sexual abuse material and grooming content are less effectively controlled, violating legal obligations and causing harm to communities. The AI system's involvement is indirect but essential, as the moderation system relies on both AI and human expertise. The failure to maintain adequate human oversight combined with AI tools' limitations has directly contributed to the harm, qualifying this as an AI Incident rather than a hazard or complementary information.

Musk's job cuts decimated Twitter team tackling child sexual abuse - Lewiston Sun Journal

2022-11-29
Lewiston Sun Journal
Why's our monitor labelling this an incident or hazard?
The event involves the use and management of AI-based tools combined with human moderation to detect child sexual abuse material on Twitter. The drastic cuts to the human moderation team, which works alongside AI tools, have directly led to a reduced ability to prevent the spread of illegal and harmful content. This constitutes a failure in the use of an AI system (content moderation tools) and human oversight, resulting in direct harm to communities and individuals (children) through increased exposure to child sexual exploitation material. Therefore, this qualifies as an AI Incident due to the direct link between AI system use, human moderation reduction, and realized harm.