
The information displayed in the AIM should not be reported as representing the official views of the OECD or of its member countries.
After Meta reduced AI-driven content moderation on Facebook in early 2025, abusive comments targeting U.S. lawmakers tripled, with violent threats and harassment rising sharply. The Center for Countering Digital Hate linked this surge to Meta's policy changes, highlighting the direct harm caused by weakened AI enforcement.[AI generated]
Why's our monitor labelling this an incident or hazard?
Meta's content moderation relies heavily on AI systems to detect and remove harmful content. The rollback of these AI-driven moderation policies led to a measurable increase in violent threats and harassment, which are direct harms to individuals and communities. The report from CCDH provides evidence of this causal link. The AI system's reduced enforcement capability (whether by design or malfunction) directly contributed to the increase in harmful content, fulfilling the criteria for an AI Incident. The event is not merely a potential risk or a complementary update but a realized harm linked to AI system use.[AI generated]