AI-Assisted Content Moderation on WhatsApp Breaches User Privacy

The information displayed in the AIM should not be reported as representing the official views of the OECD or of its member countries.

Reports reveal that Facebook uses AI systems to flag WhatsApp messages reported as spam, allowing human moderators to access and review these messages despite claims of end-to-end encryption. This practice has led to privacy violations and the sharing of user data with authorities, undermining WhatsApp's privacy assurances. [AI generated]

Why's our monitor labelling this an incident or hazard?

The article explicitly mentions the use of Facebook's machine learning framework (an AI system) to flag content, which then allows human moderators to access the last five messages in a chat thread. This process directly leads to violations of user privacy and breaches of the claimed end-to-end encryption, constituting a violation of fundamental rights. Therefore, the event involves an AI system's use leading directly to harm (privacy violations), qualifying it as an AI Incident under the framework. [AI generated]
AI principles
Privacy & data governance, Respect of human rights, Transparency & explainability, Accountability

Industries
Media, social platforms, and marketing

Affected stakeholders
Consumers

Harm types
Human or fundamental rights

Severity
AI incident

Business function
Monitoring and quality control

AI system task
Event/anomaly detection


Articles about this incident or hazard

Encryption A Lie? Facebook Can Read Your WhatsApp Messages, Says Report

2021-09-08
IndiaTimes
Why's our monitor labelling this an incident or hazard?
The article explicitly mentions the use of Facebook's machine learning framework (an AI system) to flag content, which then allows human moderators to access the last five messages in a chat thread. This process directly leads to violations of user privacy and breaches of the claimed end-to-end encryption, constituting a violation of fundamental rights. Therefore, the event involves an AI system's use leading directly to harm (privacy violations), qualifying it as an AI Incident under the framework.

Facebook IS reading your WhatsApp texts despite 'encryption', researchers claim

2021-09-08
The Sun
Why's our monitor labelling this an incident or hazard?
The article explicitly mentions the use of Facebook's artificial intelligence systems to sift through reported WhatsApp messages for inappropriate content. The AI system's outputs trigger human review and have been used in legal prosecutions, indicating direct involvement in causing harm related to privacy violations and potential breaches of fundamental rights. The harm is realized, not just potential, as evidenced by prosecutions and imprisonment. Hence, this event meets the criteria for an AI Incident due to the AI system's role in leading to violations of human rights and privacy.

Investigation raises fresh concerns over WhatsApp privacy and encryption

2021-09-08
Android Authority
Why's our monitor labelling this an incident or hazard?
The article explicitly mentions the use of machine learning systems (AI) to scan flagged content, indicating AI system involvement. However, it does not describe any direct or indirect harm caused by the AI system's development, use, or malfunction. The content review system is presented as a measure to prevent harm by banning abusive users, and no harm or privacy breach is reported as having occurred. The concerns raised are about potential privacy implications but do not constitute a realized harm or a plausible future harm event. Thus, the article fits the definition of Complementary Information, providing context and updates on AI use and privacy concerns without reporting an AI Incident or AI Hazard.

WhatsApp "end-to-end encrypted" messages aren't that private after all

2021-09-08
Ars Technica
Why's our monitor labelling this an incident or hazard?
The article explicitly mentions AI algorithms used to process flagged WhatsApp messages and metadata for content moderation, which directly impacts user privacy and rights. The harm is realized as users' private messages are reviewed by AI and human moderators, violating the expectation of privacy despite end-to-end encryption claims. This is a clear violation of privacy rights, a form of harm under the framework. Therefore, the event qualifies as an AI Incident due to the direct involvement of AI systems in causing harm to users' privacy and rights.

Facebook may be reading private WhatsApp messages despite promises it wouldn't

2021-09-08
Daily Star
Why's our monitor labelling this an incident or hazard?
The article explicitly mentions the use of AI to assist contractors in reviewing reported WhatsApp messages, indicating the involvement of an AI system. The harm realized is the violation of users' privacy rights and breach of promises regarding message confidentiality, which falls under violations of human rights and legal obligations. The AI system's role is pivotal as it enables the large-scale review and filtering of private content, directly contributing to the harm. Hence, this event meets the criteria for an AI Incident rather than a hazard or complementary information.

WhatsApp to offer end-to-end encryption option for backups

2021-09-10
Deccan Herald
Why's our monitor labelling this an incident or hazard?
The article does not report any realized harm or incident caused by the AI system or encryption technology. Instead, it details a new security feature aimed at preventing potential future harms related to unauthorized access of user data. The involvement of AI is implicit in the sophisticated encryption key management and security framework, but no harm or misuse is described. Therefore, this is best classified as Complementary Information, as it provides an update on a significant privacy and security enhancement in a widely used AI-enabled communication platform, without describing an AI Incident or AI Hazard.

WhatsApp to add encrypted backups for messages

2021-09-10
Asian News International (ANI)
Why's our monitor labelling this an incident or hazard?
The article details a new security feature being introduced by WhatsApp to protect user data through encryption. While it involves AI-related cryptographic technology and secure key management, there is no mention or implication of any realized harm, violation of rights, or disruption caused by this feature. The event is a development in AI-related security technology and governance but does not describe an incident or hazard. Therefore, it fits the category of Complementary Information as it provides context and updates on AI-related privacy and security measures without reporting harm or plausible future harm.

Can Facebook Read Private WhatsApp Messages? Report Indicates It Can

2021-09-08
https://newsbusters.org/
Why's our monitor labelling this an incident or hazard?
The report explicitly mentions the use of artificial intelligence systems combined with human contractors to sift through private messages and media reported by users. This AI-assisted monitoring leads to direct harm by violating user privacy and enabling legal actions against users based on the data processed. Therefore, this constitutes an AI Incident due to the realized harm to users' privacy and rights stemming from the AI system's use in content moderation and data sharing.

WhatsApp Allegedly Spying On Chats, Secretly Monitoring Users In India Despite Encryption Claims

2021-09-08
Inc42 Media
Why's our monitor labelling this an incident or hazard?
The article explicitly mentions the use of an AI-based monitoring system to flag content for moderation, indicating AI system involvement. The AI system's use in monitoring and flagging content has directly led to privacy violations and potential breaches of user rights, as unencrypted chat data and metadata are collected and shared with governments without full user consent. This constitutes a violation of fundamental rights and privacy, fulfilling the criteria for an AI Incident. The harm is realized, not just potential, as user data is being monitored and shared, and accounts are banned based on AI-flagged content. Hence, the event is classified as an AI Incident.

WhatsApp Encrypted Messages Are Not Private For Facebook Research Claims - Research Snipers

2021-09-08
Research Snipers
Why's our monitor labelling this an incident or hazard?
The article describes an AI system used by Facebook to pre-screen reported WhatsApp messages before human review, which involves processing encrypted messages. This is a clear AI system involvement in the use phase. However, the messages are only accessed when users report them, and this is disclosed in the terms of use, so the system operates within stated policies. There is no explicit report of harm such as unauthorized access, injury, or rights violations beyond the privacy concerns inherent in the system's design and user expectations. The event mainly provides additional context and clarification about AI's role in content moderation and privacy, rather than reporting a new incident or hazard. Therefore, it fits the definition of Complementary Information rather than an AI Incident or AI Hazard.

Report: Facebook Says It Doesn't Read Your WhatsApp Messages. Wrong. It Does

2021-09-10
Moguldom
Why's our monitor labelling this an incident or hazard?
The event involves AI systems used for content moderation on WhatsApp, which flag messages for review. The flagged content is then accessed by human contractors, leading to privacy violations and potential breaches of user rights. The AI system's involvement in identifying content for review directly contributes to harm by enabling access to private messages, contradicting WhatsApp's claims of end-to-end encryption privacy. This constitutes a violation of fundamental rights (privacy) and thus meets the criteria for an AI Incident. The harm is realized, not just potential, as private data is accessed and shared with law enforcement, fulfilling the definition of an AI Incident under violations of human rights or breach of obligations to protect fundamental rights.

A report shows Facebook can read your WhatsApp messages

2021-09-09
Techzim
Why's our monitor labelling this an incident or hazard?
The report explicitly states that WhatsApp messages reported as spam are accessible to Facebook moderators, implying the use of AI or algorithmic systems to identify and process such messages. This access contradicts the claimed end-to-end encryption and results in a violation of privacy rights, a form of human rights violation. The involvement of AI in content moderation and the resulting privacy breach constitute an AI Incident under the framework, as harm to human rights has occurred due to the AI system's use.

Facebook says it doesn't read WhatsApp messages, but an investigation found it actually does

2021-09-07
Business Insider
Why's our monitor labelling this an incident or hazard?
The event involves the use of AI systems to flag abusive content and the subsequent human review of unencrypted messages, which directly leads to a violation of users' privacy rights. This constitutes a breach of obligations under applicable laws protecting fundamental rights. The harm is realized (not just potential) as users' private messages are accessed by contractors, contrary to the service's claims. Therefore, this qualifies as an AI Incident due to the direct involvement of AI in content flagging and the resulting harm to privacy rights.

Facebook reads and shares WhatsApp private messages: report

2021-09-07
New York Post
Why's our monitor labelling this an incident or hazard?
The event explicitly involves AI systems used to flag and moderate WhatsApp messages, which are supposed to be end-to-end encrypted and private. The AI system's outputs lead to human review of private messages, resulting in privacy violations and sharing of user data with law enforcement. This directly breaches users' rights to privacy and confidentiality, fulfilling the criteria for an AI Incident under violations of human rights and breach of obligations protecting fundamental rights. The harm is realized, not just potential, as private messages are accessed and shared without full user consent.

WhatsApp can read your encrypted messages, but you may not need to worry - MSPoweruser

2021-09-07
MSPoweruser
Why's our monitor labelling this an incident or hazard?
The event explicitly involves AI systems used by WhatsApp/Facebook to analyze and filter reported messages before human moderators review them. This use of AI and human moderation leads to a violation of users' privacy rights, a breach of fundamental rights, which is a recognized harm under the AI Incident definition. Since the harm is realized and directly linked to the AI system's use in content moderation and message access, this qualifies as an AI Incident rather than a hazard or complementary information.

WhatsApp's promise of end-to-end encryption may be a complete lie

2021-09-07
Cult of Mac
Why's our monitor labelling this an incident or hazard?
An AI system is explicitly mentioned as being used to examine user content on WhatsApp, which is claimed to be end-to-end encrypted. The use of AI and contractors to read messages contradicts the encryption promise, constituting a violation of user privacy rights, a breach of obligations under applicable law protecting fundamental rights. This is a direct harm caused by the AI system's use in content moderation and surveillance. Therefore, this event qualifies as an AI Incident due to the realized harm to users' rights and privacy.

How Facebook Undermines Privacy Protections for Its 2 Billion WhatsApp Users – Peter Elkind, Jack Gillum and Craig Silverman

2021-09-08
WallStreetWindow
Why's our monitor labelling this an incident or hazard?
The event involves AI systems used for content moderation and metadata analysis, which directly lead to violations of user privacy and potentially breach fundamental rights. The AI systems' role in identifying and forwarding content for human review, combined with the sharing of metadata with law enforcement, constitutes a direct link to harm (privacy violations and rights breaches). The article documents actual practices and consequences, not just potential risks, qualifying this as an AI Incident rather than a hazard or complementary information. The harm is significant and clearly articulated, involving privacy violations affecting billions of users.

Despite encryption, WhatsApp accesses reported messages

2021-09-10
TecMundo
Why's our monitor labelling this an incident or hazard?
An AI system is involved as the algorithm automatically groups reported messages for review, and AI is mentioned as part of the message reading process. The use of AI and human moderators to access and analyze reported messages directly leads to a violation of user privacy and data protection rights, which are fundamental rights. The harm is realized, not just potential, as the article cites actual sharing of data with authorities and legal consequences. Therefore, this qualifies as an AI Incident due to the direct role of AI in processing user-reported messages leading to rights violations.

Facebook reads WhatsApp messages, says ProPublica investigation

2021-09-08
O Liberal
Why's our monitor labelling this an incident or hazard?
An AI system is involved as AI assists in analyzing reported messages. The use of human moderators reading private messages, even if only those reported, constitutes a violation of privacy rights, which falls under violations of human rights or breach of obligations intended to protect fundamental rights. Since the investigation reveals that this practice is ongoing, the harm is realized. Therefore, this qualifies as an AI Incident due to the direct involvement of AI in content moderation leading to privacy rights violations.

Reported WhatsApp messages are reviewed by more than 1,000 moderators, site says

2021-09-09
O Alto Acre
Why's our monitor labelling this an incident or hazard?
An AI system is explicitly involved in the proactive detection of suspicious activity on WhatsApp, which is part of the content moderation process. The AI's role is to identify potential violations that could lead to harm such as fraud, spam, or child exploitation. The event describes the use of AI in the system's operation to prevent these harms, but it does not report any specific incident of harm occurring or a failure of the AI system leading to harm. Instead, it describes the ongoing use of AI and human moderation to manage risks. Therefore, this is not an AI Incident or AI Hazard but rather Complementary Information about the AI system's role in content moderation and the platform's approach to handling reports and abuse prevention.

WhatsApp messenger probably does not support end-to-end encryption - Zoomit

2021-09-07
زومیت
Why's our monitor labelling this an incident or hazard?
The event involves AI systems used by Facebook to analyze and filter WhatsApp messages for inappropriate content, which are then reviewed by human contractors. This use of AI directly leads to a violation of users' privacy rights, as the content is accessible beyond the chat participants, contradicting the claimed end-to-end encryption. The harm is realized (not just potential), as user messages are being accessed and reviewed, constituting a breach of fundamental rights. Hence, this qualifies as an AI Incident under the framework because the AI system's use has directly led to a violation of human rights (privacy).

A third eye reads WhatsApp messages too - Peivast

2021-09-08
پیوست
Why's our monitor labelling this an incident or hazard?
The event involves the use of AI systems to monitor and select reported WhatsApp messages for human review, which leads to the reading of messages that users believed were protected by end-to-end encryption. This constitutes a violation of user privacy and a breach of fundamental rights, fulfilling the criteria for harm under the AI Incident definition. The AI system's role is pivotal in enabling this monitoring and message access. The harm is realized, not just potential, as messages are actually read and user privacy compromised. Hence, the classification as an AI Incident is appropriate.

Do WhatsApp employees review and share user information?

2021-09-13
Portafolio.co
Why's our monitor labelling this an incident or hazard?
The event involves the use of AI-supported systems to process and filter private user messages, which are then reviewed by human contractors. This use of AI and human review has directly led to violations of user privacy and data protection rights, constituting a breach of fundamental rights. The harm is realized as users' private communications are accessed and shared without full transparency or consent, undermining trust and privacy guarantees. Therefore, this qualifies as an AI Incident due to the direct involvement of AI systems in processing private data leading to harm in terms of privacy violations and potential legal breaches.

Report: Facebook can read WhatsApp chats

2021-09-08
PC-WELT
Why's our monitor labelling this an incident or hazard?
The article explicitly describes the use of AI systems to analyze WhatsApp user messages that have been reported, including decrypting messages and using AI to detect suspicious patterns. The AI system's outputs are used by human contractors to make decisions that can restrict user access, which constitutes a violation of privacy and potentially other rights. The involvement of AI in processing private communications and the resulting consequences for users meet the criteria for an AI Incident under violations of human rights and privacy. The sharing of data with authorities leading to legal proceedings further supports the classification as an incident rather than a mere hazard or complementary information.

Investigation: Facebook undermines WhatsApp users' privacy despite Zuckerberg's promises

2021-09-13
الرأي
Why's our monitor labelling this an incident or hazard?
The article explicitly describes the use of AI systems and human contractors to analyze WhatsApp user content and metadata, which directly leads to violations of privacy and user rights. The harm is realized, not hypothetical, as users' private messages and metadata are being inspected contrary to prior assurances. This constitutes a breach of fundamental rights and privacy, fitting the definition of an AI Incident. The AI system's development and use in content moderation and metadata analysis are central to the harm described. Hence, the classification is AI Incident.

Major scandal for Facebook: investigation reveals users' messages being read and their photos and videos viewed on WhatsApp

2021-09-13
Aljazeera
Why's our monitor labelling this an incident or hazard?
The article explicitly describes the use of AI systems and human contractors employing AI tools to monitor and analyze WhatsApp user content, which directly leads to violations of privacy rights and breaches of user trust. The harm is realized and ongoing, affecting a large user base. The AI systems' role is pivotal in enabling large-scale surveillance and content inspection beyond user expectations and prior assurances. This fits the definition of an AI Incident as it involves violations of human rights and breaches of legal obligations related to privacy and data protection caused by the use of AI systems.

"Facebook" undermines the privacy of "WhatsApp" users despite Zuckerberg's promises | Al-Quds Al-Arabi

2021-09-15
القدس العربي
Why's our monitor labelling this an incident or hazard?
The event involves the use of AI systems (AI-assisted content moderation and metadata analysis) in the operation of WhatsApp, which has directly led to violations of user privacy and rights. The investigation uncovers that Facebook's practices contradict public assurances of privacy, and that user data is being accessed and shared without full transparency or consent. This constitutes a breach of obligations intended to protect fundamental rights, qualifying as an AI Incident under the framework. The harm is realized, not just potential, as users' privacy has been compromised through AI-enabled surveillance and data sharing.

El Khabar - Facebook violates the privacy of "WhatsApp" users

2021-09-15
El Khabar
Why's our monitor labelling this an incident or hazard?
The event involves the use of AI systems in the operation of WhatsApp to analyze user messages and metadata, which directly leads to violations of privacy rights, a breach of fundamental rights protected by law. The article documents realized harm through unauthorized surveillance and data analysis, not merely potential risks. Hence, it meets the criteria for an AI Incident as the AI system's use has directly led to harm (privacy violations).

With a thousand employees: how does Facebook undermine WhatsApp users' privacy? - The New Khalij

2021-09-13
الخليج الجديد
Why's our monitor labelling this an incident or hazard?
The event involves the use of AI systems to process and analyze WhatsApp user data, including messages and metadata, to identify potentially abusive content. This AI-enabled monitoring directly results in privacy violations and breaches of user rights, as users' encrypted messages and metadata are accessed and analyzed without full transparency or consent. The harm is realized, not just potential, as users' privacy is compromised. Hence, it meets the criteria for an AI Incident because the AI system's use has directly led to violations of fundamental rights and harm to users' privacy.

Is 'WhatsApp' reading our messages? A serious leak!

2021-09-14
صيدا أون لاين :: Saidaonline
Why's our monitor labelling this an incident or hazard?
The event involves the use of AI systems (AI-assisted content and metadata analysis) in the operation of WhatsApp, leading to direct harm in the form of violations of user privacy and rights. The investigation uncovers that these practices are ongoing and have resulted in unauthorized data sharing and surveillance, which fits the definition of an AI Incident due to violations of human rights and legal obligations. Therefore, this is classified as an AI Incident rather than a hazard or complementary information.

Investigation: "Facebook" undermines the privacy of "WhatsApp" users

2021-09-13
أخبارنا
Why's our monitor labelling this an incident or hazard?
The event involves AI systems explicitly used to analyze and filter WhatsApp content, which directly impacts user privacy and rights. The use of AI and human contractors to inspect private messages and metadata without full transparency or consent constitutes a violation of privacy and fundamental rights, fulfilling the criteria for an AI Incident. The harm is realized, as users' private communications are accessed and analyzed, and data is shared with authorities, breaching privacy and legal protections. Hence, this is not merely a potential risk or complementary information but a concrete AI Incident involving harm to human rights.