
The information displayed in the AIM should not be reported as representing the official views of the OECD or of its member countries.
A Columbia University-led audit revealed that AI systems, particularly large language models, are responsible for fabricating over 4,000 citations in biomedical papers. These false references undermine clinical guidelines and scientific integrity, potentially leading to harmful medical decisions. The issue is ongoing and affects the reliability of healthcare research globally.[AI generated]
Why's our monitor labelling this an incident or hazard?
The article explicitly states that AI systems used in biomedical research writing are fabricating citations, leading to thousands of papers containing false references. These fabricated citations have not been corrected and can influence clinical guidelines, which doctors use to make treatment decisions, thus indirectly harming patient health. The AI's role in generating these false citations is central to the harm, fulfilling the criteria for an AI Incident involving injury or harm to health (a).[AI generated]