
The information displayed in the AIM should not be reported as representing the official views of the OECD or of its member countries.
A study by Mass General Brigham found that large language model AI systems, including GPT-5 and Gemini, fail to provide adequate early differential diagnoses in over 80% of cases. While accurate with complete data, their lack of clinical reasoning poses risks if used unsupervised in medical settings.[AI generated]






































