Out-of-distribution (OOD) generalization

Catalogue of Tools & Metrics for Trustworthy AI

These tools and metrics are designed to help AI actors develop and use trustworthy AI systems and applications that respect human rights and are fair, transparent, explainable, robust, secure and safe.

Overview Tools Metrics About the catalogue

Website

Robustness Metrics provides lightweight modules in order to evaluate the robustness of classification models. OOD generalization is defined as, e.g. a non-expert human would be able to classify similar objects, but possibly changed viewpoint, scene setting or clutter.

The library includes popular out-of-distribution datasets (ImageNetV2, ImageNet-C, etc.) and can be readily applied to benchmark arbitrary models and is not limited to vision models: any mapping from input -> logits will do.

Trustworthy AI Relevance

This metric addresses Robustness, Safety by quantifying relevant system properties. OOD generalization is fundamentally about ensuring AI systems remain reliable and perform safely when faced with novel or shifted data distributions, which aligns closely with the Robustness objective. By improving OOD generalization, the system reduces the risk of unexpected failures or harmful outputs in new environments, thereby supporting the Safety objective.

Related use cases :

Towards Out-Of-Distribution Generalization: A Survey

Uploaded on Oct 25, 2022

Classic machine learning methods are built on the i.i.d. assumption that training and testing data are independent and identically distributed. However, in real scena...

About the metric

You can click on the links to see the associated metrics

Objective(s):

Robustness

Lifecycle stage(s):

Operate & monitor
Verify & validate

Target users:

Data scientist
Developer
Researcher

Risk management stage(s):

Assess
Treat

Modify this metric

Partnership on AI

Disclaimer: The tools and metrics featured herein are solely those of the originating authors and are not vetted or endorsed by the OECD or its member countries. The Organisation cannot be held responsible for possible issues resulting from the posting of links to third parties' tools and metrics on this catalogue. More on the methodology can be found at https://oecd.ai/catalogue/faq.