Catalogue of Tools & Metrics for Trustworthy AI

These tools and metrics are designed to help AI actors develop and use trustworthy AI systems and applications that respect human rights and are fair, transparent, explainable, robust, secure and safe.

4 citations of this metric

The Normalized Scanpath Saliency was introduced to the saliency community as a simple correspondence measure between saliency maps and ground truth, computed as the average normalized saliency at fixated locations. Unlike in AUC, the absolute saliency values are part of the normalization calculation. NSS is sensitive to false positives, relative differences in saliency across the image, and general monotonic transformations.

Trustworthy AI Relevance

This metric addresses Explainability and Transparency by quantifying relevant system properties. NSS quantifies the fidelity of saliency/attention maps to human gaze data, which directly supports Explainability by providing an objective measure of how well model-generated explanations (saliency maps) align with human attention and thus how trustworthy those explanations are. Because saliency maps are often used to disclose where a model 'looks' when making decisions, NSS also supports Transparency: it provides a reproducible score that helps communicate and audit model behavior.

References

1. Nikulin, M.S., 2001. Hellinger distance. In Encyclopedia of Mathematics. EMS Press. 2. Hellinger, E., 1909. Neue Begründung der Theorie quadratischer Formen von unendlichvielen Veränderlichen. Journal für die reine und angewandte Mathematik, 136, pp.210–271. 3. Jeffreys, H., 1946. An invariant form for the prior probability in estimation problems. Proceedings of the Royal Society of London. Series A. Mathematical and Physical Sciences, 186(1007), pp.453–461. 4. Liese, F. and Miescke, K.-J., 2008. Statistical Decision Theory: Estimation, Testing, and Selection. Springer. 5. Pardo, L., 2006. Statistical Inference Based on Divergence Measures. Chapman and Hall/CRC.

About the metric




Lifecycle stage(s):


Target users:


Risk management stage(s):


Github stars:

  • 7100

Github forks:

  • 720

Modify this metric

catalogue Logos

Disclaimer: The tools and metrics featured herein are solely those of the originating authors and are not vetted or endorsed by the OECD or its member countries. The Organisation cannot be held responsible for possible issues resulting from the posting of links to third parties' tools and metrics on this catalogue. More on the methodology can be found at https://oecd.ai/catalogue/faq.