Catalogue of Tools & Metrics for Trustworthy AI

These tools and metrics are designed to help AI actors develop and use trustworthy AI systems and applications that respect human rights and are fair, transparent, explainable, robust, secure and safe.

Overview Tools Metrics About the catalogue

AILuminate benchmark v1.0

Website

Github

The AILuminate v1.0 benchmark, developed by MLCommons, evaluates the safety of AI systems by testing how they respond to a set of prompts related to potential harms.

The benchmark can be applied to both bare models, which are standalone models without external guardrails, and AI systems, which may include additional components such as moderation filters, guardrails, or other safety mechanisms. To evaluate a system, AILuminate inputs prompts into the system under test (SUT), records its responses, and analyzes them using specialized safety evaluator models. These evaluator models determine whether the responses violate the AILuminate Assessment Standard guidelines. The benchmark examines several categories of hazards, including physical harms (such as violent crime or self-harm), non-physical harms (such as hate speech, privacy violations, or intellectual property issues), and contextual hazards like unsafe specialized advice. The findings are summarized in a human-readable report, and each system receives a grade ranging from Poor to Excellent based on the percentage of responses that violate the assessment standard. This allows different AI systems to be compared according to their observed safety performance.

About the tool

You can click on the links to see the associated tools

Developing organisation(s):

mlcommons

Tool type(s):

Technical validation
Rating framework

Objective(s):

Safety

Impacted stakeholders:

Other

Purpose(s):

Event/anomaly detection

Lifecycle stage(s):

Verify & validate

Type of approach:

Technical

Maturity:

Published document

Usage rights:

Free of charge

Target groups:

Technical community

Target users:

Developer

Stakeholder group:

Technical community

Benefits:

Responsible implementation

Geographical scope:

International

Technology platforms:

Platform neutral

Tags:

ai ethics
trustworthy ai
ai security
ai
ai safety

Modify this tool

Use Cases

There is no use cases for this tool yet.

Would you like to submit a use case for this tool?

If you have used this tool, we would love to know more about your experience.

Add use case

Partnership on AI

Disclaimer: The tools and metrics featured herein are solely those of the originating authors and are not vetted or endorsed by the OECD or its member countries. The Organisation cannot be held responsible for possible issues resulting from the posting of links to third parties' tools and metrics on this catalogue. More on the methodology can be found at https://oecd.ai/catalogue/faq.