Catalogue of Tools & Metrics for Trustworthy AI

These tools and metrics are designed to help AI actors develop and use trustworthy AI systems and applications that respect human rights and are fair, transparent, explainable, robust, secure and safe.

AILuminate benchmark v1.0



AILuminate benchmark v1.0

The AILuminate v1.0 benchmark, developed by MLCommons, evaluates the safety of AI systems by testing how they respond to a set of prompts related to potential harms.

The benchmark can be applied to both bare models, which are standalone models without external guardrails, and AI systems, which may include additional components such as moderation filters, guardrails, or other safety mechanisms. To evaluate a system, AILuminate inputs prompts into the system under test (SUT), records its responses, and analyzes them using specialized safety evaluator models. These evaluator models determine whether the responses violate the AILuminate Assessment Standard guidelines. The benchmark examines several categories of hazards, including physical harms (such as violent crime or self-harm), non-physical harms (such as hate speech, privacy violations, or intellectual property issues), and contextual hazards like unsafe specialized advice. The findings are summarized in a human-readable report, and each system receives a grade ranging from Poor to Excellent based on the percentage of responses that violate the assessment standard. This allows different AI systems to be compared according to their observed safety performance.

About the tool


Developing organisation(s):



Objective(s):


Impacted stakeholders:



Lifecycle stage(s):


Type of approach:



Usage rights:


Target groups:


Target users:


Stakeholder group:



Geographical scope:


Technology platforms:


Tags:

  • ai ethics
  • trustworthy ai
  • ai security
  • ai
  • ai safety

Modify this tool

Use Cases

There is no use cases for this tool yet.

Would you like to submit a use case for this tool?

If you have used this tool, we would love to know more about your experience.

Add use case
Partnership on AI

Disclaimer: The tools and metrics featured herein are solely those of the originating authors and are not vetted or endorsed by the OECD or its member countries. The Organisation cannot be held responsible for possible issues resulting from the posting of links to third parties' tools and metrics on this catalogue. More on the methodology can be found at https://oecd.ai/catalogue/faq.