Tools for Trustworthy AI

Catalogue of Tools & Metrics for Trustworthy AI

These tools and metrics are designed to help AI actors develop and use trustworthy AI systems and applications that respect human rights and are fair, transparent, explainable, robust, secure and safe.

Overview Tools Metrics About the catalogue

Show tools Show use cases

AI Screener

TechnicalEducationalUploaded on Aug 27, 2025

AI Screener to enable universal early screening for all children.

Objective(s)

Robustness Safety

Related lifecycle stage(s)

Plan & design

Dytective

TechnicalEducationalUploaded on Aug 1, 2025

Dytective by Change Dyslexia is an innovative AI-powered tool designed to detect the risk of dyslexia in children quickly and reliably. Developed in collaboration with researchers, Dytective combines language exercises with machine learning to screen for dyslexia in just 15 minutes. Backed by scientific validation and used by schools and families worldwide, it empowers early intervention and promotes equal opportunities in education.

Objective(s)

Human Agency & Control Safety

Related lifecycle stage(s)

Operate & monitor Deploy

FloodMapp - ForeCast, NowCast and PostCast

AustraliaUploaded on May 22, 2025

FloodMapp is a technology company that specialises in rapid real-time flood forecasting and flood inundation mapping to provide greater warning time and situational awareness.

Objective(s)

Robustness Safety

Pano - Actionable Intelligence for Wildfire Management

TechnicalUnited StatesUploaded on May 22, 2025

Pano - Actionable Intelligence for Wildfire Management is an advanced, connected platform designed for fire professionals, enabling them to detect threats, verify fires, and share critical information with response teams faster than ever before.

Objective(s)

Human Agency & Control Safety

SeismicAI's Earthquake Early Warning Systems

TechnicalEducationalMexicoUnited StatesIsraelUploaded on May 19, 2025

SeismicAI is a provider of innovative Earthquake Early Warning Systems (EEW) ensuring earthquake preparedness. SeismicAI's algorithms utilise local sensors to issue high-precision alerts for earthquake preparedness. The system covers the full early warning cycle - from monitoring and reporting, through alerts, to optionally triggering automated preventive actions.

Objective(s)

Robustness Safety

Related lifecycle stage(s)

Operate & monitor

PetaBencana.id Bot

EducationalIndonesiaUploaded on May 14, 2025

PetaBencana.id leverages AI to provide residents, government agencies, and first responders with a real-time disaster mapping platform for Indonesia.

Objective(s)

Human Agency & Control Safety

Risk Atlas Nexus

TechnicalIrelandUploaded on May 2, 2025

Risk Atlas Nexus provides tooling to connect fragmented AI governance resources through a community-driven approach to curation of linkages between risks, datasets, benchmarks, and mitigations. It transforms abstract risk definitions into actionable AI governance workflows.

Objective(s)

Safety Transparency

Related lifecycle stage(s)

Operate & monitor Verify & validate Plan & design

A Frontier AI Risk Management Framework: Bridging the Gap Between Current AI Practices and Established Risk Management

ProceduralFranceUploaded on Apr 2, 2025

This tool provides a comprehensive risk management framework for frontier AI development, integrating established risk management principles with AI-specific practices. It combines four key components: risk identification through systematic methods, quantitative risk analysis, targeted risk treatment measures, and clear governance structures.

Objective(s)

Safety Data Governance & Traceability

Related lifecycle stage(s)

Build & interpret model

Behavior Elicitation Tool

TechnicalFranceEuropean UnionUploaded on Mar 24, 2025

Behavior Elicitation Tool (BET) is a complex-AI system that systematically probes and elicits specific behaviors from cutting-edge LLMs. Whether for red-teaming or targeted behavioral analysis, this automated solution is Dynamic Optimized and Adversarial (DAO) and can be configured to test the robustness precisely and help to have a better control of the AI system.

Objective(s)

Human Agency & Control Safety

Related lifecycle stage(s)

Deploy Verify & validate Build & interpret model

COMPL-AI

TechnicalSwitzerlandEuropean UnionUploaded on Jan 24, 2025

COMPL-AI is an open-source compliance-centered evaluation framework for Generative AI models

Objective(s)

Safety Data Governance & Traceability

Related lifecycle stage(s)

Operate & monitor Verify & validate

CEN ISO/TR 22100-5:2022 - Safety of machinery - Relationship with ISO 12100 - Part 5: Implications of artificial intelligence machine learning

ProceduralUploaded on Jan 6, 2025

This document addresses how artificial intelligence machine learning can impact the safety of machinery and machinery systems.

Objective(s)

Robustness Safety

AIxploit

TechnicalFranceUploaded on Dec 6, 2024

AIxploit is a tool designed to evaluate and enhance the robustness of Large Language Models (LLMs) through adversarial testing. This tool simulates various attack scenarios to identify vulnerabilities and weaknesses in LLMs, ensuring they are more resilient and reliable in real-world applications.

Objective(s)

Robustness Safety

Related lifecycle stage(s)

Operate & monitor Verify & validate

Adversa: AI Red Teaming Platform

TechnicalUploaded on Dec 6, 2024

Continuous proactive AI red teaming platform for AI and GenAI models, applications and agents.

Objective(s)

Robustness Safety

Related lifecycle stage(s)

Verify & validate Build & interpret model Plan & design

garak

TechnicalUploaded on Nov 5, 2024

garak, Generative AI Red-teaming & Assessment Kit, is an LLM vulnerability scanner. Garak checks if an LLM can be made to fail.

Objective(s)

Safety Digital Security

Related lifecycle stage(s)

Operate & monitor Verify & validate

HarmBench

TechnicalInternationalUploaded on Nov 5, 2024

A fast, scalable, and open-source framework for evaluating automated red teaming methods and LLM attacks/defenses. HarmBench has out-of-the-box support for transformers-compatible LLMs, numerous closed-source APIs, and several multimodal models.

Objective(s)

Fairness Safety

Responsible innovation toolkit: Harms modeling framework

TechnicalUnited StatesUploaded on Sep 9, 2024

Harms Modeling is a practice designed to help you anticipate the potential for harm, identify gaps in product that could put people at risk, and ultimately create approaches that proactively address harm.

Objective(s)

Human Agency & Control Safety

Dioptra

TechnicalUnited StatesUploaded on Sep 9, 2024

Dioptra is an open source software test platform for assessing the trustworthy characteristics of artificial intelligence (AI). It helps developers on determining which types of attacks may impact negatively their model's performance.

Objective(s)

Human Agency & Control Safety

Related lifecycle stage(s)

Operate & monitor Verify & validate Build & interpret model

BELLS - Benchmarks for the Evaluation of LLM Safeguards

TechnicalFranceUploaded on Aug 2, 2024

Evaluate input-output safeguards for LLM systems such as jailbreak and hallucination detectors, to understand how good they are and on which type of inputs they fail.

Objective(s)

Robustness Safety

Related lifecycle stage(s)

Operate & monitor Verify & validate

ISO/IEC 25023 - Application of ISO 14971 to machine learning in artificial intelligence. Guide

ProceduralUploaded on Jul 3, 2024

Defining quality measures for quantitatively evaluating system and software product quality in terms of characteristics and subcharacteristics defined in ISO/IEC 25010 and is intended to be used together with ISO/IEC 25010.

Objective(s)

Safety Data Governance & Traceability

DIN SPEC 92001-2 - Artificial Intelligence - Life Cycle Processes and Quality Requirements - Part 2: Robustness

ProceduralUploaded on Jul 2, 2024

The DIN SPEC series describes a number of AI quality requirements which are structured using an AI quality meta model. The DIN SPEC series applies to all phases of the life cycle of an AI module.

Objective(s)

Robustness Safety

Partnership on AI

Disclaimer: The tools and metrics featured herein are solely those of the originating authors and are not vetted or endorsed by the OECD or its member countries. The Organisation cannot be held responsible for possible issues resulting from the posting of links to third parties' tools and metrics on this catalogue. More on the methodology can be found at https://oecd.ai/catalogue/faq.

Type

Origin

Scope

SUBMIT A TOOL

AI Screener

Dytective

FloodMapp - ForeCast, NowCast and PostCast

Pano - Actionable Intelligence for Wildfire Management

SeismicAI's Earthquake Early Warning Systems

PetaBencana.id Bot

Risk Atlas Nexus

A Frontier AI Risk Management Framework: Bridging the Gap Between Current AI Practices and Established Risk Management

Behavior Elicitation Tool

COMPL-AI

CEN ISO/TR 22100-5:2022 - Safety of machinery - Relationship with ISO 12100 - Part 5: Implications of artificial intelligence machine learning

AIxploit

Adversa: AI Red Teaming Platform

garak

HarmBench

Responsible innovation toolkit: Harms modeling framework

Dioptra

BELLS - Benchmarks for the Evaluation of LLM Safeguards

ISO/IEC 25023 - Application of ISO 14971 to machine learning in artificial intelligence. Guide

DIN SPEC 92001-2 - Artificial Intelligence - Life Cycle Processes and Quality Requirements - Part 2: Robustness