Catalogue of Tools & Metrics for Trustworthy AI

These tools and metrics are designed to help AI actors develop and use trustworthy AI systems and applications that respect human rights and are fair, transparent, explainable, robust, secure and safe.

Type

Explainability

Clear all

Origin

Scope

SUBMIT A TOOL

If you have a tool that you think should be featured in the Catalogue of AI Tools & Metrics, we would love to hear from you!

Submit

TechnicalUploaded on Oct 9, 2025
An open-source framework for large language model evaluations. Inspect can be used for a broad range of evaluations that measure coding, agentic tasks, reasoning, knowledge, behavior, and multi-modal understanding.

Related lifecycle stage(s)

Operate & monitorVerify & validate

EducationalMaltaUploaded on Sep 1, 2025<1 day
This is a complete workshop package for the teaching of practical governance tools when using AI in collaborative teams. It covers the theoretical background, the facilitator notes for each phase, and the student workbook.

Uploaded on Aug 4, 2025
KOBI is a groundbreaking reading app specifically designed to support children with dyslexia. Grounded in robust scientific principles and a deep understanding of reading difficulties, KOBI integrates evidence-based methodologies to create an effective and engaging learning experience. Here is the science behind KOBI, highlighting its key features and the research that informs its development.

TechnicalUnited StatesUploaded on May 15, 2025
The GDA leverages aerial imagery, satellite data, and machine learning techniques to evaluate the damage in areas impacted by natural disasters. This tool greatly enhances the efficiency and precision of disaster response operations.

TechnicalProceduralEUUploaded on May 2, 2025
Croissant is an open-source framework developed by MLCommons to standardise dataset descriptions, enhance data discoverability, and facilitate automated use across machine-learning tasks. Croissant ensures datasets are consistently documented by providing structured metadata schemas, improving interoperability, transparency, and ease of integration.

TechnicalIndiaUploaded on Apr 3, 2025
The Infosys Responsible AI toolkit provides a set of APIs to integrate safety, security, privacy, explainability, fairness, and hallucination detection into AI solutions, ensuring trustworthiness and transparency.

TechnicalUnited StatesUploaded on Nov 8, 2024
The Python Risk Identification Tool for generative AI (PyRIT) is an open access automation framework to empower security professionals and machine learning engineers to proactively find risks in their generative AI systems.

Related lifecycle stage(s)

Operate & monitorVerify & validate

ProceduralUploaded on Jul 2, 2024
BSI Flex 1890 defines terms, abbreviations, and acronyms for the connected and automated vehicles (CAVs) sector, focused on those relating to vehicles and associated technologies.

ProceduralUploaded on Jul 2, 2024
VDE-AR-E 2842-61-1 specifies a general framework for the development of trustworthy solutions and trustworthy autonomous / cognitive systems, including the requirements for the subsequent phases of Product life cycle (e.

ProceduralUploaded on Jul 2, 2024
The Work Item will identify and describe use cases and scenarios that are enabled with enhanced experience, through the use of network intelligence.

ProceduralUploaded on Jul 2, 2024
This Work Item will provide terms and definitions used within the scope of the ISG ENI, in order to achieve a "common language" across all the ISG ENI documentation.

ProceduralUploaded on Jul 2, 2024
This Recommendation presents an overview of the framework for a language learning system based on speech and natural language processing (NLP) technology.

ProceduralUploaded on Jul 2, 2024
Standard for Ontologies for Robotics and Automation, to represent additional domain-specific concepts, definitions, and axioms commonly used in Autonomous Robotics (AuR).

ProceduralUploaded on Jul 2, 2024
This document establishes terminology for AI and describes concepts in the field of AI.

ProceduralUploaded on Jul 1, 2024
New IEEE Standard defining a set of operator interfaces frequently used in artificial intelligence (AI) applications

TechnicalUnited StatesUploaded on Apr 22, 2024
Visual analysis and diagnostic tools to facilitate machine learning model selection.

Related lifecycle stage(s)

Plan & design

TechnicalIsraelUploaded on Apr 22, 2024
Explainability for Vision Transformers

TechnicalUploaded on Apr 22, 2024
The open big data serving engine. https://vespa.ai

TechnicalUnited StatesUploaded on Apr 18, 2024
An end-to-end model risk management platform that automates model documentation and dramatically simplifies AI model validation.

TechnicalProceduralIsraelUploaded on Apr 11, 2024
Citrusx offers a multifaceted solution to connect all stakeholders in the company through an SDK, user-friendly UI, and automated reporting system.

Partnership on AI

Disclaimer: The tools and metrics featured herein are solely those of the originating authors and are not vetted or endorsed by the OECD or its member countries. The Organisation cannot be held responsible for possible issues resulting from the posting of links to third parties' tools and metrics on this catalogue. More on the methodology can be found at https://oecd.ai/catalogue/faq.