Tools for Trustworthy AI

Catalogue of Tools & Metrics for Trustworthy AI

These tools and metrics are designed to help AI actors develop and use trustworthy AI systems and applications that respect human rights and are fair, transparent, explainable, robust, secure and safe.

Overview Tools Metrics About the catalogue

Show tools Show use cases

Objective Explainability

AI Governance Evaluator

TechnicalProceduralColombiaUploaded on Jun 9, 2026

Web application that allows organizations to assess their level of maturity in artificial intelligence governance and automatically generate a customized roadmap to meet national and international standards

Objective(s)

Related lifecycle stage(s)

Operate & monitor Deploy Build & interpret model

SegMate Toolkit

TechnicalInternationalUploaded on Jun 3, 2026

SegMate is an open source AI Toolkit developed by the Vector Institute, which can help organizations and researchers apply cutting-edge computer vision techniques in the fight against climate change

Objective(s)

Explainability Data Governance & Traceability

Related lifecycle stage(s)

Collect & process data

AuditNLG

TechnicalProceduralUploaded on Jun 3, 2026

AuditNLG is an open-source toolkit for auditing the trustworthiness of generative AI text. It evaluates outputs across three key dimensions: factualness (consistency with knowledge), safety (harmful or biased content), and constraint adherence (compliance with instructions). The tool aggregates multiple state-of-the-art methods and provides scores, explanations, and improved text suggestions via self-refinement prompts. It supports both API-based and local models, enabling flexible integration into evaluation pipelines and governance frameworks.

Objective(s)

Robustness Safety Explainability

Related lifecycle stage(s)

Operate & monitor Verify & validate Build & interpret model

Resaro AI Solutions Quality Index Engineer (ASQI Engineer)

TechnicalUploaded on Jan 19, 2026

ASQI Engineer is an open-source framework for testing and assuring AI systems. Built for scale and reliability, it uses containerised test packages, automated assessments, and repeatable workflows to make evaluation transparent and robust. With ASQI Engineer, organisations also run ASQIs that they have created themselves, giving teams full control and confidence in AI quality.

Objective(s)

Explainability Digital Security

Related lifecycle stage(s)

Operate & monitor Deploy Verify & validate

Inspect

TechnicalUploaded on Oct 9, 2025

An open-source framework for large language model evaluations. Inspect can be used for a broad range of evaluations that measure coding, agentic tasks, reasoning, knowledge, behavior, and multi-modal understanding.

Objective(s)

Transparency Explainability

Related lifecycle stage(s)

Operate & monitor Verify & validate

AI Collaboration Workshop in Professional Practice

EducationalMaltaUploaded on Sep 1, 2025<1 day

This is a complete workshop package for the teaching of practical governance tools when using AI in collaborative teams. It covers the theoretical background, the facilitator notes for each phase, and the student workbook.

Objective(s)

Human Agency & Control Explainability

Related lifecycle stage(s)

Operate & monitor Verify & validate

KOBI

Uploaded on Aug 4, 2025

KOBI is a groundbreaking reading app specifically designed to support children with dyslexia. Grounded in robust scientific principles and a deep understanding of reading difficulties, KOBI integrates evidence-based methodologies to create an effective and engaging learning experience. Here is the science behind KOBI, highlighting its key features and the research that informs its development.

Objective(s)

Human Agency & Control Explainability

Geospatial Damage Assessments (GDA) model

TechnicalUnited StatesUploaded on May 15, 2025

The GDA leverages aerial imagery, satellite data, and machine learning techniques to evaluate the damage in areas impacted by natural disasters. This tool greatly enhances the efficiency and precision of disaster response operations.

Objective(s)

Robustness Explainability

Croissant

TechnicalProceduralEUUploaded on May 2, 2025

Croissant is an open-source framework developed by MLCommons to standardise dataset descriptions, enhance data discoverability, and facilitate automated use across machine-learning tasks. Croissant ensures datasets are consistently documented by providing structured metadata schemas, improving interoperability, transparency, and ease of integration.

Objective(s)

Human Agency & Control Explainability

Related lifecycle stage(s)

Verify & validate Collect & process data Plan & design

Infosys Responsible AI Toolkit

TechnicalIndiaUploaded on Apr 3, 2025

The Infosys Responsible AI toolkit provides a set of APIs to integrate safety, security, privacy, explainability, fairness, and hallucination detection into AI solutions, ensuring trustworthiness and transparency.

Objective(s)

Explainability Data Governance & Traceability

Related lifecycle stage(s)

Deploy Plan & design

PyRIT

TechnicalUnited StatesUploaded on Nov 8, 2024

The Python Risk Identification Tool for generative AI (PyRIT) is an open access automation framework to empower security professionals and machine learning engineers to proactively find risks in their generative AI systems.

Objective(s)

Transparency Explainability

Related lifecycle stage(s)

Operate & monitor Verify & validate