Catalogue of Tools & Metrics for Trustworthy AI

These tools and metrics are designed to help AI actors develop and use trustworthy AI systems and applications that respect human rights and are fair, transparent, explainable, robust, secure and safe.

Type

Robustness & digital security

Clear all

Origin

Scope

SUBMIT A TOOL

If you have a tool that you think should be featured in the Catalogue of AI Tools & Metrics, we would love to hear from you!

SUBMIT
Objective Robustness & digital security

TechnicalUnited StatesUploaded on Nov 8, 2024
The Python Risk Identification Tool for generative AI (PyRIT) is an open access automation framework to empower security professionals and machine learning engineers to proactively find risks in their generative AI systems.

ProceduralSingaporeUploaded on Oct 2, 2024
Resaro offers independent, third-party assurance of mission-critical AI systems. It promotes responsible, safe and robust AI adoption for enterprises, through technical advisory and evaluation of AI systems against emerging regulatory requirements.

ProceduralUploaded on Oct 2, 2024
FairNow is an AI governance software tool that simplifies and centralises AI risk management at scale. To build and maintain trust with customers, organisations must conduct thorough risk assessments on their AI models, ensuring compliance, fairness, and security. Risk assessments also ensure organisations know where to prioritise their AI governance efforts, beginning with high-risk models and use cases.

TechnicalUploaded on Nov 5, 2024
garak, Generative AI Red-teaming & Assessment Kit, is an LLM vulnerability scanner. Garak checks if an LLM can be made to fail.

TechnicalInternationalUploaded on Nov 5, 2024
A fast, scalable, and open-source framework for evaluating automated red teaming methods and LLM attacks/defenses. HarmBench has out-of-the-box support for transformers-compatible LLMs, numerous closed-source APIs, and several multimodal models.

TechnicalUnited StatesUploaded on Sep 9, 2024
Harms Modeling is a practice designed to help you anticipate the potential for harm, identify gaps in product that could put people at risk, and ultimately create approaches that proactively address harm.

TechnicalUnited StatesUploaded on Sep 9, 2024
Dioptra is an open source software test platform for assessing the trustworthy characteristics of artificial intelligence (AI). It helps developers on determining which types of attacks may impact negatively their model's performance.

TechnicalFranceUploaded on Aug 2, 2024
Evaluate input-output safeguards for LLM systems such as jailbreak and hallucination detectors, to understand how good they are and on which type of inputs they fail.

TechnicalUnited StatesUploaded on Aug 2, 2024
AI Security Platform for GenAI and Conversational AI applications. Probe enables security officers and developers identify, mitigate, and monitor AI system security.

ProceduralUploaded on Jul 2, 2024
The DIN SPEC series describes a number of AI quality requirements which are structured using an AI quality meta model. The DIN SPEC series applies to all phases of the life cycle of an AI module.

ProceduralUploaded on Jul 2, 2024
The document highlights quality objectives for organizations responsible for datasets. The document describes control of records during the lifecycle of datasets, including but not limited to data collection, annotation, transfer, utilization, storage, maintenance, updates, retirement, and other activities.

ProceduralUploaded on Jul 2, 2024
This standard defines a framework and architectures for machine learning in which a model is trained using encrypted data that has been aggregated from multiple sources and is processed by a third party trusted execution environment (TEE).

ProceduralUploaded on Jul 2, 2024
In this standard, quality of experience (QoE) assessment is categorized into two subcategories which are perceptual quality and virtual reality (VR) cybersickness.

ProceduralUploaded on Jul 3, 2024
This document addresses bias in relation to AI systems, especially with regards to AI-aided decision-making.

ProceduralUploaded on Jul 1, 2024
The purpose of this work item is to define what would be considered an AI threat and how it might differ from threats to traditional systems.

ProceduralUploaded on Jun 28, 2024
This work item aims to summarize and analyze existing and potential mitigation against threats for AI-based systems.

ProceduralUploaded on Jun 28, 2024
This work item describes the challenges of securing AI-based systems and solutions, including challenges relating to data, algorithms and models in both training and implementation environments.

ProceduralUploaded on Jul 1, 2024
This standard provides test specifications with a set of indicators for interference and adversarial attacks, which can be used to evaluate the robustness of Artificial Intelligence-based Image Recognition services.

EducationalUploaded on Jul 11, 2024<1 week
The DVMS NIST Cybersecurity Framework Overlay System (DVMS NIST-CSF) provides organizations of any size, scale, or complexity an affordable way to mitigate cybersecurity risk to assure digital business performance, resilience & trust

TechnicalUnited KingdomUploaded on Jun 5, 2024
Advai Insight is designed for enterprise-level which require information on key insights and performance indicators. This tool provides monitoring solutions for all models and risks, giving advanced insights into the AI's performance

catalogue Logos

Disclaimer: The tools and metrics featured herein are solely those of the originating authors and are not vetted or endorsed by the OECD or its member countries. The Organisation cannot be held responsible for possible issues resulting from the posting of links to third parties' tools and metrics on this catalogue. More on the methodology can be found at https://oecd.ai/catalogue/faq.