PyRIT
PyRIT (Python Risk Identification Tool) is a library developed by Microsoft's AI Red Team to help researchers and engineers assess the robustness of their generative AI systems against different harm categories, such as fabrication/ungrounded content (e.g., hallucination), misuse (e.g., bias), and prohibited content (e.g., harassment).
PyRIT automates routine AI red teaming tasks, freeing operators to focus on more complex and time-consuming work. It can also identify security harms such as misuse (e.g., malware generation, jailbreaking) and privacy harms (e.g., identity theft).
The goal is to give researchers a baseline for how well their model and entire inference pipeline perform against different harm categories, and to let them compare that baseline against future iterations of the model. This provides empirical data on how the model performs today and makes it possible to detect regressions introduced by later changes. The tool also lets researchers iterate on and improve their mitigations against different harms. For example, at Microsoft we are using this tool to iterate on different versions of a product (and its metaprompt) so that we can more effectively protect against prompt injection attacks.
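In practice, a PyRIT run pairs a prompt target (the generative AI endpoint under test) with an orchestrator that automates sending probe prompts and recording the responses. The sketch below shows that shape; the class and parameter names (`OpenAIChatTarget`, `PromptSendingOrchestrator`, `objective_target`, `send_prompts_async`) follow PyRIT's published examples but have varied across releases, so treat this as a hedged illustration rather than a version-exact snippet.

```python
# Illustrative sketch only: class and parameter names follow PyRIT's
# published examples but have changed across releases, so check them
# against the version of PyRIT you install.
import asyncio

from pyrit.orchestrator import PromptSendingOrchestrator
from pyrit.prompt_target import OpenAIChatTarget


async def main() -> None:
    # The prompt target wraps the generative AI endpoint under test
    # (endpoint and key are typically read from environment variables).
    target = OpenAIChatTarget()

    # The orchestrator automates sending probe prompts to the target
    # and records the conversations so they can be scored and compared
    # against a baseline later.
    orchestrator = PromptSendingOrchestrator(objective_target=target)
    await orchestrator.send_prompts_async(
        prompt_list=["<probe prompt drawn from a harm-category dataset>"]
    )


if __name__ == "__main__":
    asyncio.run(main())
```

A typical workflow repeats such runs with datasets of probe prompts for each harm category and compares the scored results against the baseline from earlier model iterations.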
About the tool
Tags:
- responsible ai
- ai risks
- ai security
- adversarial ai
- security and resilience
- safety
- red-teaming
- llm security
- ai red teaming
- ai safety