Catalogue of Tools & Metrics for Trustworthy AI

These tools and metrics are designed to help AI actors develop and use trustworthy AI systems and applications that respect human rights and are fair, transparent, explainable, robust, secure and safe.

Overview Tools Metrics About the catalogue

AuditNLG

Website

Github

AuditNLG is an open-source Python library designed to assess and improve the trustworthiness of text generated by large language models (LLMs). It helps reduce risks associated with using generative AI by evaluating outputs across three key dimensions:

Factualness – checks whether a text is consistent with provided or external knowledge, ensuring it is free from hallucinations and aligns with factual information.
Safety – detects harmful, biased, toxic, or sensitive content, including hate speech, identity attacks, violence, sexual or profane content, and other unsafe language.
Constraint adherence – evaluates whether the text follows explicit or implicit instructions, formats, styles, or target audience requirements.

The toolkit integrates multiple evaluation approaches, including LLM-based scoring, QA-based factuality verification, and specialised safety classifiers, providing quantitative scores, structured metadata, and textual explanations for each assessed output. It supports both API-based models (e.g., OpenAI) and local open-source models, allowing flexible deployment depending on infrastructure, privacy, or compliance needs.

AuditNLG also features the PromptHelper module, which leverages self-refinement and rewriting techniques to automatically suggest improved versions of problematic outputs. This enhances factual accuracy, safety, and compliance with constraints.

The system uses structured JSON inputs containing prompts, generated outputs, and optional grounding knowledge. It can be accessed through a Python package or command-line interface, making it suitable for integration into evaluation pipelines, AI research workflows, governance processes, and auditing frameworks.

The toolkit is primarily intended for developers, data scientists, and system operators, while also supporting regulators and policy communities seeking structured, auditable assessments of generative AI outputs. By providing standardized metrics and explanations, AuditNLG supports trustworthy AI evaluation, risk management, and governance practices.

As an open-source resource, AuditNLG encourages contributions from the community to expand its methods, improve accessibility, and enhance the evaluation of generative AI across domains.

About the tool

You can click on the links to see the associated tools

Developing organisation(s):

Salesforce

Tool type(s):

Toolkit/software
Technical validation

Objective(s):

Robustness
Safety
Explainability

Impacted stakeholders:

Regulators
Specific policy communities

Purpose(s):

Governance and compliance
Risk management

Target sector(s):

Science & technology
Digital Economy
Corporate governance

Lifecycle stage(s):

Operate & monitor
Verify & validate
Build & interpret model

Type of approach:

Technical
Procedural

Target users:

Data scientist
Developer
Management

Programming languages:

Python

Tags:

trustworthy ai
bias
explainability
safety
ai safety

Modify this tool

Use Cases

There is no use cases for this tool yet.

Would you like to submit a use case for this tool?

If you have used this tool, we would love to know more about your experience.

Add use case

Partnership on AI

Disclaimer: The tools and metrics featured herein are solely those of the originating authors and are not vetted or endorsed by the OECD or its member countries. The Organisation cannot be held responsible for possible issues resulting from the posting of links to third parties' tools and metrics on this catalogue. More on the methodology can be found at https://oecd.ai/catalogue/faq.