chrF (CHaRacter-level F-score) is a metric for machine translation evaluation that computes the similarity between a machine translation output and a reference translation using character n-grams rather than word n-grams. Because it operates on characters, chrF is less sensitive to tokenization choices and to morphological variation, which helps it correlate well with human judgments, especially for morphologically rich languages. The score combines character n-gram precision (chrP) and recall (chrR) into an F-score, with recall weighted more heavily (β = 2 by default).
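A minimal sketch of the computation, assuming the standard settings (character n-grams up to n = 6, β = 2) and the common formulation that averages precision and recall over n-gram orders before taking the F-score; maintained implementations such as sacrebleu may differ in details like whitespace handling, so treat this as illustrative:

```python
from collections import Counter

def char_ngrams(text: str, n: int) -> Counter:
    """Count character n-grams; spaces are removed, as is common for chrF."""
    chars = text.replace(" ", "")
    return Counter(chars[i : i + n] for i in range(len(chars) - n + 1))

def chrf(hypothesis: str, reference: str, max_n: int = 6, beta: float = 2.0) -> float:
    """chrF: F-beta over character n-gram precision/recall, averaged over n = 1..max_n."""
    precisions, recalls = [], []
    for n in range(1, max_n + 1):
        hyp, ref = char_ngrams(hypothesis, n), char_ngrams(reference, n)
        matches = sum((hyp & ref).values())  # clipped n-gram overlap
        if hyp:
            precisions.append(matches / sum(hyp.values()))
        if ref:
            recalls.append(matches / sum(ref.values()))
    chr_p = sum(precisions) / len(precisions) if precisions else 0.0
    chr_r = sum(recalls) / len(recalls) if recalls else 0.0
    if chr_p + chr_r == 0.0:
        return 0.0
    # F-beta with beta = 2 weights recall twice as heavily as precision.
    return (1 + beta**2) * chr_p * chr_r / (beta**2 * chr_p + chr_r)

# Score here is on a 0-1 scale; tools like sacrebleu report 0-100.
print(f"chrF = {chrf('the cat is on the mat', 'the cat sat on the mat'):.3f}")
```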
Monitoring chrF across varied domains and time slices lets engineers spot drops in translation quality early, tightening the feedback loops that keep the system robust to distribution shift.
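As an illustration of that monitoring loop, the sketch below scores each slice with sacrebleu's CHRF (a widely used implementation); the slice data, baselines, and alert threshold are hypothetical stand-ins, not part of any real pipeline:

```python
from sacrebleu.metrics import CHRF  # pip install sacrebleu

chrf = CHRF()  # defaults: character n-grams up to 6, beta = 2; scores on a 0-100 scale

# Hypothetical slices: (system outputs, reference translations) per domain/time window.
# In practice these would come from serving logs plus a held-out reference set.
slices = {
    "news-2024Q1": (["the cat is on the mat"], ["the cat sat on the mat"]),
    "support-2024Q1": (["restart the router first"], ["first, restart the router"]),
}

baselines = {"news-2024Q1": 80.0, "support-2024Q1": 75.0}  # hypothetical reference points
ALERT_DROP = 5.0  # hypothetical: flag any slice that loses more than 5 chrF points

for name, (outputs, references) in slices.items():
    score = chrf.corpus_score(outputs, [references]).score
    if baselines[name] - score > ALERT_DROP:
        print(f"{name}: chrF {score:.1f} (baseline {baselines[name]:.1f}) -- investigate")
    else:
        print(f"{name}: chrF {score:.1f} -- within tolerance")
```

Corpus-level scoring (rather than averaging sentence scores) is the conventional way to report chrF, since it aggregates n-gram statistics across the whole slice before computing the F-score.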
