FrugalScore

Catalogue of Tools & Metrics for Trustworthy AI

These tools and metrics are designed to help AI actors develop and use trustworthy AI systems and applications that respect human rights and are fair, transparent, explainable, robust, secure and safe.

FrugalScore is a reference-based metric for evaluating Natural Language Generation (NLG) models. It is based on a distillation approach that makes it possible to learn a fixed, low-cost version of any expensive NLG metric while retaining most of its original performance.

The FrugalScore models are obtained by continuing the pretraining of small models on a synthetic dataset constructed using summarization, backtranslation and denoising models. During training, the small models learn the internal mapping of the expensive metric, including any similarity function it uses.
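The distillation idea described above can be illustrated with a deliberately tiny sketch. This is not the FrugalScore implementation: the "expensive" teacher here is a toy token-overlap F1, the student is a three-weight linear model rather than a small pretrained language model, and every name (`teacher_metric`, `features`, `make_pairs`, `train_student`) is hypothetical. It only shows the training signal: the student is fitted to reproduce the teacher's scores on synthetic (reference, candidate) pairs.

```python
import random


def teacher_metric(reference: str, candidate: str) -> float:
    """Toy stand-in for an expensive metric: F1 over unique tokens (assumption)."""
    ref, cand = set(reference.split()), set(candidate.split())
    common = len(ref & cand)
    if common == 0:
        return 0.0
    precision = common / len(cand)
    recall = common / len(ref)
    return 2 * precision * recall / (precision + recall)


def features(reference: str, candidate: str) -> list:
    """Cheap student inputs: bias term, Jaccard overlap, length ratio."""
    ref, cand = set(reference.split()), set(candidate.split())
    union = ref | cand
    jaccard = len(ref & cand) / len(union) if union else 0.0
    length_ratio = min(len(ref), len(cand)) / max(len(ref), len(cand), 1)
    return [1.0, jaccard, length_ratio]


def make_pairs(n: int, seed: int = 0) -> list:
    """Synthetic pairs: each candidate is a randomly corrupted copy of its reference."""
    rng = random.Random(seed)
    vocab = [f"w{i}" for i in range(30)]
    pairs = []
    for _ in range(n):
        ref = rng.sample(vocab, 8)
        noise = rng.random()  # corruption level for this pair
        cand = [rng.choice(vocab) if rng.random() < noise else tok for tok in ref]
        pairs.append((" ".join(ref), " ".join(cand)))
    return pairs


def train_student(pairs: list, epochs: int = 200, lr: float = 0.1) -> list:
    """Fit the student to the teacher's scores with SGD on squared error."""
    data = [(features(r, c), teacher_metric(r, c)) for r, c in pairs]
    weights = [0.0, 0.0, 0.0]
    for _ in range(epochs):
        for x, y in data:
            err = sum(w * xi for w, xi in zip(weights, x)) - y
            weights = [w - lr * err * xi for w, xi in zip(weights, x)]
    return weights


def student_score(weights: list, reference: str, candidate: str) -> float:
    """Cheap score: one dot product instead of a call to the teacher."""
    return sum(w * xi for w, xi in zip(weights, features(reference, candidate)))


def mean_abs_error(weights: list, pairs: list) -> float:
    """Average gap between student and teacher scores on the given pairs."""
    gaps = [abs(student_score(weights, r, c) - teacher_metric(r, c)) for r, c in pairs]
    return sum(gaps) / len(gaps)


if __name__ == "__main__":
    weights = train_student(make_pairs(200, seed=0))
    held_out = make_pairs(100, seed=1)
    print("student vs. teacher MAE:", round(mean_abs_error(weights, held_out), 3))
```

Once trained, the student is queried on its own and the teacher is no longer needed at evaluation time, which is where the cost saving comes from. In the real setting, the student would be a small pretrained model and the teacher a metric such as a large embedding-based scorer.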

FrugalScore's main contribution to Trustworthy AI is through Environmental Sustainability. By enabling cheaper, lighter, and faster evaluation of NLG models, it reduces the computational resources and energy required for model assessment, thereby mitigating environmental impact. This aligns with the objective of improving sustainable practices in AI development and use.

About the metric

Objective(s): Environmental Sustainability

Purpose(s): Forecasting/prediction

Lifecycle stage(s): Build & interpret model

Target users: Developer

Risk management stage(s): Assess, Govern, Treat

Disclaimer: The tools and metrics featured herein are solely those of the originating authors and are not vetted or endorsed by the OECD or its member countries. The Organisation cannot be held responsible for possible issues resulting from the posting of links to third parties' tools and metrics on this catalogue. More on the methodology can be found at https://oecd.ai/catalogue/faq.