Catalogue of Tools & Metrics for Trustworthy AI

These tools and metrics are designed to help AI actors develop and use trustworthy AI systems and applications that respect human rights and are fair, transparent, explainable, robust, secure and safe.

IndT5: A Text-to-Text Transformer for 10 Indigenous Languages



Transformer language models have become fundamental components of natural language processing pipelines. Although several Transformer models have been introduced to serve many languages, there is a shortage of models pre-trained for low-resource and Indigenous languages. IndT5 is the first Transformer language model for Indigenous languages. To train IndT5, the authors built IndCorpus, a new dataset covering ten Indigenous languages and Spanish. They also apply IndT5 to machine translation, investigating different approaches to translating between Spanish and the Indigenous languages. IndT5 and IndCorpus are publicly available for research.

About the tool


GitHub stars:

  • 4

GitHub forks:

  • 1

Use Cases

There are no use cases for this tool yet.

Disclaimer: The tools and metrics featured herein are solely those of the originating authors and are not vetted or endorsed by the OECD or its member countries. The Organisation cannot be held responsible for possible issues resulting from the posting of links to third parties' tools and metrics on this catalogue. More on the methodology can be found at https://oecd.ai/catalogue/faq.