Accuracy

Catalogue of Tools & Metrics for Trustworthy AI

These tools and metrics are designed to help AI actors develop and use trustworthy AI systems and applications that respect human rights and are fair, transparent, explainable, robust, secure and safe.

Overview Tools Metrics About the catalogue

Github

Website

Accuracy is the proportion of correct predictions among the total number of cases processed. It can be computed with:

Accuracy = (TP + TN) / (TP + TN + FP + FN) , where:

TP: True positive

TN: True negative

FP: False positive

FN: False negative

Trustworthy AI Relevance

This metric addresses Robustness, Safety by quantifying relevant system properties. Direct connections: Accuracy quantifies how often an AI system produces correct outputs and therefore directly measures performance under standard conditions, which is central to Robustness (ability to maintain performance) and Safety (avoiding harmful incorrect outputs). Higher accuracy typically reduces the frequency of errors that could cause harm.

## AI Validation Analysis **Connection to Trustworthy AI Objectives:** Accuracy is a fundamental metric for evaluating AI model performance. It directly relates to Robustness (reliable performance), Safety (preventing harmful predictions), and Transparency (clear performance measurement). **Validation Score:** 4.5/5

Related use cases :

Evaluating Model Robustness and Stability to Dataset Shift

Uploaded on Oct 21, 2022

As the use of machine learning in high impact domains becomes widespread, the importance of evaluating safety has increased. An important aspect of this is evaluating how robus...

Failing Loudly: An Empirical Study of Methods for Detecting Dataset Shift

Uploaded on Oct 21, 2022

We might hope that when faced with unexpected inputs, well-designed software systems would fire off warnings. Machine learning (ML) systems, however, which depend strongly on p...

Explaining the Predictions of Any Classifier

Uploaded on Oct 21, 2022

Despite widespread adoption, machine learning models remain mostly black boxes. Understanding the reasons behind predictions is, however, quite important in assessing trust, wh...

Empirical Risk Minimization under Fairness Constraints

Uploaded on Oct 21, 2022

We address the problem of algorithmic fairness: ensuring that sensitive variables do not unfairly influence the outcome of a classifier. We present an approach based on empiric...

Fairness Constraints: A Flexible Approach for Fair Classification

Uploaded on Oct 21, 2022

Algorithmic decision making is employed in an increasing number of real-world applicationstions to aid human decision making. While it has shown considerable promise in terms o...

Data preprocessing techniques for classification without discrimination

Uploaded on Oct 21, 2022

Recently, the following Discrimination-Aware Classification Problem was introduced: Suppose we are given training data that exhibit unlawful discrimination; e.g., towa...

DetectA: abrupt concept drift detection in non-stationary environments

Uploaded on Oct 21, 2022

Almost all drift detection mechanisms designed for classification problems work reactively: after receiving the complete data set (input patterns and class labels) they apply a...

Model Inversion Attacks that Exploit Confidence Information and Basic Countermeasures

Uploaded on Oct 21, 2022

Machine-learning (ML) algorithms are increasingly utilized in privacy-sensitive applications such as predicting lifestyle choices, making medical diagnoses, and facial recognit...

The Secret Sharer: Evaluating and Testing Unintended Memorization in Neural Networks

Uploaded on Oct 21, 2022

This paper describes a testing methodology for quantitatively assessing the risk that rare or unique training-data sequences are unintentionally memorized by generative sequenc...

Privacy-preserving Neural Representations of Text

Uploaded on Oct 21, 2022

This article deals with adversarial attacks towards deep learning systems for Natural Language Processing (NLP), in the context of privacy protection. We study a specific type ...

Information Maximization Clustering via Multi-View Self-Labelling

Uploaded on Mar 27, 2023

Image clustering is a particularly challenging computer vision task, which aims to generate annotations without human supervision. Recent advances focus on the use of self-supervis...

AE-Net:Adjoint Enhancement Network for Efficient Action Recognition in Video Understanding

Uploaded on Nov 1, 2023

In this paper, we investigate visual-based camera re-localization with neural networks for robotics and autonomous vehicles applications. Our solution is a CNN-based algorithm whic...

AKHCRNet: Bengali Handwritten Character Recognition Using Deep Learning

Uploaded on Nov 1, 2023

Recognizing human non-speech vocalizations is an important task and has broad applications such as automatic sound transcription and health condition monitoring. However, existing ...

AZTR: Aerial Video Action Recognition with Auto Zoom and Temporal Reasoning

Uploaded on Nov 1, 2023

Outcome prediction from clinical text can prevent doctors from overlooking possible risks and help hospitals to plan capacities. We simulate patients at admission time, when decisi...

AsymmNet: Towards ultralight convolution neural networks using asymmetrical bottlenecks

Uploaded on Nov 1, 2023

ImageNet Large Scale Visual Recognition Challenge (ILSVRC) is one of the most authoritative academic competitions in the field of Computer Vision (CV) in recent years. But applying...

Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation

Uploaded on Nov 1, 2023

This work proposes a syntax-enhanced grammatical error correction (GEC) approach named SynGEC that effectively incorporates dependency syntactic information into the encoder part o...

Boosting Contrastive Self-Supervised Learning with False Negative Cancellation

Uploaded on Nov 1, 2023

LiDAR and camera are two important sensors for 3D object detection in autonomous driving. Despite the increasing popularity of sensor fusion in this field, the robustness against i...

Compounding the Performance Improvements of Assembled Techniques in a Convolutional Neural Network

Uploaded on Nov 1, 2023

Recent studies in image classification have demonstrated a variety of techniques for improving the performance of Convolutional Neural Networks (CNNs). However, attempts to combine...

Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation

Uploaded on Nov 1, 2023

We present a weakly supervised instance segmentation algorithm based on deep community learning with multiple tasks. This task is formulated as a combination of weakly supervised o...

DADA: Differentiable Automatic Data Augmentation

Uploaded on Nov 1, 2023

Data augmentation (DA) techniques aim to increase data variability, and thus train deep networks with better generalisation. The pioneering AutoAugment automated the search for opt...

DeBERTa: Decoding-enhanced BERT with Disentangled Attention

Uploaded on Nov 1, 2023

Recent progress in pre-trained neural language models has significantly improved the performance of many natural language processing (NLP) tasks. In this paper we propose a new mod...

DeepViT: Towards Deeper Vision Transformer

Uploaded on Nov 1, 2023

Citations in scientific papers not only help us trace the intellectual lineage but also are a useful indicator of the scientific significance of the work. Citation intents prove be...

DyTox: Transformers for Continual Learning with DYnamic TOken eXpansion

Uploaded on Nov 1, 2023

Large Language Models (LLMs) have shown remarkable performances on a wide range of natural language understanding and generation tasks. We observe that the LLMs provide effective p...

EPE-NAS: Efficient Performance Estimation Without Training for Neural Architecture Search

Uploaded on Nov 1, 2023

Large language models (LLMs) have recently garnered significant interest. With in-context learning, LLMs achieve impressive results in various natural language tasks. However, the ...

ESResNet: Environmental Sound Classification Based on Visual Domain Models

Uploaded on Nov 1, 2023

Environmental Sound Classification (ESC) is an active research area in the audio domain and has seen a lot of progress in the past years. However, many of the existing approaches a...

EVA: Exploring the Limits of Masked Visual Representation Learning at Scale

Uploaded on Nov 1, 2023

Disruptive technologies provides unparalleled opportunities to contribute to the identifications of many aspects in pervasive healthcare, from the adoption of the Internet of Thing...

ElasticFace: Elastic Margin Loss for Deep Face Recognition

Uploaded on Nov 1, 2023

Recent work in open-domain conversational agents has demonstrated that significant improvements in model engagingness and humanness metrics can be achieved via massive scaling in b...

End-to-End Audio Strikes Back: Boosting Augmentations Towards An Efficient Audio Classification Network

Uploaded on Nov 1, 2023

Despite being very powerful in standard learning settings, deep learning models can be extremely brittle when deployed in scenarios different from those on which they were trained....

Enhancing Intra-class Information Extraction for Heterophilous Graphs: One Neural Architecture Search Approach

Uploaded on Nov 1, 2023

Given an image with multiple people, our goal is to directly regress the pose and shape of all the people as well as their relative depth. Inferring the depth of a person in an ima...

Enhancing Prototypical Few-Shot Learning by Leveraging the Local-Level Strategy

Uploaded on Nov 1, 2023

Adversarial purification using generative models demonstrates strong adversarial defense performance. These methods are classifier and attack-agnostic, making them versatile but of...

Exphormer: Sparse Transformers for Graphs

Uploaded on Nov 1, 2023

Quantification of behavior is critical in applications ranging from neuroscience, veterinary medicine and animal conservation efforts. A common key step for behavioral analysis is ...

Exploring Localization for Self-supervised Fine-grained Contrastive Learning

Uploaded on Nov 1, 2023

Semi-supervised object detection (SSOD) has made significant progress with the development of pseudo-label-based end-to-end methods. However, many of these methods face challenges ...

Few-Shot Image Classification via Contrastive Self-Supervised Learning

Uploaded on Nov 1, 2023

The success of Vision Transformer (ViT) in various computer vision tasks has promoted the ever-increasing prevalence of this convolution-free network. The fact that ViT works on im...

Free Lunch for Domain Adversarial Training: Environment Label Smoothing

Uploaded on Nov 1, 2023

With the recent development of Semi-Supervised Object Detection (SS-OD) techniques, object detectors can be improved by using a limited amount of labeled data and abundant unlabele...

From Generalized zero-shot learning to long-tail with class descriptors

Uploaded on Nov 1, 2023

Recent text-to-image models have achieved impressive results. However, since they require large-scale datasets of text-image pairs, it is impractical to train them on new domains w...

Generative Data Augmentation for Commonsense Reasoning

Uploaded on Nov 1, 2023

Recent advances in commonsense reasoning depend on large-scale human-annotated training data to achieve peak performance. However, manual curation of training examples is expensive...

Graph Neural Networks with Learnable and Optimal Polynomial Bases

Uploaded on Nov 1, 2023

Human language expression is based on the subjective construal of the situation instead of the objective truth conditions, which means that speakers' personalities and emotions aft...

Graph-Based High-Order Relation Discovery for Fine-Grained Recognition

Uploaded on Nov 1, 2023

Pedestrian detection benefits from deep learning technology and gains rapid development in recent years. Most of detectors follow general object detection frame, i.e. default boxes...

HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection

Uploaded on Nov 1, 2023

Hate speech is a challenging issue plaguing the online social media. While better models for hate speech detection are continuously being developed, there is little research on the...

Hierarchical Action Classification with Network Pruning

Uploaded on Nov 1, 2023

Research on human action classification has made significant progresses in the past few years. Most deep learning methods focus on improving performance by adding more network comp...

Hierarchically Decomposed Graph Convolutional Networks for Skeleton-Based Action Recognition

Uploaded on Nov 1, 2023

We present TransProteus, a dataset, and methods for predicting the 3D structure, masks, and properties of materials, liquids, and objects inside transparent vessels from a single i...

Improving ProtoNet for Few-Shot Video Object Recognition: Winner of ORBIT Challenge 2022

Uploaded on Nov 1, 2023

Despite advances in feature representation, leveraging geometric relations is crucial for establishing reliable visual correspondences under large variations of images. In this wor...

Improving out-of-distribution generalization via multi-task self-supervised pretraining

Uploaded on Nov 1, 2023

Self-supervised feature representations have been shown to be useful for supervised classification, few-shot learning, and adversarial robustness. We show that features obtained us...

Information Maximization Clustering via Multi-View Self-Labelling

Uploaded on Nov 1, 2023

Score-based diffusion models (SBDMs) have achieved the SOTA FID results in unpaired image-to-image translation (I2I). However, we notice that existing methods totally ignore the tr...

Instance Credibility Inference for Few-Shot Learning

Uploaded on Nov 1, 2023

Few-shot learning (FSL) aims to recognize new objects with extremely limited training data for each category. Previous efforts are made by either leveraging meta-learning paradigm ...

InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning

Uploaded on Nov 1, 2023

Recently, it has attracted much attention to build reliable named entity recognition (NER) systems using limited annotated data. Nearly all existing works heavily rely on domain-sp...

InternVideo: General Video Foundation Models via Generative and Discriminative Learning

Uploaded on Nov 1, 2023

We present the vector quantized diffusion (VQ-Diffusion) model for text-to-image generation. This method is based on a vector quantized variational autoencoder (VQ-VAE) whose laten...

Investigation of deep learning models on identification of minimum signal length for precise classification of conveyor rubber belt loads

Uploaded on Nov 1, 2023

Recent research on the application of remote sensing and deep learning-based analysis in precision agriculture demonstrated a potential for improved crop management and reduced env...

Learn From All: Erasing Attention Consistency for Noisy Label Facial Expression Recognition

Uploaded on Nov 1, 2023

Anomaly detection in video is a challenging computer vision problem. Due to the lack of anomalous events at training time, anomaly detection requires the design of learning methods...

Learning Efficient, Explainable and Discriminative Representations for Pulmonary Nodules Classification

Uploaded on Nov 1, 2023

Methods for extracting audio and speech features have been studied since pioneering work on spectrum analysis decades ago. Recent efforts are guided by the ambition to develop gene...

Learning Representation for Clustering via Prototype Scattering and Positive Sampling

Uploaded on Nov 1, 2023

Driven by the goal of eradicating language barriers on a global scale, machine translation has solidified itself as a key focus of artificial intelligence research today. However, ...

Liquid Structural State-Space Models

Uploaded on Nov 1, 2023

Few-shot object detection is an imperative and long-lasting problem due to the inherent long-tail distribution of real-world data. Its performance is largely affected by the data s...

MV-MR: multi-views and multi-representations for self-supervised learning and knowledge distillation

Uploaded on Nov 1, 2023

The recent success of machine learning methods applied to time series collected from Intensive Care Units (ICU) exposes the lack of standardized machine learning benchmarks for dev...

MWPToolkit: An Open-Source Framework for Deep Learning-Based Math Word Problem Solvers

Uploaded on Nov 1, 2023

Medical image segmentation is one of the most fundamental tasks concerning medical information analysis. Various solutions have been proposed so far, including many deep learning-b...

Maximum Entropy Weighted Independent Set Pooling for Graph Neural Networks

Uploaded on Nov 1, 2023

Nowadays, Semi-Supervised Object Detection (SSOD) is a hot topic, since, while it is rather easy to collect images for creating a new dataset, labeling them is still an expensive a...

Meta Dropout: Learning to Perturb Latent Features for Generalization

Uploaded on Nov 1, 2023

Pre-trained language models (LMs) have been shown to memorize a substantial amount of knowledge from the pre-training corpora; however, they are still limited in recalling factuall...

Meta-Learning with a Geometry-Adaptive Preconditioner

Uploaded on Nov 1, 2023

We propose GANav, a novel group-wise attention mechanism to identify safe and navigable regions in off-road terrains and unstructured environments from RGB images. Our approach cla...

Milking CowMask for Semi-Supervised Image Classification

Uploaded on Nov 1, 2023

In recent computer vision research, the advent of the Vision Transformer (ViT) has rapidly revolutionized various architectural design efforts: ViT achieved state-of-the-art image ...

MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices

Uploaded on Nov 1, 2023

Unlike existing knowledge distillation methods focus on the baseline settings, where the teacher models and training strategies are not that strong and competing as state-of-the-ar...

Model Rubik's Cube: Twisting Resolution, Depth and Width for TinyNets

Uploaded on Nov 1, 2023

Motion, measured via optical flow, provides a powerful cue to discover and learn objects in images and videos. However, compared to using appearance, it has some blind spots, such ...

Multi-Label Compound Expression Recognition: C-EXPR Database & Network

Uploaded on Nov 1, 2023

We present a neural network architecture for medical image segmentation of diabetic foot ulcers and colonoscopy polyps. Diabetic foot ulcers are caused by neuropathic and vascular ...

Multi-Semantic Fusion Model for Generalized Zero-Shot Skeleton-Based Action Recognition

Uploaded on Nov 1, 2023

Domain generalization is the task of learning models that generalize to unseen target domains. We propose a simple yet effective method for domain generalization, named cross-domai...

Multiscale Vision Transformers

Uploaded on Nov 1, 2023

We tackle the Few-Shot Open-Set Recognition (FSOSR) problem, i.e. classifying instances among a set of classes for which we only have a few labeled samples, while simultaneously de...

New Benchmarks for Learning on Non-Homophilous Graphs

Uploaded on Nov 1, 2023

In recent years, there is strong emphasis on mining medical data using machine learning techniques. A common problem is to obtain a noiseless set of textual documents, with a relev...

NoiseRank: Unsupervised Label Noise Reduction with Dependence Models

Uploaded on Nov 1, 2023

Label noise is increasingly prevalent in datasets acquired from noisy channels. Existing approaches that detect and remove label noise generally rely on some form of supervision, w...

OBoW: Online Bag-of-Visual-Words Generation for Self-Supervised Learning

Uploaded on Nov 1, 2023

Learning image representations without human supervision is an important and active research field. Several recent approaches have successfully leveraged the idea of making such a ...

On Evolving Attention Towards Domain Adaptation

Uploaded on Nov 1, 2023

We formulate monocular depth estimation using denoising diffusion models, inspired by their recent successes in high fidelity image generation. To that end, we introduce innovation...

Online Graph Dictionary Learning

Uploaded on Nov 1, 2023

Hand gesture serves as a crucial role during the expression of sign language. Current deep learning based methods for sign language understanding (SLU) are prone to over-fitting du...

Optimal Representations for Covariate Shift

Uploaded on Nov 1, 2023

The drone has been used for various purposes, including military applications, aerial photography, and pesticide spraying. However, the drone is vulnerable to external disturbances...

PaLI-3 Vision Language Models: Smaller, Faster, Stronger

Uploaded on Nov 1, 2023

The interest of the machine learning community in image synthesis has grown significantly in recent years, with the introduction of a wide range of deep generative models and means...

Patch-Mix Transformer for Unsupervised Domain Adaptation: A Game Perspective

Uploaded on Nov 1, 2023

The task of long-form question answering (LFQA) involves retrieving documents relevant to a given question and using them to generate a paragraph-length answer. While many models h...

Pattern Attention Transformer with Doughnut Kernel

Uploaded on Nov 1, 2023

Image retrieval task consists of finding similar images to a query image from a set of gallery (database) images. Such systems are used in various applications e.g. person re-ident...

Penalizing the Hard Example But Not Too Much: A Strong Baseline for Fine-Grained Visual Classification

Uploaded on Nov 1, 2023

Biological systems perceive the world by simultaneously processing high-dimensional inputs from modalities as diverse as vision, audition, touch, proprioception, etc. The perceptio...

Point Cloud Classification Using Content-based Transformer via Clustering in Feature Space

Uploaded on Nov 1, 2023

In many sequential decision-making problems (e.g., robotics control, game playing, sequential prediction), human or expert data is available containing useful information about the...

PointASNL: Robust Point Clouds Processing using Nonlocal Neural Networks with Adaptive Sampling

Uploaded on Nov 1, 2023

Raw point clouds data inevitably contains outliers or noise through acquisition from 3D sensors or reconstruction algorithms. In this paper, we present a novel end-to-end network f...

Pushing on Text Readability Assessment: A Transformer Meets Handcrafted Linguistic Features

Uploaded on Nov 1, 2023

Denoising Diffusion Probabilistic Models have shown an impressive generation quality, although their long sampling chain leads to high computational costs. In this paper, we observ...

Pyramid Adversarial Training Improves ViT Performance

Uploaded on Nov 1, 2023

Autonomous driving on water surfaces plays an essential role in executing hazardous and time-consuming missions, such as maritime surveillance, survivors rescue, environmental moni...

RASAT: Integrating Relational Structures into Pretrained Seq2Seq Model for Text-to-SQL

Uploaded on Nov 1, 2023

Medical image segmentation is important for computer-aided diagnosis. Good segmentation demands the model to see the big picture and fine details simultaneously, i.e., to learn ima...

Random Dilated Shapelet Transform: A New Approach for Time Series Shapelets

Uploaded on Nov 1, 2023

Transformer-based models have been widely demonstrated to be successful in computer vision tasks by modelling long-range dependencies and capturing global representations. However,...

ReClor: A Reading Comprehension Dataset Requiring Logical Reasoning

Uploaded on Nov 1, 2023

Recent powerful pre-trained language models have achieved remarkable performance on most of the popular datasets for reading comprehension. It is time to introduce more challenging...

ResNet strikes back: An improved training procedure in timm

Uploaded on Nov 1, 2023

Graph and hypergraph representation learning has attracted increasing attention from various research fields. Despite the decent performance and fruitful applications of Graph Neur...

Rethinking Domain Generalization Baselines

Uploaded on Nov 1, 2023

In this paper, we exploit the innate document segment structure for improving the extractive summarization task. We build two text segmentation models and find the most optimal str...

SNoRe: Scalable Unsupervised Learning of Symbolic Node Representations

Uploaded on Nov 1, 2023

Learning from complex real-life networks is a lively research area, with recent advances in learning information-rich, low-dimensional network node representations. However, state-...

SP-ViT: Learning 2D Spatial Priors for Vision Transformers

Uploaded on Nov 1, 2023

Automatic pulmonary nodules classification is significant for early diagnosis of lung cancers. Recently, deep learning techniques have enabled remarkable progress in this field. Ho...

SSR: An Efficient and Robust Framework for Learning with Unknown Label Noise

Uploaded on Nov 1, 2023

Jointly processing information from multiple sensors is crucial to achieving accurate and robust perception for reliable autonomous driving systems. However, current 3D perception ...

Sample Selection with Uncertainty of Losses for Learning with Noisy Labels

Uploaded on Nov 1, 2023

In this paper, we introduce a framework ARBEx, a novel attentive feature extraction framework driven by Vision Transformer with reliability balancing to cope against poor class dis...

Selecting Relevant Features from a Multi-domain Representation for Few-shot Classification

Uploaded on Nov 1, 2023

Autonomous driving is a popular research area within the computer vision research community. Since autonomous vehicles are highly safety-critical, ensuring robustness is essential ...

Self-Supervised Learning with Data Augmentations Provably Isolates Content from Style

Uploaded on Nov 1, 2023

Previous attempts to incorporate a mention detection step into end-to-end neural coreference resolution for English have been hampered by the lack of singleton mention span data as...

Self-attention Dual Embedding for Graphs with Heterophily

Uploaded on Nov 1, 2023

3D softwares are now capable of producing highly realistic images that look nearly indistinguishable from the real images. This raises the question: can real datasets be enhanced w...

Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting

Uploaded on Nov 1, 2023

Online social media is rife with offensive and hateful comments, prompting the need for their automatic detection given the sheer amount of posts created every second. Creating hig...

Semi-Supervised Recognition under a Noisy and Fine-grained Dataset

Uploaded on Nov 1, 2023

The design choices in the Transformer attention mechanism, including weak inductive bias and quadratic computational complexity, have limited its application for modeling long sequ...

Sequence Alignment Ensemble with a Single Neural Network for Sequence Labeling

Uploaded on Nov 1, 2023

This paper studies a new problem setting of entity alignment for knowledge graphs (KGs). Since KGs possess different sets of entities, there could be entities that cannot find alig...

Shape-Biased Domain Generalization via Shock Graph Embeddings

Uploaded on Nov 1, 2023

Contrastive learning-based video-language representation learning approaches, e.g., CLIP, have achieved outstanding performance, which pursue semantic interaction upon pre-defined ...

Slow-Fast Auditory Streams For Audio Recognition

Uploaded on Nov 1, 2023

Language agents, which use a large language model (LLM) capable of in-context learning to interact with an external environment, have recently emerged as a promising approach to co...

Space-time Mixing Attention for Video Transformer

Uploaded on Nov 1, 2023

The growing popularity of Vision Transformers as the go-to models for image classification has led to an explosion of architectural modifications claiming to be more efficient than...

Stay on topic with Classifier-Free Guidance

Uploaded on Nov 1, 2023

In this work, we propose a permutation invariant language model, SymphonyNet, as a solution for symbolic symphony music generation. We propose a novel Multi-track Multi-instrument ...

Supervised Domain Adaptation: A Graph Embedding Perspective and a Rectified Experimental Protocol

Uploaded on Nov 1, 2023

Single frame data contains finite information which limits the performance of the existing vision-based multi-camera 3D object detection paradigms. For fundamentally pushing the pe...

TAPAS: Weakly Supervised Table Parsing via Pre-training

Uploaded on Nov 1, 2023

Document-level relation extraction aims to extract relations among entities within a document. Compared with its sentence-level counterpart, Document-level relation extraction requ...

TAPEX: Table Pre-training via Learning a Neural SQL Executor

Uploaded on Nov 1, 2023

This paper presents a new framework for open-vocabulary semantic segmentation with the pre-trained vision-language model, named Side Adapter Network (SAN). Our approach models the ...

The Expressive Leaky Memory Neuron: an Efficient and Expressive Phenomenological Neuron Model Can Solve Long-Horizon Tasks

Uploaded on Nov 1, 2023

In LiDAR-based 3D object detection for autonomous driving, the ratio of the object size to input scene size is significantly smaller compared to 2D detection cases. Overlooking thi...

Transfer learning based few-shot classification using optimal transport mapping from preprocessed latent space of backbone neural network

Uploaded on Nov 1, 2023

In-context learning (ICL) for large language models has proven to be a powerful approach for many natural language processing tasks. However, determining the best method to select ...

UPANets: Learning from the Universal Pixel Attention Networks

Uploaded on Nov 1, 2023

Diffusion frameworks have achieved comparable performance with previous state-of-the-art image generation models. Researchers are curious about its variants in discriminative tasks...

UniFormer: Unifying Convolution and Self-attention for Visual Recognition

Uploaded on Nov 1, 2023

The discrepancy between the cost function used for training a speech enhancement model and human auditory perception usually makes the quality of enhanced speech unsatisfactory. Ob...

Unsupervised Embedding Adaptation via Early-Stage Feature Reconstruction for Few-Shot Classification

Uploaded on Nov 1, 2023

3D object detection is a central task for applications such as autonomous driving, in which the system needs to localize and classify surrounding traffic agents, even in the presen...

Unsupervised Few-shot Learning via Deep Laplacian Eigenmaps

Uploaded on Nov 1, 2023

Learning to reject unknown samples (not present in the source classes) in the target domain is fairly important for unsupervised domain adaptation (UDA). There exist two typical UD...

ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision

Uploaded on Nov 1, 2023

A key function of auditory cognition is the association of characteristic sounds with their corresponding semantics over time. Humans attempting to discriminate between fine-graine...

VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Uploaded on Nov 1, 2023

We observe that despite their hierarchical convolutional nature, the synthesis process of typical generative adversarial networks depends on absolute pixel coordinates in an unheal...

WaferSegClassNet -- A Light-weight Network for Classification and Segmentation of Semiconductor Wafer Defects

Uploaded on Nov 1, 2023

Traffic event cognition and reasoning in videos is an important task that has a wide range of applications in intelligent transportation, assisted driving, and autonomous vehicles....

mT5: A massively multilingual pre-trained text-to-text transformer

Uploaded on Nov 1, 2023

The conventional recipe for maximizing model accuracy is to (1) train multiple models with various hyperparameters and (2) pick the individual model which performs best on a held-o...

Graph Transformers without Positional Encodings

Uploaded on Mar 15, 2024

Vision Transformer (ViT) self-attention mechanism is characterized by feature collapse in deeper layers, resulting in the vanishing of low-level visual features. However, such feat...

ReViT: Enhancing Vision Transformers with Attention Residual Connections for Visual Recognition

Uploaded on Mar 15, 2024

In this paper, we revisit techniques for uncertainty estimation within deep neural networks and consolidate a suite of techniques to enhance their reliability. Our investigation re...

Adaptive Fusion of Single-View and Multi-View Depth for Autonomous Driving

Uploaded on Apr 2, 2024

Multimodal Large Language Models (MLLMs) have experienced significant advancements recently. Nevertheless, challenges persist in the accurate recognition and comprehension of intri...

DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence

Uploaded on Apr 2, 2024

In this paper, we revisit techniques for uncertainty estimation within deep neural networks and consolidate a suite of techniques to enhance their reliability. Our investigation re...

Explore Human Parsing Modality for Action Recognition

Uploaded on Apr 2, 2024

Vision Transformer (ViT) self-attention mechanism is characterized by feature collapse in deeper layers, resulting in the vanishing of low-level visual features. However, such feat...

Learning Semantic Proxies from Visual Prompts for Parameter-Efficient Fine-Tuning in Deep Metric Learning

Uploaded on Apr 2, 2024

With the proposal of the Segment Anything Model (SAM), fine-tuning SAM for medical image segmentation (MIS) has become popular. However, due to the large size of the SAM model and ...

Loss-aware Curriculum Learning for Heterogeneous Graph Neural Networks

Uploaded on Apr 2, 2024

Image classifiers often rely on convolutional neural networks (CNN) for their tasks, which are inherently more heavyweight than multilayer perceptrons (MLPs), which can be problema...

MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations

Uploaded on Apr 2, 2024

Object detectors often perform poorly on data that differs from their training set. Domain adaptive object detection (DAOD) methods have recently demonstrated strong results on add...

Multiple Object Tracking as ID Prediction

Uploaded on Apr 2, 2024

Earth observation (EO) applications involving complex and heterogeneous data sources are commonly approached with machine learning models. However, there is a common assumption tha...

PaddingFlow: Improving Normalizing Flows with Padding-Dimensional Noise

Uploaded on Apr 2, 2024

Multimodal Large Language Models (MLLMs) excel in generating responses based on visual inputs. However, they often suffer from a bias towards generating responses similar to their ...

The VampPrior Mixture Model

Uploaded on Apr 2, 2024

Language models (LMs) have proven to be powerful tools for psycholinguistic research, but most prior work has focused on purely behavioural measures (e.g., surprisal comparisons). ...

You Only Need One Color Space: An Efficient Network for Low-light Image Enhancement

Uploaded on Apr 2, 2024

In this work, we introduce Mini-Gemini, a simple and effective framework enhancing multi-modality Vision Language Models (VLMs). Despite the advancements in VLMs facilitating basic...

Human-Computer Trust Scale (HCTS)

Uploaded on Apr 10, 2024

The Human-Computer Trust scale (HCTS) is a simple, nine-item attitude Likert scale that gives a global view of subjective assessments of trust in technology.

The HCTS resu...

A Bag of Tricks for Few-Shot Class-Incremental Learning

Uploaded on Apr 22, 2024

Large language models (LLMs) have shown great potential in complex reasoning tasks, yet their performance is often hampered by the scarcity of high-quality and reasoning-focused tr...

A Single Graph Convolution Is All You Need: Efficient Grayscale Image Classification

Uploaded on Apr 22, 2024

Image classifiers often rely on convolutional neural networks (CNN) for their tasks, which are inherently more heavyweight than multilayer perceptrons (MLPs), which can be problema...

CBVS: A Large-Scale Chinese Image-Text Benchmark for Real-World Short Video Search Scenarios

Uploaded on Apr 22, 2024

Table-based reasoning with large language models (LLMs) is a promising direction to tackle many table understanding tasks, such as table-based question answering and fact verificat...

Do You Remember? Dense Video Captioning with Cross-Modal Memory Retrieval

Uploaded on Apr 22, 2024

Vision Transformer (ViT) self-attention mechanism is characterized by feature collapse in deeper layers, resulting in the vanishing of low-level visual features. However, such feat...

Efficient Image Super-Resolution via Symmetric Visual Attention Network

Uploaded on Apr 22, 2024

In this paper, we introduce an open-vocabulary panoptic segmentation model that effectively unifies the strengths of the Segment Anything Model (SAM) with the vision-language CLIP ...

HANet: A Hierarchical Attention Network for Change Detection With Bitemporal Very-High-Resolution Remote Sensing Images

Uploaded on Apr 22, 2024

Multimodal Large Language Models (MLLMs) have experienced significant advancements recently. Nevertheless, challenges persist in the accurate recognition and comprehension of intri...

MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining

Uploaded on Apr 22, 2024

We propose a novel model-selection method for dynamic real-life networks. Our approach involves training a classifier on a large body of synthetic network data. The data is generat...

ProMISe: Promptable Medical Image Segmentation using SAM

Uploaded on Apr 22, 2024

This paper introduces fourteen novel datasets for the evaluation of Large Language Models' safety in the context of enterprise tasks. A method was devised to evaluate a model's saf...

Proprioception Is All You Need: Terrain Classification for Boreal Forests

Uploaded on Apr 22, 2024

In Multiple Object Tracking (MOT), tracking-by-detection methods have stood the test for a long time, which split the process into two parts according to the definition: object det...

ShapeLLM: Universal 3D Object Understanding for Embodied Interaction

Uploaded on Apr 22, 2024

The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder2. In partnership wi...

Shrinking Your TimeStep: Towards Low-Latency Neuromorphic Object Recognition with Spiking Neural Network

Uploaded on Apr 22, 2024

Multimodal Large Language Models (MLLMs) excel in generating responses based on visual inputs. However, they often suffer from a bias towards generating responses similar to their ...

A Bag of Tricks for Few-Shot Class-Incremental Learning

Uploaded on May 21, 2024

Large language models (LLMs) have shown great potential in complex reasoning tasks, yet their performance is often hampered by the scarcity of high-quality and reasoning-focused tr...

A Single Graph Convolution Is All You Need: Efficient Grayscale Image Classification

Uploaded on May 21, 2024

Image classifiers often rely on convolutional neural networks (CNN) for their tasks, which are inherently more heavyweight than multilayer perceptrons (MLPs), which can be problema...

CBVS: A Large-Scale Chinese Image-Text Benchmark for Real-World Short Video Search Scenarios

Uploaded on May 21, 2024

Table-based reasoning with large language models (LLMs) is a promising direction to tackle many table understanding tasks, such as table-based question answering and fact verificat...

Do You Remember? Dense Video Captioning with Cross-Modal Memory Retrieval

Uploaded on May 21, 2024

Vision Transformer (ViT) self-attention mechanism is characterized by feature collapse in deeper layers, resulting in the vanishing of low-level visual features. However, such feat...

Efficient Image Super-Resolution via Symmetric Visual Attention Network

Uploaded on May 21, 2024

In this paper, we introduce an open-vocabulary panoptic segmentation model that effectively unifies the strengths of the Segment Anything Model (SAM) with the vision-language CLIP ...

HANet: A Hierarchical Attention Network for Change Detection With Bitemporal Very-High-Resolution Remote Sensing Images

Uploaded on May 21, 2024

Multimodal Large Language Models (MLLMs) have experienced significant advancements recently. Nevertheless, challenges persist in the accurate recognition and comprehension of intri...

MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining

Uploaded on May 21, 2024

We propose a novel model-selection method for dynamic real-life networks. Our approach involves training a classifier on a large body of synthetic network data. The data is generat...

ProMISe: Promptable Medical Image Segmentation using SAM

Uploaded on May 21, 2024

This paper introduces fourteen novel datasets for the evaluation of Large Language Models' safety in the context of enterprise tasks. A method was devised to evaluate a model's saf...

Proprioception Is All You Need: Terrain Classification for Boreal Forests

Uploaded on May 21, 2024

In Multiple Object Tracking (MOT), tracking-by-detection methods have stood the test for a long time, which split the process into two parts according to the definition: object det...

ShapeLLM: Universal 3D Object Understanding for Embodied Interaction

Uploaded on May 21, 2024

The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder2. In partnership wi...

Shrinking Your TimeStep: Towards Low-Latency Neuromorphic Object Recognition with Spiking Neural Network

Uploaded on May 21, 2024

Multimodal Large Language Models (MLLMs) excel in generating responses based on visual inputs. However, they often suffer from a bias towards generating responses similar to their ...

A Bag of Tricks for Few-Shot Class-Incremental Learning

Uploaded on Jun 5, 2024

Large language models (LLMs) have shown great potential in complex reasoning tasks, yet their performance is often hampered by the scarcity of high-quality and reasoning-focused tr...

A Single Graph Convolution Is All You Need: Efficient Grayscale Image Classification

Uploaded on Jun 5, 2024

Image classifiers often rely on convolutional neural networks (CNN) for their tasks, which are inherently more heavyweight than multilayer perceptrons (MLPs), which can be problema...

CBVS: A Large-Scale Chinese Image-Text Benchmark for Real-World Short Video Search Scenarios

Uploaded on Jun 5, 2024

Table-based reasoning with large language models (LLMs) is a promising direction to tackle many table understanding tasks, such as table-based question answering and fact verificat...

Do You Remember? Dense Video Captioning with Cross-Modal Memory Retrieval

Uploaded on Jun 5, 2024

Vision Transformer (ViT) self-attention mechanism is characterized by feature collapse in deeper layers, resulting in the vanishing of low-level visual features. However, such feat...

Efficient Image Super-Resolution via Symmetric Visual Attention Network

Uploaded on Jun 5, 2024

In this paper, we introduce an open-vocabulary panoptic segmentation model that effectively unifies the strengths of the Segment Anything Model (SAM) with the vision-language CLIP ...

HANet: A Hierarchical Attention Network for Change Detection With Bitemporal Very-High-Resolution Remote Sensing Images

Uploaded on Jun 5, 2024

Multimodal Large Language Models (MLLMs) have experienced significant advancements recently. Nevertheless, challenges persist in the accurate recognition and comprehension of intri...

MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining

Uploaded on Jun 5, 2024

We propose a novel model-selection method for dynamic real-life networks. Our approach involves training a classifier on a large body of synthetic network data. The data is generat...

ProMISe: Promptable Medical Image Segmentation using SAM

Uploaded on Jun 5, 2024

This paper introduces fourteen novel datasets for the evaluation of Large Language Models' safety in the context of enterprise tasks. A method was devised to evaluate a model's saf...

Proprioception Is All You Need: Terrain Classification for Boreal Forests

Uploaded on Jun 5, 2024

In Multiple Object Tracking (MOT), tracking-by-detection methods have stood the test for a long time, which split the process into two parts according to the definition: object det...

ShapeLLM: Universal 3D Object Understanding for Embodied Interaction

Uploaded on Jun 5, 2024

The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder2. In partnership wi...

Shrinking Your TimeStep: Towards Low-Latency Neuromorphic Object Recognition with Spiking Neural Network

Uploaded on Jun 5, 2024

Multimodal Large Language Models (MLLMs) excel in generating responses based on visual inputs. However, they often suffer from a bias towards generating responses similar to their ...

A Bag of Tricks for Few-Shot Class-Incremental Learning

Uploaded on Jan 9, 2025

Large language models (LLMs) have shown great potential in complex reasoning tasks, yet their performance is often hampered by the scarcity of high-quality and reasoning-focused tr...

A Single Graph Convolution Is All You Need: Efficient Grayscale Image Classification

Uploaded on Jan 9, 2025

Image classifiers often rely on convolutional neural networks (CNN) for their tasks, which are inherently more heavyweight than multilayer perceptrons (MLPs), which can be problema...

CBVS: A Large-Scale Chinese Image-Text Benchmark for Real-World Short Video Search Scenarios

Uploaded on Jan 9, 2025

Table-based reasoning with large language models (LLMs) is a promising direction to tackle many table understanding tasks, such as table-based question answering and fact verificat...

Do You Remember? Dense Video Captioning with Cross-Modal Memory Retrieval

Uploaded on Jan 14, 2025

Vision Transformer (ViT) self-attention mechanism is characterized by feature collapse in deeper layers, resulting in the vanishing of low-level visual features. However, such feat...

Efficient Image Super-Resolution via Symmetric Visual Attention Network

Uploaded on Jan 14, 2025

In this paper, we introduce an open-vocabulary panoptic segmentation model that effectively unifies the strengths of the Segment Anything Model (SAM) with the vision-language CLIP ...

HANet: A Hierarchical Attention Network for Change Detection With Bitemporal Very-High-Resolution Remote Sensing Images

Uploaded on Jan 15, 2025

Multimodal Large Language Models (MLLMs) have experienced significant advancements recently. Nevertheless, challenges persist in the accurate recognition and comprehension of intri...

MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining

Uploaded on Jan 16, 2025

We propose a novel model-selection method for dynamic real-life networks. Our approach involves training a classifier on a large body of synthetic network data. The data is generat...

ProMISe: Promptable Medical Image Segmentation using SAM

Uploaded on Jan 21, 2025

This paper introduces fourteen novel datasets for the evaluation of Large Language Models' safety in the context of enterprise tasks. A method was devised to evaluate a model's saf...

Proprioception Is All You Need: Terrain Classification for Boreal Forests

Uploaded on Jan 24, 2025

In Multiple Object Tracking (MOT), tracking-by-detection methods have stood the test for a long time, which split the process into two parts according to the definition: object det...

ShapeLLM: Universal 3D Object Understanding for Embodied Interaction

Uploaded on Jan 29, 2025

The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder2. In partnership wi...

Shrinking Your TimeStep: Towards Low-Latency Neuromorphic Object Recognition with Spiking Neural Network

Uploaded on Feb 13, 2025

Multimodal Large Language Models (MLLMs) excel in generating responses based on visual inputs. However, they often suffer from a bias towards generating responses similar to their ...

A Bag of Tricks for Few-Shot Class-Incremental Learning

Uploaded on Mar 24, 2025

Large language models (LLMs) have shown great potential in complex reasoning tasks, yet their performance is often hampered by the scarcity of high-quality and reasoning-focused tr...

A Single Graph Convolution Is All You Need: Efficient Grayscale Image Classification

Uploaded on Mar 24, 2025

Image classifiers often rely on convolutional neural networks (CNN) for their tasks, which are inherently more heavyweight than multilayer perceptrons (MLPs), which can be problema...

CBVS: A Large-Scale Chinese Image-Text Benchmark for Real-World Short Video Search Scenarios

Uploaded on Apr 18, 2025

Table-based reasoning with large language models (LLMs) is a promising direction to tackle many table understanding tasks, such as table-based question answering and fact verificat...

Do You Remember? Dense Video Captioning with Cross-Modal Memory Retrieval

Uploaded on Jun 19, 2025

Vision Transformer (ViT) self-attention mechanism is characterized by feature collapse in deeper layers, resulting in the vanishing of low-level visual features. However, such feat...

Efficient Image Super-Resolution via Symmetric Visual Attention Network

Uploaded on Sep 26, 2025

In this paper, we introduce an open-vocabulary panoptic segmentation model that effectively unifies the strengths of the Segment Anything Model (SAM) with the vision-language CLIP ...

HANet: A Hierarchical Attention Network for Change Detection With Bitemporal Very-High-Resolution Remote Sensing Images

Uploaded on Sep 26, 2025

Multimodal Large Language Models (MLLMs) have experienced significant advancements recently. Nevertheless, challenges persist in the accurate recognition and comprehension of intri...

MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining

Uploaded on Sep 26, 2025

We propose a novel model-selection method for dynamic real-life networks. Our approach involves training a classifier on a large body of synthetic network data. The data is generat...

ProMISe: Promptable Medical Image Segmentation using SAM

Uploaded on Sep 26, 2025

This paper introduces fourteen novel datasets for the evaluation of Large Language Models' safety in the context of enterprise tasks. A method was devised to evaluate a model's saf...

Proprioception Is All You Need: Terrain Classification for Boreal Forests

Uploaded on Oct 7, 2025

In Multiple Object Tracking (MOT), tracking-by-detection methods have stood the test for a long time, which split the process into two parts according to the definition: object det...

About the metric

You can click on the links to see the associated metrics

Objective(s):

Robustness
Safety
Transparency

Purpose(s):

Event/anomaly detection
Forecasting/prediction
Recognition/object detection

Target sector(s):

Agriculture
Science & technology
Public governance
Investment
Innovation
Health
Finance and insurance
Environment
Education
Digital Economy
Corporate governance
Transport
Defence

Lifecycle stage(s):

Verify & validate

Target users:

Developer

Risk management stage(s):

Assess
Treat

Modify this metric

Partnership on AI

Disclaimer: The tools and metrics featured herein are solely those of the originating authors and are not vetted or endorsed by the OECD or its member countries. The Organisation cannot be held responsible for possible issues resulting from the posting of links to third parties' tools and metrics on this catalogue. More on the methodology can be found at https://oecd.ai/catalogue/faq.