Abstract
Addressing bias and unfairness in machine learning models across application domains is a multifaceted challenge. Despite the variety of fairness metrics available, identifying an optimal set for evaluating a model's fairness remains an open question, owing to the diverse nature of these metrics and the lack of a comprehensive approach to ensuring fairness across applications. This study proposes a method for selecting the most representative metrics for post-processing bias and fairness assessment of machine learning models in different contexts. We examine a correlation-based strategy as a heuristic for fairness metric selection, applying bootstrap sampling with the Markov chain Monte Carlo technique and introducing three improvements: stratified sampling, a stopping criterion, and Kendall correlation, which address biased data representation, computational cost, and robustness, respectively. The method achieved an average reduction of 64.37% in the number of models and of 20.00% in processing time. Moreover, it effectively paired metrics with similar behaviour, highlighting the presence of a shared term as a strong indicator of a direct relationship. While no single metric stands out across all contexts, certain metrics consistently stand out within specific models or datasets. In a complex scenario using a large language model for sexism detection, the method achieved a 71.93% reduction in execution time while forming more comprehensive metric groups. Overall, the proposed method selects representative metrics at a considerably lower computational cost, demonstrating its practicality for real-world applications.
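As a rough illustration of the correlation-based grouping idea summarised above, the sketch below pairs fairness metrics whose bootstrap estimates move together under Kendall correlation and keeps one representative per group. The metric names, the 0.8 threshold, and the greedy grouping rule are illustrative assumptions, not the paper's exact procedure.

```python
import random

def kendall_tau(x, y):
    """Kendall tau-a: (concordant - discordant) / number of pairs.
    Ties are ignored, which is adequate for continuous metric values."""
    n = len(x)
    concordant = discordant = 0
    for i in range(n):
        for j in range(i + 1, n):
            s = (x[i] - x[j]) * (y[i] - y[j])
            if s > 0:
                concordant += 1
            elif s < 0:
                discordant += 1
    return (concordant - discordant) / (n * (n - 1) / 2)

def group_metrics(samples, threshold=0.8):
    """Greedily merge metrics whose |tau| against a group's first
    member (its representative) meets the threshold."""
    groups = []
    for name, values in samples.items():
        for g in groups:
            if abs(kendall_tau(samples[g[0]], values)) >= threshold:
                g.append(name)
                break
        else:
            groups.append([name])
    return groups

# Toy bootstrap estimates: 50 resamples per metric (names are illustrative).
random.seed(0)
base = [random.random() for _ in range(50)]
samples = {
    "statistical_parity": base,
    "disparate_impact": [2 * v + 0.1 for v in base],  # monotone in base -> tau = 1
    "equalized_odds": [random.random() for _ in range(50)],  # unrelated noise
}
groups = group_metrics(samples)
print(groups)  # the two monotonically related metrics share a group
```

The first member of each group serves as its representative, so strongly correlated metrics need not all be computed in later evaluations.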