Unveiling Trustworthy AI Challenges: Characterizing Prediction Reliability

Ensuring trust in machine learning (ML) models has become increasingly relevant in recent times, especially in sensitive areas such as healthcare. To this end, this study proposes a methodological framework to characterize prediction reliability while training ML models. The framework relies on bootstrap to compute a metric based on the rank difference between true and predicted event risks. This structure allows for the stratification of a population of patients into different groups according to the reliability of their predictions expressed as a function of the number of variables considered in input by the model. Finally, the characteristics of the groups identified from the previous step are inspected from two different perspectives: the model perspective and the variable perspective. The first analysis utilizes Shapley values to inspect how the model relies on the input variables to perform a prediction of patients assigned to different groups. Instead, the latter investigates differences in variable distributions to ensure that different groups do not represent different populations. To showcase the potential of this approach, a case study on the development and prediction reliability characterization of models to predict death due to amyotrophic lateral sclerosis is included in this paper.

Unveiling Trustworthy AI Challenges: Characterizing Prediction Reliability

Trescato, Isotta;Guazzo, Alessandro;Longato, Enrico;Tavazzi, Erica;Vettoretti, Martina;Manera, Umberto;Chiò, Adriano;Gromicho, Marta;Alves, Inês;De Carvalho, Mamede;Di Camillo, Barbara

2024

Abstract

Ensuring trust in machine learning (ML) models has become increasingly relevant in recent times, especially in sensitive areas such as healthcare. To this end, this study proposes a methodological framework to characterize prediction reliability while training ML models. The framework relies on bootstrap to compute a metric based on the rank difference between true and predicted event risks. This structure allows for the stratification of a population of patients into different groups according to the reliability of their predictions expressed as a function of the number of variables considered in input by the model. Finally, the characteristics of the groups identified from the previous step are inspected from two different perspectives: the model perspective and the variable perspective. The first analysis utilizes Shapley values to inspect how the model relies on the input variables to perform a prediction of patients assigned to different groups. Instead, the latter investigates differences in variable distributions to ensure that different groups do not represent different populations. To showcase the potential of this approach, a case study on the development and prediction reliability characterization of models to predict death due to amyotrophic lateral sclerosis is included in this paper.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2024
			
	Titolo del Libro
	
				2024 IEEE 8th Forum on Research and Technologies for Society and Industry Innovation (RTSI)
			
	Titolo convegno
	
				2024 IEEE 8th Forum on Research and Technologies for Society and Industry Innovation (RTSI)
			
	Codice DOI
	
				https://dx.doi.org/10.1109/rtsi61910.2024.10761442
			
	Codice WOS
	
				WOS:001540356900073
			
	Codice Scopus
	
				2-s2.0-85213812231
			
	Codice OpenAlex
	
				W4404740417
			
	Codice ISBN
	
				979-8-3503-6213-8
			
	Identificativo progetto
	
	Titolo Progetto
	
									BRinging Artificial INTelligencE home for a better cAre of amyotrophic lateral sclerosis and multiple SclERosis
								
	Acronimo
	
									BRAINTEASER
								
	Nome finanziatore
	
										European Commission
									
	Finanziamento
	
									Horizon 2020 Framework Programme
								
	N. Contratto
	
									101017598
								
	Appare nelle tipologie:
	
				04.01 - Contributo in atti di convegno

File in questo prodotto:

File	Dimensione	Formato
Unveiling_Trustworthy_AI_Challenges_Characterizing_Prediction_Reliability.pdf Accesso riservato Tipologia: Published (Publisher's Version of Record) Licenza: Accesso privato - non pubblico Dimensione 1.67 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	1.67 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11577/3540492

Citazioni

ND

0

0

0

social impact