Multi-aspect permutation tests for model selection
Elena Barzizza;Nicolò Biasetton;Riccardo Ceccato
2023
Abstract
In typical machine learning frameworks, model selection is of fundamental importance: commonly, multiple models have to be trained and compared in order to identify the one with the best predictive performance. The aim of this study is to provide a new tool to improve the model selection process, allowing the user to identify the algorithm which significantly outperforms the other candidates. It proposes a robust model selection procedure based on a multi-aspect permutation test which makes it possible to detect differences in both location and variability between two paired samples of prediction errors. A new extension of the nonparametric combination (NPC) methodology is therefore introduced and is integrated with an appropriate ranking procedure in order to deal with the comparison of C ≥ 2 candidate models. A simulation study is conducted to evaluate the performance of this testing procedure in 2-sample and C-sample problems, by generating data from various well-known distributions and simulating several possible null and alternative scenarios. The adoption of the proposed technique in machine learning model selection problems is then discussed by means of multiple real data applications. These applications confirm what emerges from the simulation study: the introduced NPC-based approach performs well under several different scenarios and represents a valuable tool for robust machine learning model selection.

File | Size | Format |
---|---|---|
Expert Systems - 2023 - Barzizza - Multi‐aspect permutation tests for model selection.pdf (open access; type: Published (publisher's version); license: Creative Commons) | 4.98 MB | Adobe PDF |
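
The abstract describes a multi-aspect permutation test on two paired samples of prediction errors, with partial tests for location and variability merged through nonparametric combination. The sketch below illustrates that general idea only and is not the authors' procedure: the function name `multi_aspect_paired_test`, the two test statistics (difference in means, and difference in mean squared deviation from the pooled mean), the two-sided alternative, and the use of Fisher's combining function are all assumptions made here for illustration. The paper's new NPC extension and its ranking procedure for C ≥ 2 models are not reproduced.

```python
import numpy as np


def multi_aspect_paired_test(err_a, err_b, n_perm=1000, seed=0):
    """Paired permutation test on two aspects (location, variability),
    combined with Fisher's function. Returns (partial_p_values, global_p).

    Illustrative sketch: the statistics and combining function are assumptions.
    """
    rng = np.random.default_rng(seed)
    err_a = np.asarray(err_a, dtype=float)
    err_b = np.asarray(err_b, dtype=float)
    if err_a.ndim != 1 or err_a.shape != err_b.shape:
        raise ValueError("err_a and err_b must be 1-D arrays of equal length")
    n = err_a.size
    # The pooled mean is unchanged when values are swapped within pairs.
    center = np.concatenate([err_a, err_b]).mean()

    def aspects(a, b):
        location = abs(a.mean() - b.mean())
        variability = abs(((a - center) ** 2).mean() - ((b - center) ** 2).mean())
        return location, variability

    # Row 0 holds the observed statistics, rows 1..n_perm the permuted ones.
    stats = np.empty((n_perm + 1, 2))
    stats[0] = aspects(err_a, err_b)
    for k in range(1, n_perm + 1):
        # Under the null, the two errors of each pair are exchangeable,
        # so each pair is swapped independently with probability 1/2.
        swap = rng.random(n) < 0.5
        stats[k] = aspects(np.where(swap, err_b, err_a),
                           np.where(swap, err_a, err_b))

    # p[k, j]: share of rows whose j-th statistic is >= that of row k.
    # Each row counts itself, so every p-value is strictly positive.
    p = (stats[None, :, :] >= stats[:, None, :]).mean(axis=1)

    # Fisher's combining function, applied to the observed and permuted rows
    # alike, so the same permutations drive both aspects and their dependence
    # is preserved.
    psi = -2.0 * np.log(p).sum(axis=1)
    global_p = float((psi >= psi[0]).mean())
    return p[0], global_p


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    errors_model_a = rng.normal(0.0, 1.0, 60)  # synthetic errors, illustration only
    errors_model_b = rng.normal(0.5, 2.0, 60)
    partial, combined = multi_aspect_paired_test(errors_model_a, errors_model_b)
    print("partial p-values (location, variability):", partial)
    print("global p-value:", combined)
```

A small global p-value suggests that the two models' error distributions differ in at least one aspect, and the partial p-values hint at which. Choosing a winner among several candidates would still need a ranking step such as the one the paper integrates.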