Searching for unexpected New Physics at the LHC with Machine Learning

Grosso, Gaia

New Physics Learning Machine (NPLM) is a machine-learning based strategy to detect data departures from a Reference hypothesis (the Standard Model), with no prior bias on the source of the discrepancy responsible for it. The main idea behind the method is to approximate the log-likelihood-ratio hypothesis test parametrizing the data distribution with a universal approximating function, and solving its maximum-likelihood fit as a machine-learning problem, with a customized loss function. The method returns a p-value that measures the compatibility of the data with the Reference model. A strategy to account for the uncertainties of the Reference hypothesis has been developed, opening up the way to the application of NPLM to new physics searches at the LHC experiments. Beside that, the most interesting potential applications of NPLM are validation of new Monte Carlo event generators and data quality monitoring. Using efficient large-scale implementations of kernel methods as universal approximators, the NPLM algorithm can be deployed on a GPU-based data acquisition system and be exploited to explore online the readout of an experimental setup. This would allow to spot out detectors malfunctioning or, possibly, unexpected anomalous patters in the data. One crucial advantage of the NPLM algorithm over standard goodness-of-fit tests routinely used in many experiments is its capability of inspect- ing multiple variables at once, taking care of correlations in the process. It also identifies the most discrepant region of the phase-space and it reconstructs the multidimensional data distribution, allowing for further inspection and interpretation of the results. The purpose of this thesis is to develop, test and deploy the NPLM strategy. After presenting its conceptual foundations and main properties, the NPLM algorithm is applied to a real new physics search and a data quality monitoring problem. For the former, we analyze the dimuon final state of proton-proton collision events collected by the CMS experiment at a center-of-mass energy of 13 TeV. For the latter, we monitor the readout of drift tubes chambers collecting cosmic muons at the INFN Legnaro national laboratory.

Searching for unexpected New Physics at the LHC with Machine Learning / Grosso, Gaia. - (2023 May 31).

Searching for unexpected New Physics at the LHC with Machine Learning

GROSSO, GAIA

2023

Abstract

New Physics Learning Machine (NPLM) is a machine-learning based strategy to detect data departures from a Reference hypothesis (the Standard Model), with no prior bias on the source of the discrepancy responsible for it. The main idea behind the method is to approximate the log-likelihood-ratio hypothesis test parametrizing the data distribution with a universal approximating function, and solving its maximum-likelihood fit as a machine-learning problem, with a customized loss function. The method returns a p-value that measures the compatibility of the data with the Reference model. A strategy to account for the uncertainties of the Reference hypothesis has been developed, opening up the way to the application of NPLM to new physics searches at the LHC experiments. Beside that, the most interesting potential applications of NPLM are validation of new Monte Carlo event generators and data quality monitoring. Using efficient large-scale implementations of kernel methods as universal approximators, the NPLM algorithm can be deployed on a GPU-based data acquisition system and be exploited to explore online the readout of an experimental setup. This would allow to spot out detectors malfunctioning or, possibly, unexpected anomalous patters in the data. One crucial advantage of the NPLM algorithm over standard goodness-of-fit tests routinely used in many experiments is its capability of inspect- ing multiple variables at once, taking care of correlations in the process. It also identifies the most discrepant region of the phase-space and it reconstructs the multidimensional data distribution, allowing for further inspection and interpretation of the results. The purpose of this thesis is to develop, test and deploy the NPLM strategy. After presenting its conceptual foundations and main properties, the NPLM algorithm is applied to a real new physics search and a data quality monitoring problem. For the former, we analyze the dimuon final state of proton-proton collision events collected by the CMS experiment at a center-of-mass energy of 13 TeV. For the latter, we monitor the readout of drift tubes chambers collecting cosmic muons at the INFN Legnaro national laboratory.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di discussione
	
				31-mag-2023
			
	Abstract
	
				New Physics Learning Machine (NPLM) is a machine-learning based strategy to detect data departures from a Reference hypothesis (the Standard Model), with no prior bias on the source of the discrepancy responsible for it. The main idea behind the method is to approximate the log-likelihood-ratio hypothesis test parametrizing the data distribution with a universal approximating function, and solving its maximum-likelihood fit as a machine-learning problem, with a customized loss function. The method returns a p-value that measures the compatibility of the data with the Reference model. A strategy to account for the uncertainties of the Reference hypothesis has been developed, opening up the way to the application of NPLM to new physics searches at the LHC experiments. Beside that, the most interesting potential applications of NPLM are validation of new Monte Carlo event generators and data quality monitoring. Using efficient large-scale implementations of kernel methods as universal approximators, the NPLM algorithm can be deployed on a GPU-based data acquisition system and be exploited to explore online the readout of an experimental setup. This would allow to spot out detectors malfunctioning or, possibly, unexpected anomalous patters in the data. One crucial advantage of the NPLM algorithm over standard goodness-of-fit tests routinely used in many experiments is its capability of inspect- ing multiple variables at once, taking care of correlations in the process. It also identifies the most discrepant region of the phase-space and it reconstructs the multidimensional data distribution, allowing for further inspection and interpretation of the results. The purpose of this thesis is to develop, test and deploy the NPLM strategy. After presenting its conceptual foundations and main properties, the NPLM algorithm is applied to a real new physics search and a data quality monitoring problem. For the former, we analyze the dimuon final state of proton-proton collision events collected by the CMS experiment at a center-of-mass energy of 13 TeV. For the latter, we monitor the readout of drift tubes chambers collecting cosmic muons at the INFN Legnaro national laboratory.
			
	Citazione
	
				Searching for unexpected New Physics at the LHC with Machine Learning / Grosso, Gaia. - (2023 May 31).
			
	Appare nelle tipologie:
	
				08.01 - Tesi di Dottorato UNIPD (Deposito Legale)

File in questo prodotto:

File	Dimensione	Formato
Thesis_Gaia_Grosso.pdf Open Access dal 31/05/2024 Descrizione: Thesis_Gaia_Grosso Tipologia: Tesi di dottorato Licenza: Altro Dimensione 32.73 MB Formato Adobe PDF Visualizza/Apri	32.73 MB	Adobe PDF	Visualizza/Apri