New Physics Learning Machine (NPLM) is a machine-learning based strategy to detect data departures from a Reference hypothesis (the Standard Model), with no prior bias on the source of the discrepancy responsible for it. The main idea behind the method is to approximate the log-likelihood-ratio hypothesis test parametrizing the data distribution with a universal approximating function, and solving its maximum-likelihood fit as a machine-learning problem, with a customized loss function. The method returns a p-value that measures the compatibility of the data with the Reference model. A strategy to account for the uncertainties of the Reference hypothesis has been developed, opening up the way to the application of NPLM to new physics searches at the LHC experiments. Beside that, the most interesting potential applications of NPLM are validation of new Monte Carlo event generators and data quality monitoring. Using efficient large-scale implementations of kernel methods as universal approximators, the NPLM algorithm can be deployed on a GPU-based data acquisition system and be exploited to explore online the readout of an experimental setup. This would allow to spot out detectors malfunctioning or, possibly, unexpected anomalous patters in the data. One crucial advantage of the NPLM algorithm over standard goodness-of-fit tests routinely used in many experiments is its capability of inspect- ing multiple variables at once, taking care of correlations in the process. It also identifies the most discrepant region of the phase-space and it reconstructs the multidimensional data distribution, allowing for further inspection and interpretation of the results. The purpose of this thesis is to develop, test and deploy the NPLM strategy. After presenting its conceptual foundations and main properties, the NPLM algorithm is applied to a real new physics search and a data quality monitoring problem. For the former, we analyze the dimuon final state of proton-proton collision events collected by the CMS experiment at a center-of-mass energy of 13 TeV. For the latter, we monitor the readout of drift tubes chambers collecting cosmic muons at the INFN Legnaro national laboratory.

New Physics Learning Machine (NPLM) is a machine-learning based strategy to detect data departures from a Reference hypothesis (the Standard Model), with no prior bias on the source of the discrepancy responsible for it. The main idea behind the method is to approximate the log-likelihood-ratio hypothesis test parametrizing the data distribution with a universal approximating function, and solving its maximum-likelihood fit as a machine-learning problem, with a customized loss function. The method returns a p-value that measures the compatibility of the data with the Reference model. A strategy to account for the uncertainties of the Reference hypothesis has been developed, opening up the way to the application of NPLM to new physics searches at the LHC experiments. Beside that, the most interesting potential applications of NPLM are validation of new Monte Carlo event generators and data quality monitoring. Using efficient large-scale implementations of kernel methods as universal approximators, the NPLM algorithm can be deployed on a GPU-based data acquisition system and be exploited to explore online the readout of an experimental setup. This would allow to spot out detectors malfunctioning or, possibly, unexpected anomalous patters in the data. One crucial advantage of the NPLM algorithm over standard goodness-of-fit tests routinely used in many experiments is its capability of inspect- ing multiple variables at once, taking care of correlations in the process. It also identifies the most discrepant region of the phase-space and it reconstructs the multidimensional data distribution, allowing for further inspection and interpretation of the results. The purpose of this thesis is to develop, test and deploy the NPLM strategy. After presenting its conceptual foundations and main properties, the NPLM algorithm is applied to a real new physics search and a data quality monitoring problem. For the former, we analyze the dimuon final state of proton-proton collision events collected by the CMS experiment at a center-of-mass energy of 13 TeV. For the latter, we monitor the readout of drift tubes chambers collecting cosmic muons at the INFN Legnaro national laboratory.

Searching for unexpected New Physics at the LHC with Machine Learning / Grosso, Gaia. - (2023 May 31).

Searching for unexpected New Physics at the LHC with Machine Learning

GROSSO, GAIA
2023

Abstract

New Physics Learning Machine (NPLM) is a machine-learning based strategy to detect data departures from a Reference hypothesis (the Standard Model), with no prior bias on the source of the discrepancy responsible for it. The main idea behind the method is to approximate the log-likelihood-ratio hypothesis test parametrizing the data distribution with a universal approximating function, and solving its maximum-likelihood fit as a machine-learning problem, with a customized loss function. The method returns a p-value that measures the compatibility of the data with the Reference model. A strategy to account for the uncertainties of the Reference hypothesis has been developed, opening up the way to the application of NPLM to new physics searches at the LHC experiments. Beside that, the most interesting potential applications of NPLM are validation of new Monte Carlo event generators and data quality monitoring. Using efficient large-scale implementations of kernel methods as universal approximators, the NPLM algorithm can be deployed on a GPU-based data acquisition system and be exploited to explore online the readout of an experimental setup. This would allow to spot out detectors malfunctioning or, possibly, unexpected anomalous patters in the data. One crucial advantage of the NPLM algorithm over standard goodness-of-fit tests routinely used in many experiments is its capability of inspect- ing multiple variables at once, taking care of correlations in the process. It also identifies the most discrepant region of the phase-space and it reconstructs the multidimensional data distribution, allowing for further inspection and interpretation of the results. The purpose of this thesis is to develop, test and deploy the NPLM strategy. After presenting its conceptual foundations and main properties, the NPLM algorithm is applied to a real new physics search and a data quality monitoring problem. For the former, we analyze the dimuon final state of proton-proton collision events collected by the CMS experiment at a center-of-mass energy of 13 TeV. For the latter, we monitor the readout of drift tubes chambers collecting cosmic muons at the INFN Legnaro national laboratory.
31-mag-2023
New Physics Learning Machine (NPLM) is a machine-learning based strategy to detect data departures from a Reference hypothesis (the Standard Model), with no prior bias on the source of the discrepancy responsible for it. The main idea behind the method is to approximate the log-likelihood-ratio hypothesis test parametrizing the data distribution with a universal approximating function, and solving its maximum-likelihood fit as a machine-learning problem, with a customized loss function. The method returns a p-value that measures the compatibility of the data with the Reference model. A strategy to account for the uncertainties of the Reference hypothesis has been developed, opening up the way to the application of NPLM to new physics searches at the LHC experiments. Beside that, the most interesting potential applications of NPLM are validation of new Monte Carlo event generators and data quality monitoring. Using efficient large-scale implementations of kernel methods as universal approximators, the NPLM algorithm can be deployed on a GPU-based data acquisition system and be exploited to explore online the readout of an experimental setup. This would allow to spot out detectors malfunctioning or, possibly, unexpected anomalous patters in the data. One crucial advantage of the NPLM algorithm over standard goodness-of-fit tests routinely used in many experiments is its capability of inspect- ing multiple variables at once, taking care of correlations in the process. It also identifies the most discrepant region of the phase-space and it reconstructs the multidimensional data distribution, allowing for further inspection and interpretation of the results. The purpose of this thesis is to develop, test and deploy the NPLM strategy. After presenting its conceptual foundations and main properties, the NPLM algorithm is applied to a real new physics search and a data quality monitoring problem. For the former, we analyze the dimuon final state of proton-proton collision events collected by the CMS experiment at a center-of-mass energy of 13 TeV. For the latter, we monitor the readout of drift tubes chambers collecting cosmic muons at the INFN Legnaro national laboratory.
Searching for unexpected New Physics at the LHC with Machine Learning / Grosso, Gaia. - (2023 May 31).
File in questo prodotto:
File Dimensione Formato  
Thesis_Gaia_Grosso.pdf

Open Access dal 31/05/2024

Descrizione: Thesis_Gaia_Grosso
Tipologia: Tesi di dottorato
Licenza: Altro
Dimensione 32.73 MB
Formato Adobe PDF
32.73 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11577/3490020
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex ND
social impact