
Tree Tensor Network implemented on FPGA as ultra-low latency binary classifiers

Borella, Lorenzo; Coppi, Alberto; Pazzini, Jacopo; Stanco, Andrea; Triossi, Andrea; Zanetti, Marco
2025

Abstract

Tensor Networks (TNs) are a computational framework traditionally used to model quantum many-body systems. Recent research has demonstrated that TNs can also be effectively applied to Machine Learning (ML) tasks, producing results comparable to conventional supervised learning methods. In this work, we investigate the use of Tree Tensor Networks (TTNs) for high-frequency real-time applications by harnessing the low-latency capabilities of Field-Programmable Gate Arrays (FPGAs). We present the implementation of TTN classifiers on FPGA hardware, optimized for performing inference on classical ML benchmarking datasets. Various degrees of parallelization are explored to evaluate the trade-offs between resource utilization and algorithm latency. By deploying these TTNs on a hardware accelerator and utilizing an FPGA integrated into a server, we fully offload the TTN inference process, demonstrating the system’s viability for real-time ML applications.
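As a rough illustration of the inference step the abstract describes, the sketch below contracts a balanced binary TTN over a feature-mapped input to produce two class scores. The local feature map, bond dimension, tree layout, and random weights are illustrative assumptions for a software-level sketch in NumPy; the paper's actual implementation maps the equivalent contractions onto FPGA logic, which this code does not attempt to model.

    import numpy as np

    def feature_map(x):
        # Map each scalar feature in [0, 1] to a 2-dimensional local vector
        # (a common choice in TN-based ML; the paper may use a different map).
        return np.stack([np.cos(np.pi * x / 2), np.sin(np.pi * x / 2)], axis=-1)

    def ttn_binary_classifier(x, layers, top):
        """Contract a balanced binary TTN over the feature-mapped input.

        x      : 1D array of N input features, N a power of two
        layers : list of layers; each layer is a list of rank-3 arrays
                 of shape (d_left, d_right, d_out)
        top    : rank-3 array (d_left, d_right, 2) whose open leg
                 carries the two class scores
        """
        vecs = [v for v in feature_map(x)]          # leaf vectors, one per feature
        for layer in layers:                        # contract pairs of vectors up the tree
            vecs = [np.einsum('i,j,ijk->k', vecs[2 * n], vecs[2 * n + 1], T)
                    for n, T in enumerate(layer)]
        left, right = vecs
        scores = np.einsum('i,j,ijc->c', left, right, top)
        return int(np.argmax(scores))               # predicted class (0 or 1)

    # Toy example: 8 features, bond dimension 4, random (untrained) weights.
    rng = np.random.default_rng(0)
    d = 4
    layer0 = [rng.standard_normal((2, 2, d)) for _ in range(4)]
    layer1 = [rng.standard_normal((d, d, d)) for _ in range(2)]
    top = rng.standard_normal((d, d, 2))
    print(ttn_binary_classifier(rng.random(8), [layer0, layer1], top))

Each of these contractions reduces to multiply-accumulate operations, which on an FPGA can be unrolled to varying degrees; the amount of unrolling is the resource-versus-latency trade-off the abstract refers to.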
42nd International Conference on High Energy Physics (ICHEP2024) - Computing and Data Handling
Files in this record:

ICHEP2024_1004.pdf
Access: open access
Type: Published (Publisher's Version of Record)
License: Creative Commons
Size: 366.83 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11577/3555532