Distributed Reinforcement Learning for Flexible and Efficient UAV Swarm Control

Over the past few years, the use of swarms of Unmanned Aerial Vehicles (UAVs) in monitoring and remote area surveillance applications has become widespread thanks to the price reduction and the increased capabilities of drones. The drones in the swarm need to cooperatively explore an unknown area, in order to identify and monitor interesting targets, while minimizing their movements. In this work, we propose a distributed Reinforcement Learning (RL) approach that scales to larger swarms without modifications. The proposed framework relies on the possibility for the UAVs to exchange some information through a communication channel, in order to achieve context-awareness and implicitly coordinate the swarm's actions. Our experiments show that the proposed method can yield effective strategies, which are robust to communication channel impairments, and that can easily deal with non-uniform distributions of targets and obstacles. Moreover, when agents are trained in a specific scenario, they can adapt to a new one with minimal additional training. We also show that our approach achieves better performance compared to a computationally intensive look-ahead heuristic.

Distributed Reinforcement Learning for Flexible and Efficient UAV Swarm Control

Venturini F.;Mason F.;Pase F.;Chiariotti F.;Testolin A.;Zanella A.;Zorzi M.

2021

Abstract

Over the past few years, the use of swarms of Unmanned Aerial Vehicles (UAVs) in monitoring and remote area surveillance applications has become widespread thanks to the price reduction and the increased capabilities of drones. The drones in the swarm need to cooperatively explore an unknown area, in order to identify and monitor interesting targets, while minimizing their movements. In this work, we propose a distributed Reinforcement Learning (RL) approach that scales to larger swarms without modifications. The proposed framework relies on the possibility for the UAVs to exchange some information through a communication channel, in order to achieve context-awareness and implicitly coordinate the swarm's actions. Our experiments show that the proposed method can yield effective strategies, which are robust to communication channel impairments, and that can easily deal with non-uniform distributions of targets and obstacles. Moreover, when agents are trained in a specific scenario, they can adapt to a new one with minimal additional training. We also show that our approach achieves better performance compared to a computationally intensive look-ahead heuristic.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2021
			
	Rivista su cui è pubblicata l'opera
	
				IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING
			
	Codice DOI
	
				https://dx.doi.org/10.1109/TCCN.2021.3063170
			
	Codice WOS
	
				WOS:000693757900025
			
	Codice Scopus
	
				2-s2.0-85102309125
			
	Codice OpenAlex
	
				W3139147878
			
	Identificativo progetto
	
	Titolo Progetto
	
									Towards Intelligent Tactical Ad hoc Networks
								
	Acronimo
	
									TITAN
								
	Nome finanziatore
	
										U.S. Army Research Office (ARO)
									
	N. Contratto
	
									W911NF1910232
								
	Appare nelle tipologie:
	
				01.01 - Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
Venturini et al. 2021 - IEEE TCCN.pdf Accesso riservato Tipologia: Published (Publisher's Version of Record) Licenza: Accesso privato - non pubblico Dimensione 3.05 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	3.05 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11577/3494625

Citazioni

ND

74

56

ND

social impact