In modern manufacturing settings, Human-Robot collaboration (HRC) plays a pivotal role in enhancing efficiency, flexibility, and operator well-being. However, existing task allocation algorithms for HRC rely on simplified assumptions and static or heuristic-based approaches. In this paper, we present a value-based Reinforcement Learning (RL) framework for dynamic task scheduling in a collaborative assembly scenario, capable of learning to allocate tasks between human operator and robot in real time, with the goal of minimizing the total tasks completion time. Our presented environment, which can be generalized to other different settings, incorporates both deterministic robot labor and stochastic human behavior. The RL algorithm demonstrates the ability to adaptively manage task assignments under uncertainty, learning efficient policies that generalize across the human variability. This work offers a flexible and scalable foundation for intelligent scheduling in HRC systems and paves the way for future integration of operator-specific parameters and cognitive workload indicators.

Value-Based Reinforcement Learning for Task Scheduling in Human-Robot Applications

A. Dalla Libera;I. Granata;R. Minto;M. Calzavara;M. Faccio;G. A. Susto
2025

Abstract

In modern manufacturing settings, Human-Robot collaboration (HRC) plays a pivotal role in enhancing efficiency, flexibility, and operator well-being. However, existing task allocation algorithms for HRC rely on simplified assumptions and static or heuristic-based approaches. In this paper, we present a value-based Reinforcement Learning (RL) framework for dynamic task scheduling in a collaborative assembly scenario, capable of learning to allocate tasks between human operator and robot in real time, with the goal of minimizing the total tasks completion time. Our presented environment, which can be generalized to other different settings, incorporates both deterministic robot labor and stochastic human behavior. The RL algorithm demonstrates the ability to adaptively manage task assignments under uncertainty, learning efficient policies that generalize across the human variability. This work offers a flexible and scalable foundation for intelligent scheduling in HRC systems and paves the way for future integration of operator-specific parameters and cognitive workload indicators.
2025
Proceedings of the 7th IFAC Conference on Intelligent Control and Automation Sciences 2025 (ICONS 2025)
7th IFAC Conference on Intelligent Control and Automation Sciences 2025 (ICONS 2025)
File in questo prodotto:
File Dimensione Formato  
1-s2.0-S2405896325026990-main.pdf

accesso aperto

Tipologia: Published (Publisher's Version of Record)
Licenza: Creative commons
Dimensione 895.63 kB
Formato Adobe PDF
895.63 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11577/3562612
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex ND
social impact