This work presents a methodology for evaluating the effectiveness of hybrid modelling under varying conditions of mechanistic model quality and information available for model training, that is typically expressed as the amount of data available. While hybrid models – which integrate mechanistic and data-driven components – have gained significant attention in process systems engineering, their advantages over purely mechanistic or data-driven alternatives remain inadequately quantified. We address this gap by investigating two critical factors: (i) the impact of mechanistic model fidelity on hybrid model performance, and (ii) the influence of calibration dataset size on prediction accuracy. Our methodology is validated through an in-silico case study of baker's yeast cultivation and a real-world industrial application of ion-exchange chromatography in biopharmaceutical manufacturing. Results demonstrate that hybrid models consistently outperform purely mechanistic and data-driven approaches when the mechanistic component captures fundamental process behaviours, even with structural simplifications. Notably, hybrid models maintain superior predictive capability in extrapolative scenarios; however, when mechanistic knowledge is severely limited and insufficient information is available for compensation, hybridisation benefits diminish substantially. The work provides quantitative guidance for practitioners to determine when hybrid modelling represents a justified investment of modelling resources in process engineering applications.

On the impact of mechanistic model quality and data availability in hybrid model development

Geremia M.;Marella T.;Facco P.;Barolo M.;Bezzo F.
2026

Abstract

This work presents a methodology for evaluating the effectiveness of hybrid modelling under varying conditions of mechanistic model quality and information available for model training, that is typically expressed as the amount of data available. While hybrid models – which integrate mechanistic and data-driven components – have gained significant attention in process systems engineering, their advantages over purely mechanistic or data-driven alternatives remain inadequately quantified. We address this gap by investigating two critical factors: (i) the impact of mechanistic model fidelity on hybrid model performance, and (ii) the influence of calibration dataset size on prediction accuracy. Our methodology is validated through an in-silico case study of baker's yeast cultivation and a real-world industrial application of ion-exchange chromatography in biopharmaceutical manufacturing. Results demonstrate that hybrid models consistently outperform purely mechanistic and data-driven approaches when the mechanistic component captures fundamental process behaviours, even with structural simplifications. Notably, hybrid models maintain superior predictive capability in extrapolative scenarios; however, when mechanistic knowledge is severely limited and insufficient information is available for compensation, hybridisation benefits diminish substantially. The work provides quantitative guidance for practitioners to determine when hybrid modelling represents a justified investment of modelling resources in process engineering applications.
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11577/3572178
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex 0
social impact