The production of biopeptides from food waste through microbial fermentation faces challenges arising from the diverse proteolytic abilities of microorganisms and substrate variability, impacting both the quality and yield of generated biopeptides. To address these challenges, preliminary in-silico bioinformatics analyses play a crucial role in evaluating suitable substrates and proteases for the fermentation process. However, existing tools lack comprehensive predictive capabilities for relevant proteases, substrate performance assessment, and final biopeptide family characterization. To overcome these limitations, we developed FEEDS (Food wastE biopEptiDe claSsifier), a novel biopeptide prediction and classification tool. FEEDS predicts biopeptide families based on microbial genome protease profiles and substrate composition during proteolysis. The tool also employs a machine learning approach for functional biopeptide classification. Results from testing on 1000 microbial genomes demonstrate the effectiveness of biopeptide classification, particularly in categorizing peptides derived from substrates like Hordeum vulgare and Vitis vinifera seed storage proteins. In addition to biopeptide classification, our study delves into the distinctive protease profiles of bacteria and yeast genomes. Bacterial genomes exhibited 60 to 100 proteases across 40–55 families. Contrastingly, yeast genomes displayed a more evenly distributed pattern with 150 to 160 protease-encoding genes across 60 to 67 families, surpassing bacterial counts. Remarkably, a substantial portion of yeast proteases (∼66 %) was secreted. Moreover, our integration of a machine learning methodology within the FEEDS pipeline proved highly effective, achieving over 80 % accuracy in predicting the function of peptides derived from seed storage proteins. Notably, longer peptide sequences exceeding 20 amino acids consistently displayed a higher probability of correct assignment compared to shorter counterparts.

FEEDS, the Food wastE biopEptiDe claSsifier: From microbial genomes and substrates to biopeptides function

Bizzotto E.;Zampieri G.;Campanaro S.
2024

Abstract

The production of biopeptides from food waste through microbial fermentation faces challenges arising from the diverse proteolytic abilities of microorganisms and substrate variability, impacting both the quality and yield of generated biopeptides. To address these challenges, preliminary in-silico bioinformatics analyses play a crucial role in evaluating suitable substrates and proteases for the fermentation process. However, existing tools lack comprehensive predictive capabilities for relevant proteases, substrate performance assessment, and final biopeptide family characterization. To overcome these limitations, we developed FEEDS (Food wastE biopEptiDe claSsifier), a novel biopeptide prediction and classification tool. FEEDS predicts biopeptide families based on microbial genome protease profiles and substrate composition during proteolysis. The tool also employs a machine learning approach for functional biopeptide classification. Results from testing on 1000 microbial genomes demonstrate the effectiveness of biopeptide classification, particularly in categorizing peptides derived from substrates like Hordeum vulgare and Vitis vinifera seed storage proteins. In addition to biopeptide classification, our study delves into the distinctive protease profiles of bacteria and yeast genomes. Bacterial genomes exhibited 60 to 100 proteases across 40–55 families. Contrastingly, yeast genomes displayed a more evenly distributed pattern with 150 to 160 protease-encoding genes across 60 to 67 families, surpassing bacterial counts. Remarkably, a substantial portion of yeast proteases (∼66 %) was secreted. Moreover, our integration of a machine learning methodology within the FEEDS pipeline proved highly effective, achieving over 80 % accuracy in predicting the function of peptides derived from seed storage proteins. Notably, longer peptide sequences exceeding 20 amino acids consistently displayed a higher probability of correct assignment compared to shorter counterparts.
File in questo prodotto:
File Dimensione Formato  
2024_Centurion_FEEDS the Food wastE biopEptiDe claSsifier from microbial genomes and substrates to biopeptides function_CurrResBiotech.pdf

accesso aperto

Tipologia: Published (publisher's version)
Licenza: Creative commons
Dimensione 1.81 MB
Formato Adobe PDF
1.81 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11577/3517162
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
  • OpenAlex ND
social impact