Metagenomics is the study of heterogeneous microbial samples extracted directly from their natural environment, e.g., from soil, water, or the human body. The detection and quantification of species that populate microbial communities have been the subject of many recent studies based on classification and clustering, motivated by being the first step in more complex pipelines (e.g. for functional analysis, de-novo assembly or comparison of metagenomes). In this paper we explore the idea of improving the overall quality of metagenomics binning at reads-level by proposing a framework that sequentially combine two complementary read binning approaches: one based on species abundances determination and another one relying on reads overlap in order to cluster reads together. Our preliminary results show that the combination of the two tools can lead to the improvement of the clustering quality in realistic conditions where the number of species is not known beforehand.

On Multi-phase Metagenomics Reads Binning

Cinzia Pizzi
2025

Abstract

Metagenomics is the study of heterogeneous microbial samples extracted directly from their natural environment, e.g., from soil, water, or the human body. The detection and quantification of species that populate microbial communities have been the subject of many recent studies based on classification and clustering, motivated by being the first step in more complex pipelines (e.g. for functional analysis, de-novo assembly or comparison of metagenomes). In this paper we explore the idea of improving the overall quality of metagenomics binning at reads-level by proposing a framework that sequentially combine two complementary read binning approaches: one based on species abundances determination and another one relying on reads overlap in order to cluster reads together. Our preliminary results show that the combination of the two tools can lead to the improvement of the clustering quality in realistic conditions where the number of species is not known beforehand.
2025
Lecture Notes in Computer Science/Lecture Notes in Bioinformatics (LNBI)
The 12th International Conference on Computational Advances in Bio and Medical Sciences - ICCABS
978-3-031-82768-6
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11577/3506342
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex 0
social impact