The unresolved struggle of 16S rRNA amplicon sequencing: a benchmarking analysis of clustering and denoising methods

  • M Fares
  • , EK Tharwat
  • , I Cleenwerck
  • , P Monsieurs
  • , R Van Houdt
  • , P Vandamme
  • , M El-Hadidi
  • , M Mysara

Research output: Contribution to journalA1: Web of Science-articlepeer-review

Abstract

Background
Although 16S rRNA gene amplicon sequencing has become an indispensable method for microbiome studies, this analysis is not error-free, and remains prone to several biases and errors. Numerous algorithms have been developed to eliminate these errors and consolidate the output into distance-based Operational Taxonomic Units (OTUs) or denoising-based Amplicon Sequence Variants (ASVs). An objective comparison between them has been obscured by various experimental setups and parameters. In the present study, we conducted a comprehensive benchmarking analysis of the error rates, microbial composition, over-merging/over-splitting of reference sequences, and diversity analyses using the most complex mock community, comprising 227 bacterial strains and the Mockrobiota database. Using unified preprocessing steps, we were able to compare DADA2, Deblur, MED, UNOISE3, UPARSE, DGC (Distance-based Greedy Clustering), AN (Average Neighborhood), and Opticlust objectively.

Results
ASV algorithms—led by DADA2— resulted in having a consistent output, yet suffered from over-splitting, while OTU algorithms—led by UPARSE—achieved clusters with lower errors, yet with more over-merging. Notably, UPARSE and DADA2 showed the closest resemblance to the intended microbial community, especially when considering measures for alpha and beta diversity.

Conclusion
Our unbiased comparative evaluation examined the performance of eight algorithms dedicated to the analysis of 16S rRNA amplicon sequences with a wide range of mock datasets. Our analysis shed light on the pros and cons of each algorithm and the accuracy of the produced OTUs or ASVs. The utilization of the most complex mock community and the benchmarking comparison presented here offer a framework for the comparison between OTU/ASV algorithms and an objective method for the assessment of new tools and algorithms.
Original languageEnglish
Article number51
JournalEnvironmental Microbiome
Volume20
Issue number1
Number of pages14
ISSN2524-6372
DOIs
Publication statusPublished - 13-May-2025

Keywords

  • 16S rRNA amplicon sequencing
  • Amplicon sequence variants (ASVs)
  • Denoising.
  • Operational taxonomical units (OTUs)

Fingerprint

Dive into the research topics of 'The unresolved struggle of 16S rRNA amplicon sequencing: a benchmarking analysis of clustering and denoising methods'. Together they form a unique fingerprint.

Cite this