Comprehensive benchmark and architectural analysis of deep learning models for nanopore sequencing basecalling

Marc Pagès-Gallego; Jeroen de Ridder

doi:10.1186/s13059-023-02903-2

Comprehensive benchmark and architectural analysis of deep learning models for nanopore sequencing basecalling

Genome Biol. 2023 Apr 11;24(1):71. doi: 10.1186/s13059-023-02903-2.

Authors

Marc Pagès-Gallego^{1

2}, Jeroen de Ridder^{3

4}

Affiliations

¹ Center for Molecular Medicine, University Medical Center Utrecht, Universiteitsweg 100, 3584 CG, Utrecht, The Netherlands.
² Oncode Institute, Utrecht, The Netherlands.
³ Center for Molecular Medicine, University Medical Center Utrecht, Universiteitsweg 100, 3584 CG, Utrecht, The Netherlands. j.deridder-4@umcutrecht.nl.
⁴ Oncode Institute, Utrecht, The Netherlands. j.deridder-4@umcutrecht.nl.

Abstract

Background: Nanopore-based DNA sequencing relies on basecalling the electric current signal. Basecalling requires neural networks to achieve competitive accuracies. To improve sequencing accuracy further, new models are continuously proposed with new architectures. However, benchmarking is currently not standardized, and evaluation metrics and datasets used are defined on a per publication basis, impeding progress in the field. This makes it impossible to distinguish data from model driven improvements.

Results: To standardize the process of benchmarking, we unified existing benchmarking datasets and defined a rigorous set of evaluation metrics. We benchmarked the latest seven basecaller models by recreating and analyzing their neural network architectures. Our results show that overall Bonito's architecture is the best for basecalling. We find, however, that species bias in training can have a large impact on performance. Our comprehensive evaluation of 90 novel architectures demonstrates that different models excel at reducing different types of errors and using recurrent neural networks (long short-term memory) and a conditional random field decoder are the main drivers of high performing models.

Conclusions: We believe that our work can facilitate the benchmarking of new basecaller tools and that the community can further expand on this work.

Keywords: Basecalling; Benchmark; Deep learning; Nanopore.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Benchmarking
Deep Learning*
Nanopore Sequencing* / methods
Neural Networks, Computer
Sequence Analysis, DNA / methods