UMI-linked consensus sequencing enables phylogenetic analysis of directed evolution
- PMID: 33243970
- PMCID: PMC7691348
- DOI: 10.1038/s41467-020-19687-9
UMI-linked consensus sequencing enables phylogenetic analysis of directed evolution
Abstract
The success of protein evolution campaigns is strongly dependent on the sequence context in which mutations are introduced, stemming from pervasive non-additive interactions between a protein's amino acids ('intra-gene epistasis'). Our limited understanding of such epistasis hinders the correct prediction of the functional contributions and adaptive potential of mutations. Here we present a straightforward unique molecular identifier (UMI)-linked consensus sequencing workflow (UMIC-seq) that simplifies mapping of evolutionary trajectories based on full-length sequences. Attaching UMIs to gene variants allows accurate consensus generation for closely related genes with nanopore sequencing. We exemplify the utility of this approach by reconstructing the artificial phylogeny emerging in three rounds of directed evolution of an amine dehydrogenase biocatalyst via ultrahigh throughput droplet screening. Uniquely, we are able to identify lineages and their founding variant, as well as non-additive interactions between mutations within a full gene showing sign epistasis. Access to deep and accurate long reads will facilitate prediction of key beneficial mutations and adaptive potential based on in silico analysis of large sequence datasets.
Conflict of interest statement
The authors declare no competing interests.
Figures
Similar articles
-
Gencore: an efficient tool to generate consensus reads for error suppressing and duplicate removing of NGS data.BMC Bioinformatics. 2019 Dec 27;20(Suppl 23):606. doi: 10.1186/s12859-019-3280-9. BMC Bioinformatics. 2019. PMID: 31881822 Free PMC article.
-
Speeding up enzyme discovery and engineering with ultrahigh-throughput methods.Curr Opin Struct Biol. 2018 Feb;48:149-156. doi: 10.1016/j.sbi.2017.12.010. Epub 2018 Feb 3. Curr Opin Struct Biol. 2018. PMID: 29413955 Review.
-
On synergy between ultrahigh throughput screening and machine learning in biocatalyst engineering.Faraday Discuss. 2024 Sep 11;252(0):89-114. doi: 10.1039/d4fd00065j. Faraday Discuss. 2024. PMID: 39133073 Free PMC article. Review.
-
Exploiting models of molecular evolution to efficiently direct protein engineering.J Mol Evol. 2011 Feb;72(2):193-203. doi: 10.1007/s00239-010-9415-2. Epub 2010 Dec 4. J Mol Evol. 2011. PMID: 21132281 Free PMC article.
-
How mutational epistasis impairs predictability in protein evolution and design.Protein Sci. 2016 Jul;25(7):1260-72. doi: 10.1002/pro.2876. Epub 2016 Jan 22. Protein Sci. 2016. PMID: 26757214 Free PMC article.
Cited by
-
Deep mutational scanning: A versatile tool in systematically mapping genotypes to phenotypes.Front Genet. 2023 Jan 12;14:1087267. doi: 10.3389/fgene.2023.1087267. eCollection 2023. Front Genet. 2023. PMID: 36713072 Free PMC article. Review.
-
Ultra-High-Throughput Absorbance-Activated Droplet Sorting for Enzyme Screening at Kilohertz Frequencies.Anal Chem. 2023 Mar 14;95(10):4597-4604. doi: 10.1021/acs.analchem.2c04144. Epub 2023 Feb 27. Anal Chem. 2023. PMID: 36848587 Free PMC article.
-
Ultrahigh Throughput Evolution of Tryptophan Synthase in Droplets via an Aptamer Sensor.ACS Catal. 2024 Apr 10;14(8):6259-6271. doi: 10.1021/acscatal.4c00230. eCollection 2024 Apr 19. ACS Catal. 2024. PMID: 38660603 Free PMC article.
-
Nanopore sequencing with unique molecular identifiers enables accurate mutation analysis and haplotyping in the complex lipoprotein(a) KIV-2 VNTR.Genome Med. 2024 Oct 8;16(1):117. doi: 10.1186/s13073-024-01391-8. Genome Med. 2024. PMID: 39380090 Free PMC article.
-
Recent trends in biocatalysis.Chem Soc Rev. 2021 Jul 21;50(14):8003-8049. doi: 10.1039/d0cs01575j. Epub 2021 Jun 18. Chem Soc Rev. 2021. PMID: 34142684 Free PMC article. Review.
References
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
