PEAT: an intelligent and efficient paired-end sequencing adapter trimming algorithm
- PMID: 25707528
- PMCID: PMC4331701
- DOI: 10.1186/1471-2105-16-S1-S2
PEAT: an intelligent and efficient paired-end sequencing adapter trimming algorithm
Abstract
Background: In modern paired-end sequencing protocols short DNA fragments lead to adapter-appended reads. Current paired-end adapter removal approaches trim adapter by scanning the fragment of adapter on the 3' end of the reads, which are not competent in some applications.
Results: Here, we propose a fast and highly accurate adapter-trimming algorithm, PEAT, designed specifically for paired-end sequencing. PEAT requires no a priori adaptor sequence, which is convenient for large-scale meta-analyses. We assessed the performance of PEAT with many adapter trimmers in both simulated and real life paired-end sequencing libraries. The importance of adapter trimming was exemplified by the influence of the downstream analyses on RNA-seq, ChIP-seq and MNase-seq. Several useful guidelines of applying adapter trimmers with aligners were suggested.
Conclusions: PEAT can be easily included in the routine paired-end sequencing pipeline. The executable binaries and the standalone C++ source code package of PEAT are freely available online.
Figures
Similar articles
-
Skewer: a fast and accurate adapter trimmer for next-generation sequencing paired-end reads.BMC Bioinformatics. 2014 Jun 12;15:182. doi: 10.1186/1471-2105-15-182. BMC Bioinformatics. 2014. PMID: 24925680 Free PMC article.
-
SeqPurge: highly-sensitive adapter trimming for paired-end NGS data.BMC Bioinformatics. 2016 May 10;17:208. doi: 10.1186/s12859-016-1069-7. BMC Bioinformatics. 2016. PMID: 27161244 Free PMC article.
-
Btrim: a fast, lightweight adapter and quality trimming program for next-generation sequencing technologies.Genomics. 2011 Aug;98(2):152-3. doi: 10.1016/j.ygeno.2011.05.009. Epub 2011 May 30. Genomics. 2011. PMID: 21651976
-
AlienTrimmer: a tool to quickly and accurately trim off multiple short contaminant sequences from high-throughput sequencing reads.Genomics. 2013 Nov-Dec;102(5-6):500-6. doi: 10.1016/j.ygeno.2013.07.011. Epub 2013 Aug 1. Genomics. 2013. PMID: 23912058
-
EARRINGS: an efficient and accurate adapter trimmer entails no a priori adapter sequences.Bioinformatics. 2021 Jul 27;37(13):1846-1852. doi: 10.1093/bioinformatics/btab025. Bioinformatics. 2021. PMID: 33459339
Cited by
-
Targeted base editing in the mitochondrial genome of Arabidopsis thaliana.Proc Natl Acad Sci U S A. 2022 May 17;119(20):e2121177119. doi: 10.1073/pnas.2121177119. Epub 2022 May 13. Proc Natl Acad Sci U S A. 2022. PMID: 35561225 Free PMC article.
-
From next-generation resequencing reads to a high-quality variant data set.Heredity (Edinb). 2017 Feb;118(2):111-124. doi: 10.1038/hdy.2016.102. Epub 2016 Oct 19. Heredity (Edinb). 2017. PMID: 27759079 Free PMC article. Review.
-
VecScreen_plus_taxonomy: imposing a tax(onomy) increase on vector contamination screening.Bioinformatics. 2018 Mar 1;34(5):755-759. doi: 10.1093/bioinformatics/btx669. Bioinformatics. 2018. PMID: 29069347 Free PMC article.
-
Isolation and characterization of microsatellite loci for Prunus mongolica (Rosaceae).Appl Plant Sci. 2018 Jun 26;6(6):e01158. doi: 10.1002/aps3.1158. eCollection 2018 Jun. Appl Plant Sci. 2018. PMID: 30131900 Free PMC article.
-
Genome-resolved metagenomics using environmental and clinical samples.Brief Bioinform. 2021 Sep 2;22(5):bbab030. doi: 10.1093/bib/bbab030. Brief Bioinform. 2021. PMID: 33758906 Free PMC article. Review.
References
-
- Paired-End Sequencing | Achieve maximum coverage across the genome. http://illumina.com
-
- FASTX-Toolkit. http://hannonlab.cshl.edu
-
- Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnetjournal. 2011;17(1):10–22.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
