Comparison of different sequencing and assembly strategies for a repeat-rich fungal genome, Ophiocordyceps sinensis
- PMID: 27343682
- DOI: 10.1016/j.mimet.2016.06.025
Abstract
Ophiocordyceps sinensis is one of the most expensive medicinal fungi worldwide and has been used in traditional Chinese medicine for centuries. In a recent report, the genome of this fungus was found to be expanded by extensive repetitive elements after assembly of Roche 454 (223 Mb) and Illumina HiSeq (10.6 Gb) sequencing data, producing a genome of 87.7 Mb with a scaffold N50 of 12 kb and 6972 predicted genes. To test whether the assembly could be improved by deeper sequencing, and to assess the amount of data needed for optimal assembly, genomic DNA extracted from a single-ascospore isolate (strain 1229) was sequenced several times on an Illumina HiSeq platform (25 Gb of data in total). Assemblies were produced using different data types (raw vs. trimmed) and data amounts, and using three freely available assembly programs (ABySS, SOAP and Velvet). In nearly all cases, trimming the data for low-quality base calls did not yield assemblies with higher N50 values than the non-trimmed data, and increasing the amount of input data (i.e. sequence reads) did not always lead to higher N50 values. Depending on the assembly program and data type, the maximal N50 was reached with between 50% and 90% of the total read data, equivalent to 100× to 200× coverage. The draft genome assembly was improved over the previously published version, resulting in a 114 Mb assembly with a scaffold N50 of 70 kb and 9610 predicted genes. Among the predicted genes, 9213 were validated by RNA-Seq analysis in this study, of which 8896 were found to be singletons. Evidence from genome and transcriptome analyses indicated that species assemblies could be improved with defined input material (e.g. a haploid mono-ascospore isolate) without the requirement of multiple sequencing technologies, multiple library sizes or data trimming for low-quality base calls, and with genome coverages between 100× and 200×.
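The two metrics the abstract relies on, N50 and fold coverage, can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function names `n50` and `coverage` are hypothetical, N50 is computed with the common definition (the length at which the cumulative sum of lengths, sorted from longest to shortest, first reaches half the assembly), and the coverage figures assume the 114 Mb assembly size stands in for the genome size, which the abstract does not state explicitly.

```python
def n50(lengths):
    """Return the N50 of a list of scaffold (or contig) lengths.

    Common definition: sort lengths from longest to shortest and return the
    length at which the running total first reaches half of the total size.
    """
    if not lengths:
        raise ValueError("n50() needs at least one sequence length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


def coverage(total_bases, genome_size):
    """Return fold coverage: sequenced bases divided by genome size."""
    return total_bases / genome_size


if __name__ == "__main__":
    # Toy scaffold lengths, purely illustrative (not data from the study).
    print(n50([10, 8, 6, 4, 2]))  # 8

    # Figures from the abstract: 25 Gb of reads, 114 Mb assembly.
    # Assumption: the assembly size approximates the genome size.
    total_bases = 25e9
    genome_size = 114e6
    for fraction in (0.5, 0.9, 1.0):
        fold = coverage(fraction * total_bases, genome_size)
        print(f"{fraction:.0%} of reads: about {fold:.0f}x coverage")
```

Under that assumption, 50% and 90% of the reads correspond to roughly 110× and 197×, which is consistent with the 100× to 200× range reported above.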
Keywords: Assembly; Coverage; Genome; RNA-Seq; Sequencing; Trimming.
Copyright © 2016 Elsevier B.V. All rights reserved.
