Developing genomic resources in two Linum species via 454 pyrosequencing and genomic reduction

Mol Ecol Resour. 2012 May;12(3):492-500. doi: 10.1111/j.1755-0998.2011.03100.x. Epub 2011 Dec 18.

Abstract

Recent advances in next-generation DNA sequencing (NGS) have enhanced the development of genomic resources such as contigs or single-nucleotide polymorphisms (SNPs) for evolutionary studies of a nonmodel species with a complex and unsequenced genome. This study presents an application of a NGS technique in combination with genomic reduction and advanced bioinformatics tools to identify contigs and SNPs from multiple samples of two Linum species. A full Roche 454 GS FLX run of 16 diverse Linum samples representing cultivated flax (Linum usitatissimum L.) and its wild progenitor (Linum bienne Mill.) generated approximately 1.6 million sequence reads with a total length of 498 Mbp. Application of the computational pipeline de novo identification of alleles identified 713 contigs and 1067 SNPs. A blast search revealed alignments of all 713 contigs with 491 existing Linum scaffolds and gene annotations associated with 512 contigs. Sanger sequencing confirmed 95% of 79 selected contigs and 94% of 272 SNPs and identified 211 new SNPs and 19 new indels. The scored 454 SNP data were highly imbalanced for assayed samples. These findings not only are useful for evolutionary studies of Linum species but also help to illustrate the utility of NGS technologies in SNP discovery for nonmodel organisms.

MeSH terms

  • Classification / methods*
  • Computational Biology / methods
  • DNA, Plant / chemistry
  • DNA, Plant / genetics
  • Evolution, Molecular
  • Flax / classification*
  • Flax / genetics*
  • Genotype
  • Sequence Analysis, DNA / methods*

Substances

  • DNA, Plant