Comprehensive functional analyses of expressed sequence tags in common wheat (Triticum aestivum)

DNA Res. 2012 Apr;19(2):165-77. doi: 10.1093/dnares/dss001. Epub 2012 Feb 14.

Abstract

About 1 million expressed sequence tag (EST) sequences comprising 125.3 Mb nucleotides were accreted from 51 cDNA libraries constructed from a variety of tissues and organs under a range of conditions, including abiotic stresses and pathogen challenges in common wheat (Triticum aestivum). Expressed sequence tags were assembled with stringent parameters after processing with inbuild scripts, resulting in 37,138 contigs and 215,199 singlets. In the assembled sequences, 10.6% presented no matches with existing sequences in public databases. Functional characterization of wheat unigenes by gene ontology annotation, mining transcription factors, full-length cDNA, and miRNA targeting sites were carried out. A bioinformatics strategy was developed to discover single-nucleotide polymorphisms (SNPs) within our large EST resource and reported the SNPs between and within (homoeologous) cultivars. Digital gene expression was performed to find the tissue-specific gene expression, and correspondence analysis was executed to identify common and specific gene expression by selecting four biotic stress-related libraries. The assembly and associated information cater a framework for future investigation in functional genomics.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computational Biology / methods
  • DNA, Complementary / isolation & purification
  • Databases, Genetic
  • Expressed Sequence Tags*
  • Gene Expression Profiling / methods*
  • Gene Expression Regulation, Plant
  • Gene Library
  • Genes, Plant*
  • Molecular Sequence Annotation
  • Polymorphism, Single Nucleotide
  • Sequence Analysis, DNA / methods
  • Triticum / genetics*
  • Triticum / growth & development

Substances

  • DNA, Complementary