SSRs and INDELs mined from the sunflower EST database: abundance, polymorphisms, and cross-taxa utility

Theor Appl Genet. 2008 Nov;117(7):1021-9. doi: 10.1007/s00122-008-0841-0. Epub 2008 Jul 17.


Simple sequence repeats (SSRs) are abundant and frequently highly polymorphic in transcribed sequences and widely targeted for marker development in eukaryotes. Sunflower (Helianthus annuus) transcript assemblies were built and mined to identify SSRs and insertions-deletions (INDELs) for marker development, comparative mapping, and other genomics applications in sunflower. We describe the spectrum and frequency of SSRs identified in the sunflower EST database, a catalog of 16,643 EST-SSRs, a collection of 484 EST-SSR and 43 EST-INDEL markers developed from common sunflower ESTs, polymorphisms of the markers among the parents of several intraspecific and interspecific mapping populations, and the transferability of the markers to closely and distantly related species in the Compositae. Of 17,904 unigenes in the transcript assembly, 1,956 (10.9%) harbored one or more SSRs with repeat counts of n > or = 5. EST-SSR markers were 1.6-fold more polymorphic among exotic than elite genotypes and 0.7-fold less polymorphic than non-genic SSR markers. Of 466 EST-SSR or INDEL markers screened for cross-species amplification and polymorphisms, 413 (88.6%) amplified alleles from one or more wild species (H. argophyllus, H. tuberosus, H. anomalus, H. paradoxus, and H. deserticola), whereas 69 (14.8%) amplified alleles from safflower (Carthamus tinctorius) and 67 (14.4%) amplified alleles from lettuce (Lactuca sativa); hence, only a fraction were transferable to distantly related genera in the Compositae, whereas most were transferable to wild relatives of H. annuus. Several thousand additional SSRs were identified in the EST database and supply a wealth of templates for EST-SSR marker development in sunflower.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Asteraceae / classification
  • Computational Biology
  • Databases, Genetic
  • Expressed Sequence Tags*
  • Genetic Markers
  • Helianthus / genetics*
  • INDEL Mutation*
  • Minisatellite Repeats*
  • Polymorphism, Genetic*
  • Species Specificity


  • Genetic Markers