A tripartite approach identifies the major sunflower seed albumins

Theor Appl Genet. 2016 Mar;129(3):613-29. doi: 10.1007/s00122-015-2653-3. Epub 2016 Jan 14.


We have used a combination of genomic, transcriptomic, and proteomic approaches to identify the napin-type albumin genes in sunflower and define their contributions to the seed albumin pool. Seed protein content is determined by the expression of what are typically large gene families. A major class of seed storage proteins is the napin-type, water soluble albumins. In this work we provide a comprehensive analysis of the napin-type albumin content of the common sunflower (Helianthus annuus) by analyzing a draft genome, a transcriptome and performing a proteomic analysis of the seed albumin fraction. We show that although sunflower contains at least 26 genes for napin-type albumins, only 15 of these are present at the mRNA level. We found protein evidence for 11 of these but the albumin content of mature seeds is dominated by the encoded products of just three genes. So despite high genetic redundancy for albumins, only a small sub-set of this gene family contributes to total seed albumin content. The three genes identified as producing the majority of sunflower seed albumin are potential future candidates for manipulation through genetics and breeding.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • 2S Albumins, Plant / genetics*
  • Amino Acid Sequence
  • Cloning, Molecular
  • DNA, Plant / genetics
  • Genes, Plant
  • Genome, Plant
  • Helianthus / genetics*
  • Molecular Sequence Data
  • Proteome
  • Seeds / chemistry*
  • Transcriptome


  • 2S Albumins, Plant
  • DNA, Plant
  • Proteome