Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Jul 2;8(6):1971-9.
doi: 10.1093/gbe/evw137.

Complex Selection on Human Polyadenylation Signals Revealed by Polymorphism and Divergence Data

Affiliations

Complex Selection on Human Polyadenylation Signals Revealed by Polymorphism and Divergence Data

Yaroslav A Kainov et al. Genome Biol Evol. .

Abstract

Polyadenylation is a step of mRNA processing which is crucial for its expression and stability. The major polyadenylation signal (PAS) represents a nucleotide hexamer that adheres to the AATAAA consensus sequence. Over a half of human genes have multiple cleavage and polyadenylation sites, resulting in a great diversity of transcripts differing in function, stability, and translational activity. Here, we use available whole-genome human polymorphism data together with data on interspecies divergence to study the patterns of selection acting on PAS hexamers. Common variants of PAS hexamers are depleted of single nucleotide polymorphisms (SNPs), and SNPs within PAS hexamers have a reduced derived allele frequency (DAF) and increased conservation, indicating prevalent negative selection; at the same time, the SNPs that "improve" the PAS (i.e., those leading to higher cleavage efficiency) have increased DAF, compared to those that "impair" it. SNPs are rarer at PAS of "unique" polyadenylation sites (one site per gene); among alternative polyadenylation sites, at the distal PAS and at exonic PAS. Similar trends were observed in DAFs and divergence between species of placental mammals. Thus, selection permits PAS mutations mainly at redundant and/or weakly functional PAS. Nevertheless, a fraction of the SNPs at PAS hexamers likely affect gene functions; in particular, some of the observed SNPs are associated with disease.

Keywords: 1000 genomes; AATAAA; SNP; mRNA processing; polyadenylation.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Patterns of selection in PAS hexamers. Whiskers represent standard errors of the mean. Asterisks correspond to P < 0.05 according to the Mann–Whitney’s U-test. (A) Densities of SNPs in PAS hexamers and in the control sample. (B) Mean DAF of SNPs is reduced in PAS hexamers, compared to the control sample. (C) Mean phastCons score is increased in PAS hexamers, compared to the control sample. (D) DAF depends on the effect of SNPs on the functional activity of the PAS hexamer (box width represents the number of sites SNPs in the category).
Fig. 2.
Fig. 2.
Schematic representation of functional classification of PAS hexamers. PAS hexamers are categorized according to the number and position of corresponding cleavage sites (A); number of PAS hexamers corresponding to a single cleavage site (B); localization within exon or intron (C); or localization within CDS or 3′-UTR (D). Gray boxes, coding exons; thick lines, 3′-UTR exons; angled lines, introns; arrows, cleavage sites.
Fig. 3.
Fig. 3.
Polymorphism in different functional groups of PAS hexamers. (A) SNP densities; (B) DAFs; (C) PhastCons scores. In A and B, box width represents the number of SNPs in the group. Dashed lines represent the mean value in the entire sample, and dotted lines, its standard error. Whiskers represent standard error of the mean. Asterisks identify difference of the particular group from the remaining PAS hexamers, according to Fisher’s exact test or Mann–Whitney U-test; *P < 0.05, **P < 10 3, ***P < 10 10.

Similar articles

Cited by

References

    1. Adzhubei IA, et al. 2010. A method and server for predicting damaging missense mutations. Nat Methods. 7:248–249. - PMC - PubMed
    1. Ara T, Lopez F, Ritchie W, Benech P, Gautheret D. 2006. Conservation of alternative polyadenylation patterns in mammalian genes. BMC Genomics 7:189.. - PMC - PubMed
    1. Beaudoing E, Freier S, Wyatt JR, Claverie JM, Gautheret D. 2000. Patterns of variant polyadenylation signal usage in human genes. Genome Res. 10:1001–1010. - PMC - PubMed
    1. Bennett CL, et al. 2001. A rare polyadenylation signal mutation of the FOXP3 gene (AAUAAA–>AAUGAA) leads to the IPEX syndrome. Immunogenetics 53:435–439. - PubMed
    1. Castelo-Branco P, et al. 2004. Polypyrimidine tract binding protein modulates efficiency of polyadenylation. Mol Cell Biol. 24:4174–4183. - PMC - PubMed