Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Dec 9;38(12):5563-5575.
doi: 10.1093/molbev/msab270.

Selection on Accessible Chromatin Regions in Capsella grandiflora

Affiliations

Selection on Accessible Chromatin Regions in Capsella grandiflora

Robert Horvath et al. Mol Biol Evol. .

Abstract

Accurate estimates of genome-wide rates and fitness effects of new mutations are essential for an improved understanding of molecular evolutionary processes. Although eukaryotic genomes generally contain a large noncoding fraction, functional noncoding regions and fitness effects of mutations in such regions are still incompletely characterized. A promising approach to characterize functional noncoding regions relies on identifying accessible chromatin regions (ACRs) tightly associated with regulatory DNA. Here, we applied this approach to identify and estimate selection on ACRs in Capsella grandiflora, a crucifer species ideal for population genomic quantification of selection due to its favorable population demography. We describe a population-wide ACR distribution based on ATAC-seq data for leaf samples of 16 individuals from a natural population. We use population genomic methods to estimate fitness effects and proportions of positively selected fixations (α) in ACRs and find that intergenic ACRs harbor a considerable fraction of weakly deleterious new mutations, as well as a significantly higher proportion of strongly deleterious mutations than comparable inaccessible intergenic regions. ACRs are enriched for expression quantitative trait loci (eQTL) and depleted of transposable element insertions, as expected if intergenic ACRs are under selection because they harbor regulatory regions. By integrating empirical identification of intergenic ACRs with analyses of eQTL and population genomic analyses of selection, we demonstrate that intergenic regulatory regions are an important source of nearly neutral mutations. These results improve our understanding of selection on noncoding regions and the role of nearly neutral mutations for evolutionary processes in outcrossing Brassicaceae species.

Keywords: ATAC-sequencing; distribution of fitness effects; functional noncoding sequences; gene expression variation; natural selection; open chromatin region.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Genetic profile plots. (A) Proportion of ACRs in and around genes (TSS: transcription start site; TES: transcription end site). (B) Proportion of ACRs in and around CNSs. (C) Proportion of SNPs in and around intergenic ACRs. (D) Proportion of TE insertions in and around ACRs. All plots only include mappable regions of the genome (see Materials and Methods) and the inferred boundaries of the CNS and ACR are labeled upstream boundary (UB) and downstream boundary (DB).
Fig. 2.
Fig. 2.
Estimated distribution of fitness effects of new mutations in intergenic ACRs, intergenic control regions, and 0-fold degenerate sites using a one-epoch demographic model. The estimated DFE was binned based on the mean selective effect (−Nes) into three bins: effectively neutral (0 < −Nes < 1), intermediate fitness effect (1 < −Nes ≤ 10), and strongly deleterious (−Nes > 10) mutations. (A) DFE of intergenic ACRs and 0-fold degenerate sites. (B) DFE of distal ACRs (dACRs), distal control regions (d-control), proximal ACRs (pACRs), and proximal control regions (P-control). (C) DFE of intergenic ACRs split up into unique (u), common (c), and high-frequency (h) ACRs. Error bars show the 95% CI of each estimate based on 100 bootstrap replicates. Significant differences between estimates are shown by asterisks (Kruskal–Wallis test with a Dunn post hoc test, P-value < 0.05).
Fig. 3.
Fig. 3.
Comparison between the observed and expected number of cis- and trans-eQTL located in ACRs and their 500 bp surroundings in C. grandiflora. (A) The expected number of cis-eQTL found within and in the proximity of ACRs located in and around genes (5 kb up- and downstream). (B) The expected number of trans-eQTL found within and in the proximity of ACRs at least 5 kb away from genes. The expected number of cis- and trans-eQTL within and in the proximity of ACRs (gray) were based on 1,000 permutations. The observed number of cis- and trans-eQTL located within and in the proximity of ACRs is indicated by the red lines. The blue dashed lines delimit the 95% confidence interval of the permutation test.

Similar articles

Cited by

References

    1. Alexa A, Rahnenfuhrer J.. 2021. topGO: enrichment analysis for gene ontology. R package version 2.44.0. Saarbrücken, Germany: Max Planck Institut für Informatik.
    1. Alexandre CM, Urton JR, Jean-Baptiste K, Huddleston J, Dorrity MW, Cuperus JT, Sullivan AM, Bemm F, Jolic D, Arsovski AA, et al.2018. Complex relationships between chromatin accessibility, sequence divergence, and gene expression in Arabidopsis thaliana. Mol Biol Evol. 35(4):837–854. - PMC - PubMed
    1. Andolfatto P. 2005. Adaptive evolution of non-coding DNA in Drosophila. Nature 437(7062):1149–1152. - PubMed
    1. Bajic M, Maher KA, Deal RB.. 2018. Identification of open chromatin regions in plant genomes using ATAC-Seq. Methods Mol Biol. 1675:183–201. - PMC - PubMed
    1. Bolger AM, Lohse M, Usadel B.. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30(15):2114–2120. - PMC - PubMed

Publication types