Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2020 Dec 16;37(12):3507-3524.
doi: 10.1093/molbev/msaa185.

Evolutionary Genomics of Structural Variation in Asian Rice (Oryza sativa) Domestication

Affiliations
Comparative Study

Evolutionary Genomics of Structural Variation in Asian Rice (Oryza sativa) Domestication

Yixuan Kou et al. Mol Biol Evol. .

Abstract

Structural variants (SVs) are a largely unstudied feature of plant genome evolution, despite the fact that SVs contribute substantially to phenotypes. In this study, we discovered SVs across a population sample of 347 high-coverage, resequenced genomes of Asian rice (Oryza sativa) and its wild ancestor (O. rufipogon). In addition to this short-read data set, we also inferred SVs from whole-genome assemblies and long-read data. Comparisons among data sets revealed different features of genome variability. For example, genome alignment identified a large (∼4.3 Mb) inversion in indica rice varieties relative to japonica varieties, and long-read analyses suggest that ∼9% of genes from the outgroup (O. longistaminata) are hemizygous. We focused, however, on the resequencing sample to investigate the population genomics of SVs. Clustering analyses with SVs recapitulated the rice cultivar groups that were also inferred from SNPs. However, the site-frequency spectrum of each SV type-which included inversions, duplications, deletions, translocations, and mobile element insertions-was skewed toward lower frequency variants than synonymous SNPs, suggesting that SVs may be predominantly deleterious. Among transposable elements, SINE and mariner insertions were found at especially low frequency. We also used SVs to study domestication by contrasting between rice and O. rufipogon. Cultivated genomes contained ∼25% more derived SVs and mobile element insertions than O. rufipogon, indicating that SVs contribute to the cost of domestication in rice. Peaks of SV divergence were enriched for known domestication genes, but we also detected hundreds of genes gained and lost during domestication, some of which were enriched for traits of agronomic interest.

Keywords: domestication; gene gain and loss; rice; structural variation; transposable element insertions.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Features of the data and SV data sets (A) A phylogeny based on SNPs of n =347 accessions of Asian rice with outgroups Oryza meridionalis and O. longistaminata. (B) Population structure inference based on SNPs (top) and SVs (below) for the short-read data set of 347 individuals. The accessions are arranged in the same order for the SNP and SV plots, the x-axis labels denote the different groups, with “aro,” “rufi,” and “niv” referring to aromatic, rufipogon, and O. nivara. (C) A dotplot of chromosome 6 showing the large (∼4.3 Mb) inversion in indica accessions relative to the O. longistaminata outgroup. The inversion is not shared with the japonica accessions in our sample. (D) A Venn diagram based on the combined results from three SV types (DEL, DUP, and INV) that compares SVs among three data sets based on short reads (Illumina, n =347), long reads (SMRT, n =10), and genome alignments (n =14). Results for each SV type separately are available in supplementary figure S3, Supplementary Material online.
Fig. 2.
Fig. 2.
Population information about SVs. (A) The plot graphs SNP and SV average pairwise diversity (π) for the rufipogon sample across 20-kb windows of the genome, with the line indicating the correlation, which is weakly positive but significant (Pearson r =0.0332, P =1.07 × 10−5). Similar graphs for the japonica and indica samples are in supplementary figure S7, Supplementary Material online. Plots (B), (C), and (D) show the unfolded SFS of different types of SVs in (B) rufipogon, (C) indica, and (D) japonica. Each SFS contains synonymous SNPs (Syn), nonsynonymous SNPs (Nsyn), and SV events that fall into the duplication (DUP), deletion (DEL) translocation (TRA), mobile element insertion (MEI), and inversion (INV) categories. (E) The decay of linkage disequilibrium (LD) of SNPs and SVs measured by r2 for the three population groups based on SNPs, SVs, and SNPs+SVs.
Fig. 3.
Fig. 3.
Feature of SVs associated with domestication. (A) The genetic load of SVs for rufipogon, japonica, and indica. The selfing cultivars have a higher recessive (homozygous) load and correspondingly larger total SV burden, suggesting a cost of domestication. (B) Manhattan plots of FST values between rufipogon, based on SVs within 20-kb windows, with japonica on the left and indica on the right. The corresponding Manhattan plots for SNPs are provided in supplementary figure S10, Supplementary Material online. (C) Manhattan plots of CLR values for japonica (left) and indica (right), based on SVs within 20-kb windows. The corresponding Manhattan plots for SNPs are provided in supplementary figure S12, Supplementary Material online. The proportion of different types of SVs under the selective sweeps detected by FST (D) and SweeD (E) analyses.
Fig. 4.
Fig. 4.
Features of TE diversity in rice. Plot (A) provides the SFS for ten element types along with synonymous SNPs (Syn) and nonsynonymous SNPs (Nsyn). This plot is for rufipogon; analogous plots for japonica and indica are provided in supplementary figures S14 and S15, Supplementary Material online. (B) The inferred distribution of fitness effects (DFE) in rufipogon relative to nSNPs. The y-axis provides the proportion of TE insertions, and the x-axis reports Nes. The color scheme for TE families is the same as (A). (C) The estimated proportion of adaptive variation (α) for each TE family and each of the three taxonomic groups. (D) Distributions of inferred insertion times for TE families in the Nipponbare reference. (E) The ratio of homozygous to heterozygous MEI variants in the three taxa for each TE family, which shows that the families under strong selection have relatively fewer homozygous variants.
Fig. 5.
Fig. 5.
Two example regions of complex SVs across Oryza taxa. (A) A segmental tandem duplication of an NBS-LRR-encoding gene (LOC_Os01g05600/Os01g0149350) that was found to be gained in indica and japonica relative to rufipogon. The synteny map is shown for a region corresponding to a 35-kb region in japonica (Nipponbare). (B) Gene and TE copy number variation in a 100-kb region of chromosome 9 that includes the Sub1A gene, which confers flooding tolerance. Both an indica accession and O. longistaminata contain three copies of genes. For both (A) and (B), gene copies are tracked by dotted red lines.

Similar articles

Cited by

References

    1. Alexander DH, Novembre J, Lange K.. 2009. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19(9):1655–1664. - PMC - PubMed
    1. Alkan C, Coe BP, Eichler EE.. 2011. Genome structural variation discovery and genotyping. Nat Rev Genet. 12(5):363–376. - PMC - PubMed
    1. Bolger AM, Lohse M, Usadel B.. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30(15):2114–2120. - PMC - PubMed
    1. Cai Z, Zhou L, Ren N-N, Xu X, Liu R, Huang L, Zheng X-M, Meng Q-L, Du Y-S, Wang M-X, et al.2019. Parallel speciation of wild rice associated with habitat shifts. Mol Biol Evol. 36(5):875–889. - PMC - PubMed
    1. Caicedo AL, Williamson SH, Hernandez RD, Boyko A, Fledel-Alon A, York TL, Polato NR, Olsen KM, Nielsen R, McCouch SR, et al.2007. Genome-wide patterns of nucleotide polymorphism in domesticated rice. PLoS Genet. 3(9):e163. - PMC - PubMed

Publication types

Substances