Evolutionary analysis of amino acid repeats across the genomes of 12 Drosophila species
- PMID: 17602168
- DOI: 10.1093/molbev/msm129
Abstract
Repeated motifs of amino acids within proteins are an abundant feature of eukaryotic sequences and may catalyze the rapid production of genetic and even phenotypic variation among organisms. The completion of the genome sequencing projects of 12 distinct Drosophila species provides a unique dataset to study these intriguing sequence features on a phylogeny with a variety of timescales. We show that there is a higher percentage of proteins containing repeats within the Drosophila genus than in most other eukaryotes, including non-Drosophila insects, which makes this collection of species particularly useful for the study of protein repeats. We also find that proteins containing repeats are overrepresented in functional categories involving developmental processes, signaling, and gene regulation. Using the set of 1-to-1 ortholog alignments for the 12 Drosophila species, we test the ability of repeats to act as reliable phylogenetic signals and find that they resolve the generally accepted phylogeny despite the noise caused by their accelerated rate of evolution. We also determine that, in general, the position of repeats within a protein sequence is non-random, with repeats more often being absent from the middle regions of sequences. Finally, we find evidence to suggest that the presence of repeats is associated with an increase in evolutionary rate across the entire sequence in which they are embedded. With additional evidence to suggest a corresponding elevation in positive selection, we propose that some repeats may be inducing compensatory substitutions in their surrounding sequence.
