Athila4 of Arabidopsis and Calypso of soybean define a lineage of endogenous plant retroviruses

Genome Res. 2002 Jan;12(1):122-31. doi: 10.1101/gr.196001.

Abstract

The Athila retroelements of Arabidopsis thaliana encode a putative envelope gene, suggesting that they are infectious retroviruses. Because most insertions are highly degenerate, we undertook a comprehensive analysis of the A. thaliana genome sequence to discern their conserved features. One family (Athila4) was identified whose members are largely intact and share >94% nucleotide identity. As a basis for comparison, related elements (the Calypso elements) were characterized from soybean. Consensus Calypso and Athila4 elements are 12-14 kb in length and have long terminal repeats of 1.3-1.8 kb. Gag and Pol are encoded on a single open reading frame (ORF) of 1801 (Calypso) and 1911 (Athila4) amino acids. Following the Gag-Pol ORF are noncoding regions of ~0.7 and 2 kb, which, respectively, flank the env-like gene. The env-like ORF begins with a putative splice acceptor site and encodes a protein with a predicted central transmembrane domain, similar to retroviral env genes. RNA of Athila elements was detected in an A. thaliana strain with decreased DNA methylation (ddm1). Additionally, a PCR survey identified related reverse transcriptases in diverse angiosperm genomes. Their ubiquitous nature and the potential for horizontal transfer by infection implicates these endogenous retroviruses as important vehicles for plant genome evolution.

Publication types

  • Letter
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Arabidopsis / genetics*
  • Arabidopsis / virology*
  • Endogenous Retroviruses / genetics*
  • Evolution, Molecular
  • Gene Frequency
  • Genes, env
  • Glycine max / genetics*
  • Glycine max / virology*
  • Molecular Sequence Data
  • Plant Viruses / genetics
  • Retroelements* / genetics*
  • Viral Envelope Proteins / chemistry
  • Viral Envelope Proteins / genetics

Substances

  • Retroelements
  • Viral Envelope Proteins

Associated data

  • GENBANK/AF186182
  • GENBANK/AF186183
  • GENBANK/AF186184
  • GENBANK/AF186185
  • GENBANK/AF186186
  • GENBANK/AF378012
  • GENBANK/AF378013
  • GENBANK/AF378014
  • GENBANK/AF378015
  • GENBANK/AF378016
  • GENBANK/AF378017
  • GENBANK/AF378018
  • GENBANK/AF378019
  • GENBANK/AF378020
  • GENBANK/AF378021
  • GENBANK/AF378022
  • GENBANK/AF378023
  • GENBANK/AF378024
  • GENBANK/AF378025
  • GENBANK/AF378026
  • GENBANK/AF378027
  • GENBANK/AF378028
  • GENBANK/AF378029
  • GENBANK/AF378030
  • GENBANK/AF378031
  • GENBANK/AF378032
  • GENBANK/AF378033
  • GENBANK/AF378034
  • GENBANK/AF378035
  • GENBANK/AF378036
  • GENBANK/AF378037
  • GENBANK/AF378038
  • GENBANK/AF378039
  • GENBANK/AF378040
  • GENBANK/AF378041
  • GENBANK/AF378042
  • GENBANK/AF378043
  • GENBANK/AF378044
  • GENBANK/AF378045
  • GENBANK/AF378046
  • GENBANK/AF378047
  • GENBANK/AF378048
  • GENBANK/AF378049
  • GENBANK/AF378050
  • GENBANK/AF378051
  • GENBANK/AF378052
  • GENBANK/AF378053
  • GENBANK/AF378054
  • GENBANK/AF378055
  • GENBANK/AF378056
  • GENBANK/AF378057
  • GENBANK/AF378058
  • GENBANK/AF378059
  • GENBANK/AF378060
  • GENBANK/AF378061
  • GENBANK/AF378062
  • GENBANK/AF378063
  • GENBANK/AF378064
  • GENBANK/AF378065
  • GENBANK/AF378066
  • GENBANK/AF378067
  • GENBANK/AF378068
  • GENBANK/AF378069
  • GENBANK/AF378070
  • GENBANK/AF378071
  • GENBANK/AF378072
  • GENBANK/AF378073
  • GENBANK/AF378074
  • GENBANK/AF378075
  • GENBANK/AF378076
  • GENBANK/AF378077
  • GENBANK/AF378078
  • GENBANK/AF378079
  • GENBANK/AF378080
  • GENBANK/AF378081