The nucleoprotein gene of Ebola virus: cloning, sequencing, and in vitro expression

Virology. 1989 May;170(1):81-91. doi: 10.1016/0042-6822(89)90354-1.

Abstract

Genomic and messenger RNAs of a Zaire strain of Ebola virus were cloned, and inserts specific for the nucleoprotein gene were isolated and sequenced. The nucleoprotein gene is located proximal to the 3' end of the genome and is preceeded by a putative leader sequence. The gene begins with the transcriptional start site sequence 3'-UACUCCUUCUAAUU..., and ends with the polyadenylation site sequence 3'-... UAAUUCUUUUUU. The predicted coding region is 2217 bases in length and encodes a protein that contains 739 amino acids, with a calculated molecular weight of 83.3 kDa. The protein has an approximate net charge of -30 and can be divided into a hydrophobic N-terminal half and a hydrophilic and highly acidic C-terminal half. An in vitro transcript, generated from plasmid DNA containing the entire coding region, directs the synthesis of authentic nucleoprotein in a rabbit reticulocyte lysate system. The genomic organization and transcriptional signals of Ebola are similar to those of other nonsegmented, negative-strand RNA viruses, but nucleic acid or amino acid sequence comparisons indicate a lack of similarity.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Base Sequence
  • Cloning, Molecular
  • Ebolavirus / genetics*
  • Genes
  • Genes, Viral
  • Molecular Sequence Data
  • Nucleocapsid Proteins
  • Nucleoproteins / genetics*
  • RNA, Messenger / genetics
  • RNA, Viral / genetics
  • Rhabdoviridae / genetics*
  • Transcription, Genetic
  • Viral Core Proteins / genetics*

Substances

  • Nucleocapsid Proteins
  • Nucleoproteins
  • RNA, Messenger
  • RNA, Viral
  • Viral Core Proteins

Associated data

  • GENBANK/J04337