Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Aug 18;43(14):6799-813.
doi: 10.1093/nar/gkv656. Epub 2015 Jul 6.

Transcriptomic profiling of gene expression and RNA processing during Leishmania major differentiation

Affiliations

Transcriptomic profiling of gene expression and RNA processing during Leishmania major differentiation

Laura A L Dillon et al. Nucleic Acids Res. .

Abstract

Protozoan parasites of the genus Leishmania are the etiological agents of leishmaniasis, a group of diseases with a worldwide incidence of 0.9-1.6 million cases per year. We used RNA-seq to conduct a high-resolution transcriptomic analysis of the global changes in gene expression and RNA processing events that occur as L. major transforms from non-infective procyclic promastigotes to infective metacyclic promastigotes. Careful statistical analysis across multiple biological replicates and the removal of batch effects provided a high quality framework for comprehensively analyzing differential gene expression and transcriptome remodeling in this pathogen as it acquires its infectivity. We also identified precise 5' and 3' UTR boundaries for a majority of Leishmania genes and detected widespread alternative trans-splicing and polyadenylation. An investigation of possible correlations between stage-specific preferential trans-splicing or polyadenylation sites and differentially expressed genes revealed a lack of systematic association, establishing that differences in expression levels cannot be attributed to stage-regulated alternative RNA processing. Our findings build on and improve existing expression datasets and provide a substantially more detailed view of L. major biology that will inform the field and potentially provide a stronger basis for drug discovery and vaccine development efforts.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Global gene expression profiles of the procyclic and metacyclic promastigote forms of Leishmania major. RNA-seq was carried out on L. major procyclic (log phase) promastigotes and metacyclic promastigotes isolated after enrichment using Ficoll or PNA. A principal component analysis (PCA) plot (A) and heatmap of a hierarchical clustering analysis using the Euclidean distance metric (B) are shown. Both analyses were performed using all L. major annotated genes (8475) after filtering for low counts, quantile normalization and accounting for batch effects in the statistical model used by limma. In the PCA plot, each point represents an experimental sample with point color indicating L. major developmental stage (blue = procyclic promastigote, orange = metacyclic promastigote) and point shape indicating batch/experimental date. Colors along the top of the heatmap indicate the developmental stage (blue = procyclic promastigote, orange = metacyclic promastigote) and colors along the left side of the heatmap indicate the batch/experimental date.
Figure 2.
Figure 2.
The differentiation of Leishmania major from the procyclic form to the infective metacyclic form reveals global changes in the parasite's gene expression program. RNA-seq was carried out on L. major procyclic and metacyclic promastigotes. DE analysis was done using limma after voom transformation, taking experimental batch into account as part of the limma statistical model. The MA plot shows the relationship between mean expression (log2 counts per million with an offset of 0.5) and fold change. Each point represents one gene. Points colored in red represent 3138 genes expressed at significantly different levels between procyclic and metacyclic promastigotes at an adjusted P-value of <0.05, with genes upregulated in the metacyclic stage relative to the procyclic stage exhibiting positive fold changes.
Figure 3.
Figure 3.
Length and position distribution of gene structure components in Leishmania major. Data from procyclic and metacyclic promastigote samples were combined to describe the gene structure elements of L. major. (A) Distribution of CDS lengths. Start and stop coordinates for coding sequences for previously annotated protein-coding genes (TriTrypDB v. 6.0) and novel ORFs were used to compute CDS lengths. For genes with multiple isoforms, the first isoform listed in TriTrypDB was included in the analysis. (B) Distribution of 5′ UTR lengths. The exact trans-splicing sites associated with each CDS were used to determine the coordinates and lengths of 5′ UTRs. A total of 154 046 trans-splicing sites were identified, corresponding to 5′ UTRs ranging from 0 to 7252 nt in length. (C) Distribution of 3′ UTR lengths. An analysis of polyadenylation sites associated with each CDS was performed to determine 3′ UTR coordinates and lengths. A total of 84 331 polyadenylation sites were identified, corresponding to 3′ UTRs ranging from 0 to 7133 bases in length. (D) Distribution of polypyrimidine (polyPy) tract lengths. A window of 250 nt upstream of the each primary trans-splicing site (8981 total) was scanned to identify its corresponding polyPy tract, defined as the longest stretch of pyrimidine residues interrupted by no more than one purine. (E) Distribution of distances between each primary trans-splicing site and its corresponding polypyrimidine (polyPy) tract. (F) Distribution of distances between each polyPy tract and the polyadenylation site of the upstream gene. A total of 6174 instances of a neighboring polyadenylation site and polyPy tract were identified. (G) Diagram of a ‘typical’ L. major genic region. The median lengths of the gene structure components of L. major were used to construct the structure of a ‘typical’ gene region (two genes and the intergenic region). The median values of each component are shown. Colors correspond to the features depicted in panels A–F.
Figure 4.
Figure 4.
Characterization of alternative RNA processing. Alternative RNA processing events were detected in both promastigote developmental stages and pooled for this analysis. The distribution of distances between primary and minor trans-splicing sites is shown for minor splice sites that use (A) the canonical AG acceptor sequence or (B) an acceptor sequence other than AG. Panel (C) depicts the distribution of distances between primary and minor polyadenylation sites. Only minor sites within 1000 nt of the primary site are plotted. The full ranges for A, B and C, are −7234 to 5973, −7178 to 5876 and −6429 to 5993, respectively. About 18% (11 201 of 62 098), 29% (24 303 of 82 947) and 10% (7301 of 75 487) of values fell outside of the plotted range for A, B and C, respectively. (D) Sequence composition of the region spanning from 90 nt upstream to 10 nt downstream of each primary trans-splicing site. (E) Sequence composition of the region spanning from 50 nt upstream to 50 nt downstream of each primary polyadenylation site.
Figure 5.
Figure 5.
Preferential usage of primary site across developmental stages. (A) For each gene, the length of the 5′ UTR determined by the primary trans-splicing site for the metacyclic stage was plotted against the length of the 5′ UTR determined by the primary trans-splicing site for the procyclic stage. Each point represents a single gene. Points that do not fall along the diagonal represent a change in the primary site between the stages. The color of each point represents the dominance of the primary site (preference for usage of the primary site over other sites within a stage), as determined by the ratio of primary site read counts to secondary site read counts (P/S), averaged for both stages. The size of each point depicts average read count for the primary sites from both stages, thereby showing expression levels and providing a measure of confidence in the data. Three genes highlighted in Supplementary Figure S7 are circled with the gene identifier labeled in blue. (B) A similar plot was done for 3′ UTR length as determined by the primary polyadenylation site for each stage.

Similar articles

Cited by

References

    1. Alvar J., Vélez I.D., Bern C., Herrero M., Desjeux P., Cano J., Jannin J., den Boer M., WHO Leishmaniasis Control Team Leishmaniasis worldwide and global estimates of its incidence. PLoS One. 2012;7:e35671. - PMC - PubMed
    1. Ambit A., Woods K.L., Cull B., Coombs G.H., Mottram J.C. Morphological events during the cell cycle of Leishmania major. Eukaryot. Cell. 2011;10:1429–1438. - PMC - PubMed
    1. Beverley S.M., Turco S.J. Lipophosphoglycan (LPG) and the identification of virulence genes in the protozoan parasite Leishmania. Trends Microbiol. 1998;6:35–40. - PubMed
    1. Wheeler R.J., Gluenz E., Gull K. The cell cycle of Leishmania: morphogenetic events and their implications for parasite biology. Mol. Microbiol. 2011;79:647–662. - PMC - PubMed
    1. El-Sayed N.M., Myler P.J., Blandin G., Berriman M., Crabtree J., Aggarwal G., Caler E., Renauld H., Worthey E.A., Hertz-Fowler C., et al. Comparative genomics of trypanosomatid parasitic protozoa. Science. 2005;309:404–409. - PubMed

Publication types