Exploratory bioinformatics investigation reveals importance of "junk" DNA in early embryo development
- PMID: 28231763
- PMCID: PMC5324221
- DOI: 10.1186/s12864-017-3566-0
Exploratory bioinformatics investigation reveals importance of "junk" DNA in early embryo development
Abstract
Background: Instead of testing predefined hypotheses, the goal of exploratory data analysis (EDA) is to find what data can tell us. Following this strategy, we re-analyzed a large body of genomic data to study the complex gene regulation in mouse pre-implantation development (PD).
Results: Starting with a single-cell RNA-seq dataset consisting of 259 mouse embryonic cells derived from zygote to blastocyst stages, we reconstructed the temporal and spatial gene expression pattern during PD. The dynamics of gene expression can be partially explained by the enrichment of transposable elements in gene promoters and the similarity of expression profiles with those of corresponding transposons. Long Terminal Repeats (LTRs) are associated with transient, strong induction of many nearby genes at the 2-4 cell stages, probably by providing binding sites for Obox and other homeobox factors. B1 and B2 SINEs (Short Interspersed Nuclear Elements) are correlated with the upregulation of thousands of nearby genes during zygotic genome activation. Such enhancer-like effects are also found for human Alu and bovine tRNA SINEs. SINEs also seem to be predictive of gene expression in embryonic stem cells (ESCs), raising the possibility that they may also be involved in regulating pluripotency. We also identified many potential transcription factors underlying PD and discussed the evolutionary necessity of transposons in enhancing genetic diversity, especially for species with longer generation time.
Conclusions: Together with other recent studies, our results provide further evidence that many transposable elements may play a role in establishing the expression landscape in early embryos. It also demonstrates that exploratory bioinformatics investigation can pinpoint developmental pathways for further study, and serve as a strategy to generate novel insights from big genomic data.
Keywords: Early embryogenesis; Exploratory data analysis; Pre-implantation development; Repetitive DNA; Single-cell RNA-seq; Transposons.
Figures
Similar articles
-
Expression dynamics of repetitive DNA in early human embryonic development.BMC Genomics. 2019 May 31;20(1):439. doi: 10.1186/s12864-019-5803-1. BMC Genomics. 2019. PMID: 31151386 Free PMC article.
-
Identification and functional analysis of long non-coding RNAs in human and mouse early embryos based on single-cell transcriptome data.Oncotarget. 2016 Sep 20;7(38):61215-61228. doi: 10.18632/oncotarget.11304. Oncotarget. 2016. PMID: 27542205 Free PMC article.
-
Chromatin analysis in human early development reveals epigenetic transition during ZGA.Nature. 2018 May;557(7704):256-260. doi: 10.1038/s41586-018-0080-8. Epub 2018 May 2. Nature. 2018. PMID: 29720659
-
Epigenetic regulatory mechanisms during preimplantation development.Birth Defects Res C Embryo Today. 2009 Dec;87(4):297-313. doi: 10.1002/bdrc.20165. Birth Defects Res C Embryo Today. 2009. PMID: 19960551 Review.
-
The diverse roles of DNA methylation in mammalian development and disease.Nat Rev Mol Cell Biol. 2019 Oct;20(10):590-607. doi: 10.1038/s41580-019-0159-6. Epub 2019 Aug 9. Nat Rev Mol Cell Biol. 2019. PMID: 31399642 Review.
Cited by
-
Rise and SINE: roles of transcription factors and retrotransposons in zygotic genome activation.Nat Rev Mol Cell Biol. 2025 Jan;26(1):68-79. doi: 10.1038/s41580-024-00772-6. Epub 2024 Oct 2. Nat Rev Mol Cell Biol. 2025. PMID: 39358607
-
The Role of Hypoxia-Associated Long Non-Coding RNAs in Breast Cancer.Cells. 2022 May 18;11(10):1679. doi: 10.3390/cells11101679. Cells. 2022. PMID: 35626715 Free PMC article. Review.
-
Pleomorphic Adenoma Gene 1 Is Needed For Timely Zygotic Genome Activation and Early Embryo Development.Sci Rep. 2019 Jun 10;9(1):8411. doi: 10.1038/s41598-019-44882-0. Sci Rep. 2019. PMID: 31182756 Free PMC article.
-
Locus-specific expression of transposable elements in single cells with CELLO-seq.Nat Biotechnol. 2022 Apr;40(4):546-554. doi: 10.1038/s41587-021-01093-1. Epub 2021 Nov 15. Nat Biotechnol. 2022. PMID: 34782740 Free PMC article.
-
Dynamic Transcriptional Landscape of Grass Carp (Ctenopharyngodon idella) Reveals Key Transcriptional Features Involved in Fish Development.Int J Mol Sci. 2022 Sep 30;23(19):11547. doi: 10.3390/ijms231911547. Int J Mol Sci. 2022. PMID: 36232849 Free PMC article.
References
-
- Popper KR. Conjectures and refutations; the growth of scientific knowledge. New York: Basic Books; 1962.
-
- Tukey JW. Exploratory data analysis. Massachusetts: Addison-Wesley Pub. Co; 1977.
-
- Tufte ER. The visual display of quantitative information. Cheshire, Conn. (Box 430, Cheshire 06410): Graphics Press; 1983.
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
