Linear models enable powerful differential activity analysis in massively parallel reporter assays
- PMID: 30866806
- PMCID: PMC6417258
- DOI: 10.1186/s12864-019-5556-x
Abstract
Background: Massively parallel reporter assays (MPRAs) have emerged as a popular means for understanding noncoding variation in a variety of conditions. While a large number of experiments have been described in the literature, analysis typically uses ad-hoc methods. There has been little attention to comparing performance of methods across datasets.
Results: We present mpralm, a method that we show to be calibrated and powerful by analyzing its performance on multiple MPRA datasets. In the first comprehensive evaluation of statistical methods across several datasets, we show that it outperforms existing statistical methods for this data type. We investigate theoretical and real-data properties of barcode summarization methods and show that the choice of summarization method has a previously unappreciated impact for some datasets. Finally, we use our model to conduct a power analysis for this assay and show that power improves substantially with up to 6 replicates per condition, whereas sequencing depth has a smaller impact; we recommend always using at least 4 replicates. An R package is available from the Bioconductor project.
Conclusions: Together, these results inform recommendations for differential analysis, general group comparisons, and power analysis and will help improve design and analysis of MPRA experiments.
Keywords: Enhancer; Massively parallel reporter assays; Statistics.
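The abstract refers to barcode summarization methods without naming them. As a rough illustration only, the sketch below contrasts two ways one might collapse per-barcode RNA and DNA counts for a single element into one activity value: summing counts before taking a log ratio, or averaging per-barcode log ratios. The function names, the pseudocount, and the example counts are all hypothetical and are not taken from the mpralm package.

```python
import math

def aggregate_log_ratio(rna_counts, dna_counts, pseudocount=1.0):
    """Sum counts over barcodes first, then take a single log2(RNA/DNA)."""
    rna_total = sum(rna_counts) + pseudocount
    dna_total = sum(dna_counts) + pseudocount
    return math.log2(rna_total / dna_total)

def average_log_ratio(rna_counts, dna_counts, pseudocount=1.0):
    """Take log2(RNA/DNA) per barcode, then average across barcodes."""
    ratios = [
        math.log2((r + pseudocount) / (d + pseudocount))
        for r, d in zip(rna_counts, dna_counts)
    ]
    return sum(ratios) / len(ratios)

# Hypothetical counts for three barcodes tagging the same element.
rna = [10, 200, 35]
dna = [20, 100, 70]
print("aggregate:", aggregate_log_ratio(rna, dna))
print("average:  ", average_log_ratio(rna, dna))
```

The two summaries generally differ when barcodes have unequal depth, because summing weights high-count barcodes more heavily while averaging weights every barcode equally.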
Conflict of interest statement
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
