The sva package for removing batch effects and other unwanted variation in high-throughput experiments
- PMID: 22257669
- PMCID: PMC3307112
- DOI: 10.1093/bioinformatics/bts034
The sva package for removing batch effects and other unwanted variation in high-throughput experiments
Abstract
Heterogeneity and latent variables are now widely recognized as major sources of bias and variability in high-throughput experiments. The most well-known source of latent variation in genomic experiments are batch effects-when samples are processed on different days, in different groups or by different people. However, there are also a large number of other variables that may have a major impact on high-throughput measurements. Here we describe the sva package for identifying, estimating and removing unwanted sources of variation in high-throughput experiments. The sva package supports surrogate variable estimation with the sva function, direct adjustment for known batch effects with the ComBat function and adjustment for batch and latent variables in prediction problems with the fsva function.
Similar articles
-
svaseq: removing batch effects and other unwanted noise from sequencing data.Nucleic Acids Res. 2014 Dec 1;42(21):e161. doi: 10.1093/nar/gku864. Epub 2014 Oct 7. Nucleic Acids Res. 2014. PMID: 25294822 Free PMC article.
-
Removing batch effects for prediction problems with frozen surrogate variable analysis.PeerJ. 2014 Sep 23;2:e561. doi: 10.7717/peerj.561. eCollection 2014. PeerJ. 2014. PMID: 25332844 Free PMC article.
-
Practical impacts of genomic data "cleaning" on biological discovery using surrogate variable analysis.BMC Bioinformatics. 2015 Nov 6;16:372. doi: 10.1186/s12859-015-0808-5. BMC Bioinformatics. 2015. PMID: 26545828 Free PMC article.
-
Blind estimation and correction of microarray batch effect.PLoS One. 2020 Apr 9;15(4):e0231446. doi: 10.1371/journal.pone.0231446. eCollection 2020. PLoS One. 2020. PMID: 32271844 Free PMC article.
-
Recent progress with next-generation biomarkers in muscle-invasive bladder cancer.Int J Urol. 2017 Jan;24(1):7-15. doi: 10.1111/iju.13193. Epub 2016 Sep 6. Int J Urol. 2017. PMID: 27597124 Review.
Cited by
-
Exploring the role of endoplasmic reticulum stress in recurrent spontaneous abortion: Identification of diagnostic biomarkers and immune cell interactions.Heliyon. 2024 Oct 5;10(19):e38964. doi: 10.1016/j.heliyon.2024.e38964. eCollection 2024 Oct 15. Heliyon. 2024. PMID: 39430538 Free PMC article.
-
Development of a web-based tool for estimating individualized survival curves in glioblastoma using clinical, mRNA, and tumor microenvironment features with fusion techniques.Clin Transl Oncol. 2024 Oct 30. doi: 10.1007/s12094-024-03739-3. Online ahead of print. Clin Transl Oncol. 2024. PMID: 39476302
-
Investigating the impact of severe maternal SARS-CoV-2 infection on infant DNA methylation and neurodevelopment.Mol Psychiatry. 2024 Oct 30. doi: 10.1038/s41380-024-02808-x. Online ahead of print. Mol Psychiatry. 2024. PMID: 39478169
-
Discovery of PANoptosis-related signatures correlates with immune cell infiltration in psoriasis.PLoS One. 2024 Oct 31;19(10):e0310362. doi: 10.1371/journal.pone.0310362. eCollection 2024. PLoS One. 2024. PMID: 39480805 Free PMC article.
-
Effects of CXCL12 isoforms in a pancreatic pre-tumour cellular model: Microarray analysis.World J Gastroenterol. 2021 Apr 21;27(15):1616-1629. doi: 10.3748/wjg.v27.i15.1616. World J Gastroenterol. 2021. PMID: 33958847 Free PMC article.
References
-
- Brem R.B., et al. Genetic dissection of transcriptional regulation in budding yeast. Science. 2002;296:752–755. - PubMed
-
- Gibson G. The environmental contribution to gene expression profiles. Nat. Rev. Genet. 2008;9:575–581. - PubMed
-
- Johnson W., et al. Adjusting batch effects in microarray data using empirical bayes methods. Biostatistics. 2007;8:118–127. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
