A fast Bayesian change point analysis for the segmentation of microarray data
- PMID: 18667443
- DOI: 10.1093/bioinformatics/btn404
Abstract
Motivation: The ability to detect regions of genetic alteration is of great importance in cancer research. These alterations can take the form of large chromosomal gains and losses as well as smaller amplifications and deletions. The detection of such regions allows researchers to identify genes involved in cancer progression, and to fully understand differences between cancer and non-cancer tissue. The Bayesian method proposed by Barry and Hartigan is well suited for the analysis of such change point problems. In our previous article we introduced the R package bcp (Bayesian change point), an MCMC implementation of Barry and Hartigan's method. In a simulation study and real data examples, bcp is shown to both accurately detect change points and estimate segment means. Earlier versions of bcp (prior to 2.0) are O(n^2) in speed and O(n) in memory (where n is the number of observations), and run in approximately 45 min for a sequence of length 10,000. With the high resolution of newer microarrays, the number of computations in the O(n^2) algorithm is prohibitively time-intensive.
Results: We present a new implementation of the Bayesian change point method that is O(n) in both speed and memory; bcp 2.1 runs in approximately 45 s on a single processor with a sequence of length 10,000, roughly a 60-fold speed gain over the approximately 45 min needed by versions prior to 2.0. Further speed improvements are possible using parallel computing, supported in bcp via NetWorkSpaces. In simulated and real microarray data from the literature, bcp is shown to quickly and accurately detect aberrations of varying width and magnitude.
Availability: The R package bcp is available on CRAN (R Development Core Team, 2008). The O(n) implementation is available in versions 2.0 and higher, with support for NetWorkSpaces in versions 2.1 and higher.
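The abstract describes the output of a change point analysis as a set of change points plus an estimated mean for each resulting segment. The following is a minimal Python sketch of that second step only, assuming the change points are already known. It is not the bcp algorithm and makes no claim about how bcp reaches O(n); the function names `prefix_sums` and `segment_means` and the simulated profile are hypothetical, chosen for illustration. It does show one simple way to keep per-segment work constant after a single linear pass over the data.

```python
import random


def prefix_sums(values):
    """Return cumulative sums with a leading 0, so sum(values[i:j]) == out[j] - out[i]."""
    out = [0.0]
    for v in values:
        out.append(out[-1] + v)
    return out


def segment_means(values, change_points):
    """Mean of each segment.

    change_points lists the last index of every segment except the final one
    (an assumed convention for this sketch).
    """
    sums = prefix_sums(values)  # one O(n) pass
    bounds = [0] + [cp + 1 for cp in change_points] + [len(values)]
    # Each segment mean is then a constant-time lookup.
    return [(sums[b] - sums[a]) / (b - a) for a, b in zip(bounds, bounds[1:])]


# Hypothetical profile: a neutral region, a gain, then a short deletion.
rng = random.Random(0)
true_means = [0.0] * 40 + [1.0] * 30 + [-1.5] * 10
profile = [m + rng.gauss(0, 0.1) for m in true_means]

means = segment_means(profile, change_points=[39, 69])
print([round(m, 2) for m in means])
```

With low noise, the three printed values should sit close to the simulated levels of 0, 1 and -1.5. In a real analysis the change points are unknown, and a method such as bcp reports a posterior probability of a change at each position rather than fixed boundaries.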
