Efficient change point detection for genomic sequences of continuous measurements
- PMID: 21088029
- DOI: 10.1093/bioinformatics/btq647
Efficient change point detection for genomic sequences of continuous measurements
Abstract
Motivation: Knowing the exact locations of multiple change points in genomic sequences serves several biological needs, for instance when data represent aCGH profiles and it is of interest to identify possibly damaged genes involved in cancer and other diseases. Only a few of the currently available methods deal explicitly with estimation of the number and location of change points, and moreover these methods may be somewhat vulnerable to deviations of model assumptions usually employed.
Results: We present a computationally efficient method to obtain estimates of the number and location of the change points. The method is based on a simple transformation of data and it provides results quite robust to model misspecifications. The efficiency of the method guarantees moderate computational times regardless of the series length and the number of change points.
Availability: The methods described in this article are implemented in the new R package cumSeg available from the Comprehensive R Archive Network at http://CRAN.R-project.org/package=cumSeg.
Similar articles
-
Multiple Break-Points Detection in Array CGH Data via the Cross-Entropy Method.IEEE/ACM Trans Comput Biol Bioinform. 2015 Mar-Apr;12(2):487-98. doi: 10.1109/TCBB.2014.2361639. IEEE/ACM Trans Comput Biol Bioinform. 2015. PMID: 26357234
-
Efficient change-points detection for genomic sequences via cumulative segmented regression.Bioinformatics. 2022 Jan 3;38(2):311-317. doi: 10.1093/bioinformatics/btab685. Bioinformatics. 2022. PMID: 34601562
-
Hybrid algorithms for multiple change-point detection in biological sequences.Adv Exp Med Biol. 2015;823:41-61. doi: 10.1007/978-3-319-10984-8_3. Adv Exp Med Biol. 2015. PMID: 25381101
-
Finding recurrent copy number alterations preserving within-sample homogeneity.Bioinformatics. 2011 Nov 1;27(21):2949-56. doi: 10.1093/bioinformatics/btr488. Epub 2011 Aug 25. Bioinformatics. 2011. PMID: 21873327
-
Zoom-in array comparative genomic hybridization (aCGH) to detect germline rearrangements in cancer susceptibility genes.Methods Mol Biol. 2010;653:221-35. doi: 10.1007/978-1-60761-759-4_13. Methods Mol Biol. 2010. PMID: 20721746 Review.
Cited by
-
The Photorhabdus asymbiotica virulence cassettes deliver protein effectors directly into target eukaryotic cells.Elife. 2019 Sep 17;8:e46259. doi: 10.7554/eLife.46259. Elife. 2019. PMID: 31526474 Free PMC article.
-
Non-parametric change-point method for differential gene expression detection.PLoS One. 2011;6(5):e20060. doi: 10.1371/journal.pone.0020060. Epub 2011 May 31. PLoS One. 2011. PMID: 21655325 Free PMC article.
-
Visualization of genomic changes by segmented smoothing using an L0 penalty.PLoS One. 2012;7(6):e38230. doi: 10.1371/journal.pone.0038230. Epub 2012 Jun 5. PLoS One. 2012. PMID: 22679492 Free PMC article.
-
FDR control of detected regions by multiscale matched filtering.Commun Stat Simul Comput. 2017;46(1):127-144. doi: 10.1080/03610918.2014.957842. Epub 2014 Dec 23. Commun Stat Simul Comput. 2017. PMID: 31501637 Free PMC article.
-
Detecting multiple generalized change-points by isolating single ones.Metrika. 2022;85(2):141-174. doi: 10.1007/s00184-021-00821-6. Epub 2021 May 24. Metrika. 2022. PMID: 34054146 Free PMC article.
MeSH terms
LinkOut - more resources
Full Text Sources
