BCFtools/RoH: a hidden Markov model approach for detecting autozygosity from next-generation sequencing data
- PMID: 26826718
- PMCID: PMC4892413
- DOI: 10.1093/bioinformatics/btw044
Abstract
Runs of homozygosity (RoHs) are genomic stretches of a diploid genome that show identical alleles on both chromosomes. Longer RoHs are unlikely to have arisen by chance but are likely to denote autozygosity, whereby both copies of the genome descend from the same recent ancestor. Early tools to detect RoH used genotype array data, but substantially more information is available from sequencing data. Here, we present and evaluate BCFtools/RoH, an extension to the BCFtools software package, that detects regions of autozygosity in sequencing data, in particular exome data, using a hidden Markov model. By applying it to simulated data and real data from the 1000 Genomes Project we estimate its accuracy and show that it has higher sensitivity and specificity than existing methods under a range of sequencing error rates and levels of autozygosity.
Availability and implementation: BCFtools/RoH and its associated binary/source files are freely available from https://github.com/samtools/BCFtools
Contact: vn2@sanger.ac.uk or pd3@sanger.ac.uk
Supplementary information: Supplementary data are available at Bioinformatics online.
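To illustrate the general idea of a two-state hidden Markov model for autozygosity described in the abstract, here is a minimal Python sketch. It is not the BCFtools/RoH implementation: the function name `viterbi_roh`, the parameters `err` and `p_switch`, the use of hard genotype calls, and the constant per-site switching probability are all simplifying assumptions made for this example. Each site is assigned to either a Hardy-Weinberg (non-autozygous) state or an autozygous state by Viterbi decoding.

```python
import math


def viterbi_roh(genotypes, alt_freqs, err=0.01, p_switch=0.01):
    """Label each site as autozygous (True) or not (False).

    genotypes: alternate-allele counts per site (0, 1 or 2).
    alt_freqs: alternate-allele frequency per site.
    err:       assumed probability of seeing a heterozygote inside an
               autozygous region (genotyping error; hypothetical value).
    p_switch:  assumed constant per-site probability of changing state
               (hypothetical value; no recombination map is used here).
    """
    if len(genotypes) != len(alt_freqs):
        raise ValueError("genotypes and alt_freqs must have equal length")
    if not genotypes:
        return []

    def emissions(g, f):
        # Keep the frequency away from 0 and 1 so every log is finite.
        f = min(max(f, 1e-6), 1 - 1e-6)
        # State 0 (HW): genotypes follow Hardy-Weinberg proportions.
        hw = ((1 - f) ** 2, 2 * f * (1 - f), f ** 2)[g]
        # State 1 (AZ): both copies identical, heterozygotes only by error.
        az = ((1 - f) * (1 - err), err, f * (1 - err))[g]
        return hw, az

    log_stay = math.log(1 - p_switch)
    log_move = math.log(p_switch)

    # Initialise with equal prior probability for both states.
    first = emissions(genotypes[0], alt_freqs[0])
    score = [math.log(0.5) + math.log(e) for e in first]
    back = []  # back[i][s] = best predecessor of state s at site i + 1

    for g, f in zip(genotypes[1:], alt_freqs[1:]):
        em = emissions(g, f)
        new_score, ptr = [], []
        for s in (0, 1):
            stay = score[s] + log_stay
            move = score[1 - s] + log_move
            if stay >= move:
                best, prev = stay, s
            else:
                best, prev = move, 1 - s
            new_score.append(best + math.log(em[s]))
            ptr.append(prev)
        score = new_score
        back.append(ptr)

    # Trace back the most probable state path.
    state = 0 if score[0] >= score[1] else 1
    path = [state]
    for ptr in reversed(back):
        state = ptr[state]
        path.append(state)
    path.reverse()
    return [s == 1 for s in path]
```

Under these assumptions, a long stretch of homozygous calls at common variants accumulates enough evidence to outweigh the cost of switching state, whereas isolated homozygous sites between heterozygotes do not.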
© The Author 2016. Published by Oxford University Press.
