Tabix: fast retrieval of sequence features from generic TAB-delimited files
- PMID: 21208982
- PMCID: PMC3042176
- DOI: 10.1093/bioinformatics/btq671
Tabix: fast retrieval of sequence features from generic TAB-delimited files
Abstract
Tabix is the first generic tool that indexes position sorted files in TAB-delimited formats such as GFF, BED, PSL, SAM and SQL export, and quickly retrieves features overlapping specified regions. Tabix features include few seek function calls per query, data compression with gzip compatibility and direct FTP/HTTP access. Tabix is implemented as a free command-line tool as well as a library in C, Java, Perl and Python. It is particularly useful for manually examining local genomic features on the command line and enables genome viewers to support huge data files and remote custom tracks over networks.
Availability and implementation: http://samtools.sourceforge.net.
Similar articles
-
The Integrated Genome Browser: free software for distribution and exploration of genome-scale datasets.Bioinformatics. 2009 Oct 15;25(20):2730-1. doi: 10.1093/bioinformatics/btp472. Epub 2009 Aug 4. Bioinformatics. 2009. PMID: 19654113 Free PMC article.
-
SCALCE: boosting sequence compression algorithms using locally consistent encoding.Bioinformatics. 2012 Dec 1;28(23):3051-7. doi: 10.1093/bioinformatics/bts593. Epub 2012 Oct 9. Bioinformatics. 2012. PMID: 23047557 Free PMC article.
-
The Sequence Alignment/Map format and SAMtools.Bioinformatics. 2009 Aug 15;25(16):2078-9. doi: 10.1093/bioinformatics/btp352. Epub 2009 Jun 8. Bioinformatics. 2009. PMID: 19505943 Free PMC article.
-
Genome data mining for everyone.BMB Rep. 2008 Nov 30;41(11):757-64. doi: 10.5483/bmbrep.2008.41.11.757. BMB Rep. 2008. PMID: 19017486 Review.
-
UCSC genome browser tutorial.Genomics. 2008 Aug;92(2):75-84. doi: 10.1016/j.ygeno.2008.02.003. Epub 2008 Jun 2. Genomics. 2008. PMID: 18514479 Review.
Cited by
-
A genome assembly and transcriptome atlas of the inbred Babraham pig to illuminate porcine immunogenetic variation.Immunogenetics. 2024 Sep 19. doi: 10.1007/s00251-024-01355-7. Online ahead of print. Immunogenetics. 2024. PMID: 39294478
-
Setting Up the JBrowse 2 Genome Browser.Curr Protoc. 2024 Aug;4(8):e1120. doi: 10.1002/cpz1.1120. Curr Protoc. 2024. PMID: 39126338
-
SR-TWAS: leveraging multiple reference panels to improve transcriptome-wide association study power by ensemble machine learning.Nat Commun. 2024 Aug 5;15(1):6646. doi: 10.1038/s41467-024-50983-w. Nat Commun. 2024. PMID: 39103319 Free PMC article.
-
A comprehensive tandem repeat catalog of the human genome.medRxiv [Preprint]. 2024 Jun 20:2024.06.19.24309173. doi: 10.1101/2024.06.19.24309173. medRxiv. 2024. PMID: 38947075 Free PMC article. Preprint.
-
Analysis-ready VCF at Biobank scale using Zarr.bioRxiv [Preprint]. 2024 Jun 12:2024.06.11.598241. doi: 10.1101/2024.06.11.598241. bioRxiv. 2024. PMID: 38915693 Free PMC article. Preprint.
References
-
- Alekseyenko AV, Lee CJ. Nested containment list (NCList): a new algorithm for accelerating interval query of genome alignment and interval databases. Bioinformatics. 2007;23:1386–1393. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
