Rapid identification of non-human sequences in high-throughput sequencing datasets

Bioinformatics. 2012 Apr 15;28(8):1174-5. doi: 10.1093/bioinformatics/bts100. Epub 2012 Feb 28.


Rapid identification of non-human sequences (RINS) is an intersection-based pathogen detection workflow that uses a user-provided custom reference genome set to identify non-human sequences in deep sequencing datasets. In <2 h, RINS correctly identified the known virus in the dataset SRR73726, and it runs on any computer capable of running the prerequisite alignment and assembly programs. RINS accurately identifies sequencing reads from intact or mutated non-human genomes in a dataset and robustly assembles these non-human sequences into contigs (Supplementary Material).

Availability: RINS is available for free download at http://khavarilab.stanford.edu/resources.html.
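
The abstract does not spell out the individual steps of the workflow, so the following is only a minimal sketch of one plausible reading of "intersection-based" detection: keep the reads that match the user-provided custom reference set and do not match the human genome. It is not the RINS implementation. Exact k-mer sharing stands in for a real aligner, and the function names, the k-mer length and the `min_hits` threshold are all hypothetical choices made for illustration.

```python
from typing import Dict, Iterable, Set


def kmers(seq: str, k: int) -> Set[str]:
    """Return the set of all length-k substrings of a sequence."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def build_index(references: Iterable[str], k: int) -> Set[str]:
    """Pool the k-mers of every reference sequence into one lookup set."""
    index: Set[str] = set()
    for ref in references:
        index |= kmers(ref, k)
    return index


def shared_kmers(read: str, index: Set[str], k: int) -> int:
    """Count distinct k-mers of the read that also occur in the index."""
    return len(kmers(read, k) & index)


def candidate_non_human_reads(
    reads: Dict[str, str],
    custom_refs: Iterable[str],
    human_refs: Iterable[str],
    k: int = 8,          # assumed k-mer length, illustrative only
    min_hits: int = 1,   # assumed match threshold, illustrative only
) -> Dict[str, str]:
    """Keep reads that match the custom reference set but not human.

    Exact k-mer sharing is a crude stand-in for alignment; a real
    workflow would call dedicated alignment programs at this point.
    """
    custom_index = build_index(custom_refs, k)
    human_index = build_index(human_refs, k)
    kept: Dict[str, str] = {}
    for name, seq in reads.items():
        matches_custom = shared_kmers(seq, custom_index, k) >= min_hits
        matches_human = shared_kmers(seq, human_index, k) >= min_hits
        if matches_custom and not matches_human:
            kept[name] = seq
    return kept


# Toy data, invented for this sketch (not real genomic sequence).
viral_ref = "ATGCGTACGTTAGCCGATAGGCTTAACG"
human_ref = "GGGTTTCCCAAAGGGTTTCCCAAATTTG"
reads = {
    "read_viral": viral_ref[3:19],     # drawn from the custom reference
    "read_human": human_ref[2:18],     # drawn from the human reference
    "read_other": "TTTTTTTTTTTTTTTT",  # matches neither reference
}
candidates = candidate_non_human_reads(reads, [viral_ref], [human_ref])
```

In this toy run only `read_viral` survives both filters. In a workflow of the kind the abstract describes, the surviving reads would then be handed to an assembly program to build contigs; that step is omitted here.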

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Genome*
  • Genome, Human
  • High-Throughput Nucleotide Sequencing*
  • Host-Pathogen Interactions
  • Humans
  • Sequence Analysis, DNA
  • Software
  • Viruses / classification
  • Viruses / genetics*
  • Viruses / isolation & purification*