HDTD: analyzing multi-tissue gene expression data

Bioinformatics. 2016 Jul 15;32(14):2193-5. doi: 10.1093/bioinformatics/btw224. Epub 2016 Jun 7.

Abstract

Motivation: By collecting multiple samples per subject, researchers can characterize intra-subject variation using physiologically relevant measurements such as gene expression profiling. This can yield important insights into fundamental biological questions ranging from cell type identity to tumour development. For each subject, the data measurements can be written as a matrix with the different subsamples (e.g. multiple tissues) indexing the columns and the genes indexing the rows. In this context, neither the genes nor the tissues are expected to be independent and straightforward application of traditional statistical methods that ignore this two-way dependence might lead to erroneous conclusions. Herein, we present a suite of tools embedded within the R/Bioconductor package HDTD for robustly estimating and performing hypothesis tests about the mean relationship and the covariance structure within the rows and columns. We illustrate the utility of HDTD by applying it to analyze data generated by the Genotype-Tissue Expression consortium.

Availability and implementation: The R package HDTD is part of Bioconductor. The source code and a comprehensive user's guide are available at http://bioconductor.org/packages/release/bioc/html/HDTD.html

Contact: : A.Touloumis@brighton.ac.uk

Supplementary information: Supplementary data are available at Bioinformatics online.

MeSH terms

  • Computational Biology / methods*
  • Gene Expression Profiling / methods*
  • Gene Expression*
  • Humans
  • Neoplasms
  • Software*