Integrative Analysis of "-Omics" Data Using Penalty Functions

Wiley Interdiscip Rev Comput Stat. 2015 Jan-Feb;7(1):99-108. doi: 10.1002/wics.1322.

Abstract

In the analysis of omics data, integrative analysis provides an effective way of pooling information across multiple datasets or multiple correlated responses, and can be more effective than single-dataset (response) analysis. Multiple families of integrative analysis methods have been proposed in the literature. The current review focuses on the penalization methods. Special attention is paid to sparse meta-analysis methods that pool summary statistics across datasets, and integrative analysis methods that pool raw data across datasets. We discuss their formulation and rationale. Beyond "standard" penalized selection, we also review contrasted penalization and Laplacian penalization which accommodate finer data structures. The computational aspects, including computational algorithms and tuning parameter selection, are examined. This review concludes with possible limitations and extensions.

Keywords: Integrative analysis; marker selection; omics data; penalization.