Cross phenotype normalization of microarray data

Jianhua Xuan; Yue Wang; Eric Hoffman; Robert Clarke

doi:10.2741/e80

Cross phenotype normalization of microarray data

Front Biosci (Elite Ed). 2010 Jan 1;2(1):171-86. doi: 10.2741/e80.

Authors

Jianhua Xuan¹, Yue Wang, Eric Hoffman, Robert Clarke

Affiliation

¹ Department of Electrical and Computer Engineering, Virginia Polytechnic Institute and State University, Arlington, VA 22203, USA.

PMID: 20036868
DOI: 10.2741/e80

Abstract

Normalization is a prerequisite for almost all follow-up steps in microarray data analysis. Accurate normalization across different experiments and phenotypes assures a common base for comparative yet quantitative studies using gene expression data. In this paper, we report a comparison study of four normalization approaches, namely, linear regression (LR), Loess regression, invariant ranking (IR) and iterative nonlinear regression (INR), for gene expression data normalization. Among these four methods, LR and Loess regression methods use all available genes to estimate either a linear or nonlinear normalization function, while IR and INR methods feature some iterative processes to identify invariantly expressed genes (IEGs) for nonlinear normalization. We tested these normalization approaches on three real microarray data sets and evaluated their performances in terms of variance reduction and fold-change preservation. By comparison, we found that (1) LR method exhibited the worst performance in both variance reduction and fold-change preservation, and (2) INR method showed an improved performance in achieving low expression variance across replicates and excellent fold-change preservation for differentially expressed genes.

Publication types

Comparative Study
Research Support, N.I.H., Extramural
Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Data Interpretation, Statistical*
Gene Expression*
Microarray Analysis / methods*
Phenotype*
Regression Analysis

Abstract

Publication types

MeSH terms

Grants and funding