Partial least squares: a versatile tool for the analysis of high-dimensional genomic data

Brief Bioinform. 2007 Jan;8(1):32-44. doi: 10.1093/bib/bbl016. Epub 2006 May 26.


Partial least squares (PLS) is an efficient statistical regression technique that is highly suited for the analysis of genomic and proteomic data. In this article, we review both the theory underlying PLS as well as a host of bioinformatics applications of PLS. In particular, we provide a systematic comparison of the PLS approaches currently employed, and discuss analysis problems as diverse as, e.g. tumor classification from transcriptome data, identification of relevant genes, survival analysis and modeling of gene networks and transcription factor activities.

Publication types

  • Review

MeSH terms

  • Animals
  • Computational Biology / methods*
  • Data Interpretation, Statistical
  • Genomics / statistics & numerical data*
  • Humans
  • Least-Squares Analysis*
  • Models, Statistical
  • Multivariate Analysis
  • Software
  • Survival Analysis