Cross-validated Cox regression on microarray gene expression data

Stat Med. 2006 Sep 30;25(18):3201-16. doi: 10.1002/sim.2353.


This paper describes how penalized Cox regression, in combination with cross-validated partial likelihood can be employed to obtain reliable survival prediction models for high dimensional microarray data. The suggested procedure is demonstrated on a breast cancer survival data set consisting of 295 tumours as collected in the National Cancer Institute in Amsterdam and previously reported in more general papers. The main aim of this paper it to show how generally accepted biostatistical procedures can be employed to analyse high-dimensional data.

MeSH terms

  • Breast Neoplasms / genetics
  • Carcinoma / genetics
  • Data Interpretation, Statistical*
  • Female
  • Gene Expression
  • Humans
  • Kaplan-Meier Estimate
  • Middle Aged
  • Netherlands
  • Oligonucleotide Array Sequence Analysis / methods*
  • Predictive Value of Tests
  • Proportional Hazards Models*