Variable Selection for Time-to-Event Data

Ai Ni; Chi Song

doi:10.1007/978-1-0716-0849-4_5

Methods Mol Biol. 2021:2194:61-76. doi: 10.1007/978-1-0716-0849-4_5.

Authors

Ai Ni¹, Chi Song²

Affiliations

¹ Division of Biostatistics, College of Public Health, The Ohio State University, Columbus, OH, USA. ni.304@osu.edu.
² Division of Biostatistics, College of Public Health, The Ohio State University, Columbus, OH, USA. song.1188@osu.edu.

PMID: 32926362

Abstract

With the increasing availability of large-scale biomedical and -omics data, researchers have unprecedented opportunities to discover novel biomarkers for clinical outcomes. At the same time, they face the considerable challenge of accurately identifying important biomarkers among numerous candidates. Many novel statistical methodologies have been developed over the past couple of decades to tackle this challenge. When the clinical outcome is a time-to-event variable, special statistical methods are needed because of censoring. In this article, we review some of the most commonly used modern statistical methodologies for variable selection with time-to-event data. The reviewed methods fall into three broad categories: filter-test-based methods, penalized regression methods, and machine learning methods.

Keywords: Filter test; Machine learning; Penalized regression; Time-to-event data; Variable selection.

Publication types

Review

MeSH terms

Algorithms
Biomarkers / analysis
Genomics*
Humans
Machine Learning*
Principal Component Analysis / methods
Proportional Hazards Models
Regression Analysis

Substances

Biomarkers