A Confidence Region Approach to Tuning for Variable Selection

Funda Gunes; Howard D Bondell

doi:10.1080/10618600.2012.679890

A Confidence Region Approach to Tuning for Variable Selection

J Comput Graph Stat. 2012;21(2):295-314. doi: 10.1080/10618600.2012.679890. Epub 2012 Jun 14.

Authors

Funda Gunes¹, Howard D Bondell

Affiliation

¹ Department of Statistics, North Carolina State University.

Abstract

We develop an approach to tuning of penalized regression variable selection methods by calculating the sparsest estimator contained in a confidence region of a specified level. Because confidence intervals/regions are generally understood, tuning penalized regression methods in this way is intuitive and more easily understood by scientists and practitioners. More importantly, our work shows that tuning to a fixed confidence level often performs better than tuning via the common methods based on AIC, BIC, or cross-validation (CV) over a wide range of sample sizes and levels of sparsity. Additionally, we prove that by tuning with a sequence of confidence levels converging to one, asymptotic selection consistency is obtained; and with a simple two-stage procedure, an oracle property is achieved. The confidence region based tuning parameter is easily calculated using output from existing penalized regression computer packages.Our work also shows how to map any penalty parameter to a corresponding confidence coefficient. This mapping facilitates comparisons of tuning parameter selection methods such as AIC, BIC and CV, and reveals that the resulting tuning parameters correspond to confidence levels that are extremely low, and can vary greatly across data sets. Supplemental materials for the article are available online.

Keywords: Adaptive LASSO; Confidence region; Penalized regression; Tuning parameter; Variable selection.

Abstract

Grants and funding