Fast Bayesian Variable Screenings for Binary Response Regressions with Small Sample Size

J Stat Comput Simul. 2017;87(14):2708-2723. doi: 10.1080/00949655.2017.1341887. Epub 2017 Jun 25.


Screening procedures play an important role in data analysis, especially in high-throughput biological studies where the datasets consist of more covariates than independent subjects. In this article, a Bayesian screening procedure is introduced for the binary response models with logit and probit links. In contrast to many screening rules based on marginal information involving one or a few covariates, the proposed Bayesian procedure simultaneously models all covariates and uses closed-form screening statistics. Specifically, we use the posterior means of the regression coefficients as screening statistics; by imposing a generalized g-prior on the regression coefficients, we derive the analytical form of their posterior means and compute the screening statistics without Markov chain Monte Carlo implementation. We evaluate the utility of the proposed Bayesian screening method using simulations and real data analysis. When the sample size is small, the simulation results suggest improved performance with comparable computational cost.

Keywords: Logistic regression; Probit regression; Sure independence screening; g-prior.