State of the art in selection of variables and functional forms in multivariable analysis-outstanding issues
- PMID: 32266321
- PMCID: PMC7114804
- DOI: 10.1186/s41512-020-00074-3
State of the art in selection of variables and functional forms in multivariable analysis-outstanding issues
Abstract
Background: How to select variables and identify functional forms for continuous variables is a key concern when creating a multivariable model. Ad hoc 'traditional' approaches to variable selection have been in use for at least 50 years. Similarly, methods for determining functional forms for continuous variables were first suggested many years ago. More recently, many alternative approaches to address these two challenges have been proposed, but knowledge of their properties and meaningful comparisons between them are scarce. To define a state of the art and to provide evidence-supported guidance to researchers who have only a basic level of statistical knowledge, many outstanding issues in multivariable modelling remain. Our main aims are to identify and illustrate such gaps in the literature and present them at a moderate technical level to the wide community of practitioners, researchers and students of statistics.
Methods: We briefly discuss general issues in building descriptive regression models, strategies for variable selection, different ways of choosing functional forms for continuous variables and methods for combining the selection of variables and functions. We discuss two examples, taken from the medical literature, to illustrate problems in the practice of modelling.
Results: Our overview revealed that there is not yet enough evidence on which to base recommendations for the selection of variables and functional forms in multivariable analysis. Such evidence may come from comparisons between alternative methods. In particular, we highlight seven important topics that require further investigation and make suggestions for the direction of further research.
Conclusions: Selection of variables and of functional forms are important topics in multivariable analysis. To define a state of the art and to provide evidence-supported guidance to researchers who have only a basic level of statistical knowledge, further comparative research is required.
Keywords: Bias; Categorisation; Descriptive modelling; Empirical evidence; Fractional polynomials; Methods for variable selection; STRATOS initiative; Shrinkage; Spline procedures.
© The Author(s) 2020.
Conflict of interest statement
Competing interestsThe authors declare that they have no competing interests.
Similar articles
-
Selection of important variables and determination of functional form for continuous predictors in multivariable model building.Stat Med. 2007 Dec 30;26(30):5512-28. doi: 10.1002/sim.3148. Stat Med. 2007. PMID: 18058845
-
Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2. Phys Biol. 2013. PMID: 23912807
-
Rethinking Giftedness and Gifted Education: A Proposed Direction Forward Based on Psychological Science.Psychol Sci Public Interest. 2011 Jan;12(1):3-54. doi: 10.1177/1529100611418056. Psychol Sci Public Interest. 2011. PMID: 26168418
-
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification.In: Kobeissy FH, editor. Brain Neurotrauma: Molecular, Neuropsychological, and Rehabilitation Aspects. Boca Raton (FL): CRC Press/Taylor & Francis; 2015. Chapter 25. In: Kobeissy FH, editor. Brain Neurotrauma: Molecular, Neuropsychological, and Rehabilitation Aspects. Boca Raton (FL): CRC Press/Taylor & Francis; 2015. Chapter 25. PMID: 26269925 Free Books & Documents. Review.
-
Causal Model Building in the Context of Cardiac Rehabilitation: A Systematic Review.Int J Environ Res Public Health. 2023 Feb 11;20(4):3182. doi: 10.3390/ijerph20043182. Int J Environ Res Public Health. 2023. PMID: 36833877 Free PMC article. Review.
Cited by
-
Replicability and reproducibility of predictive models for diagnosis of depression among young adults using Electronic Health Records.Diagn Progn Res. 2023 Dec 5;7(1):25. doi: 10.1186/s41512-023-00160-2. Diagn Progn Res. 2023. PMID: 38049919 Free PMC article.
-
Predictors and outcomes of cardiac dyssynchrony among patients with heart failure attending Benjamin Mkapa Hospital in Dodoma, central Tanzania: A protocol of prospective-longitudinal study.PLoS One. 2023 Nov 17;18(11):e0287813. doi: 10.1371/journal.pone.0287813. eCollection 2023. PLoS One. 2023. PMID: 37976266 Free PMC article.
-
An enhanced version of FREM (Fracture Risk Evaluation Model) using national administrative health data: analysis protocol for development and validation of a multivariable prediction model.Diagn Progn Res. 2023 Oct 3;7(1):19. doi: 10.1186/s41512-023-00158-w. Diagn Progn Res. 2023. PMID: 37784165 Free PMC article.
-
Combining glucose and high-sensitivity cardiac troponin in the early diagnosis of acute myocardial infarction.Sci Rep. 2023 Sep 5;13(1):14598. doi: 10.1038/s41598-023-37093-1. Sci Rep. 2023. PMID: 37670005 Free PMC article.
-
Pediatric prognostic models predicting inhospital child mortality in resource-limited settings: An external validation study.Health Sci Rep. 2023 Aug 27;6(8):e1433. doi: 10.1002/hsr2.1433. eCollection 2023 Aug. Health Sci Rep. 2023. PMID: 37645032 Free PMC article.
References
-
- Abrahamowicz M, du Berger R, Grover SA. Flexible modelling of the effects of serum cholesterol on coronary heart disease mortality. Am J Epidemiol. 1997;145:714–729. - PubMed
-
- Altman DG, Andersen PK. Bootstrap investigation of the stability of a Cox regression model. Stat Med. 1989;8:771–783. - PubMed
-
- Altman DG, Lausen B, Sauerbrei W, Schumacher M. The dangers of using ‘optimal’cutpoints in the evaluation of prognostic factors. J Nat Cancer Inst. 1994;86:829–835. - PubMed
-
- Antoniadis A, Gijbels I, Verhasselt A. Variable selection in additive models using P-splines. Technometrics. 2012;54:425–438.
Publication types
LinkOut - more resources
Full Text Sources
Other Literature Sources