An empirical comparison of tree-based methods for propensity score estimation
- PMID: 23701015
- PMCID: PMC3796115
- DOI: 10.1111/1475-6773.12068
An empirical comparison of tree-based methods for propensity score estimation
Abstract
Objective: To illustrate the use of ensemble tree-based methods (random forest classification [RFC] and bagging) for propensity score estimation and to compare these methods with logistic regression, in the context of evaluating the effect of physical and occupational therapy on preschool motor ability among very low birth weight (VLBW) children.
Data source: We used secondary data from the Early Childhood Longitudinal Study Birth Cohort (ECLS-B) between 2001 and 2006.
Study design: We estimated the predicted probability of treatment using tree-based methods and logistic regression (LR). We then modeled the exposure-outcome relation using weighted LR models while considering covariate balance and precision for each propensity score estimation method.
Principal findings: Among approximately 500 VLBW children, therapy receipt was associated with moderately improved preschool motor ability. Overall, ensemble methods produced the best covariate balance (Mean Squared Difference: 0.03-0.07) and the most precise effect estimates compared to LR (Mean Squared Difference: 0.11). The overall magnitude of the effect estimates was similar between RFC and LR estimation methods.
Conclusion: Propensity score estimation using RFC and bagging produced better covariate balance with increased precision compared to LR. Ensemble methods are a useful alterative to logistic regression to control confounding in observational studies.
Keywords: Propensity scores; ensemble methods; tree-based methods.
© Health Research and Educational Trust.
Similar articles
-
Preschool motor skills following physical and occupational therapy services among non-disabled very low birth weight children.Matern Child Health J. 2014 May;18(4):821-8. doi: 10.1007/s10995-013-1306-x. Matern Child Health J. 2014. PMID: 23820671
-
A comparison of machine learning algorithms and covariate balance measures for propensity score matching and weighting.Biom J. 2019 Jul;61(4):1049-1072. doi: 10.1002/bimj.201800132. Epub 2019 May 14. Biom J. 2019. PMID: 31090108
-
Should a propensity score model be super? The utility of ensemble procedures for causal adjustment.Stat Med. 2019 Apr 30;38(9):1690-1702. doi: 10.1002/sim.8075. Epub 2018 Dec 26. Stat Med. 2019. PMID: 30586681
-
Introduction to propensity scores.Respirology. 2014 Jul;19(5):625-35. doi: 10.1111/resp.12312. Epub 2014 May 29. Respirology. 2014. PMID: 24889820 Review.
-
[Propensity score methods for creating covariate balance in observational studies].Rev Esp Cardiol. 2011 Oct;64(10):897-903. doi: 10.1016/j.recesp.2011.06.008. Epub 2011 Aug 27. Rev Esp Cardiol. 2011. PMID: 21872981 Review. Spanish.
Cited by
-
Can supervised deep learning architecture outperform autoencoders in building propensity score models for matching?BMC Med Res Methodol. 2024 Aug 2;24(1):167. doi: 10.1186/s12874-024-02284-5. BMC Med Res Methodol. 2024. PMID: 39095707 Free PMC article.
-
Testing the missing at random assumption in generalized linear models in the presence of instrumental variables.Scand Stat Theory Appl. 2024 Mar;51(1):334-354. doi: 10.1111/sjos.12685. Epub 2023 Aug 7. Scand Stat Theory Appl. 2024. PMID: 38370508 Free PMC article.
-
Propensity score adjustment using machine learning classification algorithms to control selection bias in online surveys.PLoS One. 2020 Apr 22;15(4):e0231500. doi: 10.1371/journal.pone.0231500. eCollection 2020. PLoS One. 2020. PMID: 32320429 Free PMC article.
-
Intersections of machine learning and epidemiological methods for health services research.Int J Epidemiol. 2021 Jan 23;49(6):1763-1770. doi: 10.1093/ije/dyaa035. Int J Epidemiol. 2021. PMID: 32236476 Free PMC article.
-
Patterns of care and outcomes for adjuvant treatment of pT3N0 rectal cancer using the National Cancer Database.J Gastrointest Oncol. 2020 Feb;11(1):1-12. doi: 10.21037/jgo.2019.10.02. J Gastrointest Oncol. 2020. PMID: 32175100 Free PMC article.
References
-
- American PsychiatricAssociation. Diagnostic and Statistical Manual of Mental Disorders. Washington, DC: American Psychiatric Association; 2000.
-
- Austin PC. “Propensity-Score Matching in the Cardiovascular Surgery Literature from 2004 to 2006: A Systematic Review and Suggestions for Improvement”. Journal of Thoracic and Cardiovascular Surgery. 2007;134(5):1128–35. - PubMed
-
- Austin PC, Mamdani MM. “A Comparison of Propensity Score Methods: A Case-Study Estimating the Effectiveness of Post-AMI Statin Use”. Statistics in Medicine. 2006;25(12):2084–106. - PubMed
-
- Bang H, Robins JM. “Doubly Robust Estimation in Missing Data and Causal Inference Models”. Biometrics. 2005;61(4):962–73. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
