Power of tests for a dichotomous independent variable measured with error
- PMID: 18454782
- PMCID: PMC2442236
- DOI: 10.1111/j.1475-6773.2007.00810.x
Power of tests for a dichotomous independent variable measured with error
Abstract
Objective: To examine the implications for statistical power of using predicted probabilities for a dichotomous independent variable, rather than the actual variable.
Data sources/study setting: An application uses 271,479 observations from the 2000 to 2002 CAHPS Medicare Fee-for-Service surveys. STUDY DESIGN AND DATA: A methodological study with simulation results and a substantive application to previously collected data.
Principle findings: Researchers often must employ key dichotomous predictors that are unobserved but for which predictions exist. We consider three approaches to such data: the classification estimator (1); the direct substitution estimator (2); the partial information maximum likelihood estimator (3, PIMLE). The efficiency of (1) (its power relative to testing with the true variable) roughly scales with the square of one less the classification error. The efficiency of (2) roughly scales with the R(2) for predicting the unobserved dichotomous variable, and is usually more powerful than (1). Approach (3) is most powerful, but for testing differences in means of 0.2-0.5 standard deviations, (2) is typically more than 95 percent as efficient as (3).
Conclusions: The information loss from not observing actual values of dichotomous predictors can be quite large. Direct substitution is easy to implement and interpret and nearly as efficient as the PIMLE.
Similar articles
-
Differences in the structure of CAHPS measures among the medicare fee-for-service, medicare managed care, and privately insured populations.Health Serv Res. 2001 Jul;36(3):489-508. Health Serv Res. 2001. PMID: 11482586 Free PMC article.
-
Likelihood of hospital readmission after first discharge: Medicare Advantage vs. fee-for-service patients.Inquiry. 2012 Fall;49(3):202-13. doi: 10.5034/inquiryjrnl_49.03.01. Inquiry. 2012. PMID: 23230702
-
Comparing post-acute rehabilitation use, length of stay, and outcomes experienced by Medicare fee-for-service and Medicare Advantage beneficiaries with hip fracture in the United States: A secondary analysis of administrative data.PLoS Med. 2018 Jun 26;15(6):e1002592. doi: 10.1371/journal.pmed.1002592. eCollection 2018 Jun. PLoS Med. 2018. PMID: 29944655 Free PMC article.
-
The effect of HMOs on the inpatient utilization of medicare beneficiaries.Health Serv Res. 2004 Oct;39(5):1607-27. doi: 10.1111/j.1475-6773.2004.00306.x. Health Serv Res. 2004. PMID: 15333125 Free PMC article.
-
Segmented regression with errors in predictors: semi-parametric and parametric methods.Stat Med. 1997 Jan 15-Feb 15;16(1-3):169-88. doi: 10.1002/(sici)1097-0258(19970130)16:2<169::aid-sim478>3.0.co;2-m. Stat Med. 1997. PMID: 9004390
Cited by
-
Disaggregating Latino nativity in equity research using electronic health records.Health Serv Res. 2023 Oct;58(5):1119-1130. doi: 10.1111/1475-6773.14154. Epub 2023 Mar 28. Health Serv Res. 2023. PMID: 36978286 Free PMC article.
-
Implications of missingness in self-reported data for estimating racial and ethnic disparities in Medicaid quality measures.Health Serv Res. 2022 Dec;57(6):1370-1378. doi: 10.1111/1475-6773.14025. Epub 2022 Jul 25. Health Serv Res. 2022. PMID: 35802064 Free PMC article.
-
Trends in Missing Race and Ethnicity Information After Imputation in HealthCare.gov Marketplace Enrollment Data, 2015-2021.JAMA Netw Open. 2022 Jun 1;5(6):e2216715. doi: 10.1001/jamanetworkopen.2022.16715. JAMA Netw Open. 2022. PMID: 35687340 Free PMC article.
-
Using Ancillary Sociodemographic Data to Identify Sexual Minority Adults Among Those Responding "Something Else" or "Don't Know" to Sexual Orientation Questions.Med Care. 2019 Dec;57(12):e87-e95. doi: 10.1097/MLR.0000000000001190. Med Care. 2019. PMID: 31415342 Free PMC article.
-
Imputation of race/ethnicity to enable measurement of HEDIS performance by race/ethnicity.Health Serv Res. 2019 Feb;54(1):13-23. doi: 10.1111/1475-6773.13099. Epub 2018 Dec 3. Health Serv Res. 2019. PMID: 30506674 Free PMC article.
References
-
- Bhattacharya J, Goldman D, McCaffrey D. Estimating Probit Models with Self-Selected Treatments. Statistics in Medicine. 2006;25(3):389–413. - PubMed
-
- Dempster A P, Laird N M, Rubin D B. Maximum Likelihood from Incomplete Data via the EM Algorithm. Journal of the Royal Statistical Society, Series B: Methodological. 1977;39(1):1–22.
-
- Fuller W A. Measurement Error Models. New York: John Wiley and Sons; 1987.
-
- Haviland A M, Nagin D S. Causal Inferences with Group Based Trajectory Models. Psychometrika. 2005;70(3):557–78.
-
- Health Services Advisory Group. “The Evaluation of a Mental Component Summary Score Threshold for Depression Risk” Report to the Centers for Medicare and Medicaid Services, Task 5.20 Final Report. 2006. [November 2, 2006]. Available at http://www.hosonline.org.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
