Testing for independence in J×K contingency tables with complex sample survey data
- PMID: 25762089
- PMCID: PMC4567525
- DOI: 10.1111/biom.12297
Testing for independence in J×K contingency tables with complex sample survey data
Abstract
The test of independence of row and column variables in a (J×K) contingency table is a widely used statistical test in many areas of application. For complex survey samples, use of the standard Pearson chi-squared test is inappropriate due to correlation among units within the same cluster. Rao and Scott (1981, Journal of the American Statistical Association 76, 221-230) proposed an approach in which the standard Pearson chi-squared statistic is multiplied by a design effect to adjust for the complex survey design. Unfortunately, this test fails to exist when one of the observed cell counts equals zero. Even with the large samples typical of many complex surveys, zero cell counts can occur for rare events, small domains, or contingency tables with a large number of cells. Here, we propose Wald and score test statistics for independence based on weighted least squares estimating equations. In contrast to the Rao-Scott test statistic, the proposed Wald and score test statistics always exist. In simulations, the score test is found to perform best with respect to type I error. The proposed method is motivated by, and applied to, post surgical complications data from the United States' Nationwide Inpatient Sample (NIS) complex survey of hospitals in 2008.
Keywords: Chi-squared test; Nationwide Inpatient Sample; Score statistic; Wald statistic; Weighted estimating equations.
© 2015, The International Biometric Society.
Similar articles
-
The score test for independence in R x C contingency tables with missing data.Biometrics. 1996 Jun;52(2):751-62. Biometrics. 1996. PMID: 8672711
-
A simple test of association for contingency tables with multiple column responses.Biometrics. 2000 Sep;56(3):893-6. doi: 10.1111/j.0006-341x.2000.00893.x. Biometrics. 2000. PMID: 10985233
-
Comparison of tests of contingency tables.J Biopharm Stat. 2017;27(5):784-796. doi: 10.1080/10543406.2016.1269786. Epub 2017 Jan 27. J Biopharm Stat. 2017. PMID: 27936354
-
[Statistical analysis of pharmacological data: use of cumulative chi-squared statistic].Nihon Yakurigaku Zasshi. 1997 Dec;110(6):341-6. doi: 10.1254/fpj.110.341. Nihon Yakurigaku Zasshi. 1997. PMID: 9503392 Review. Japanese.
-
Issues in biomedical statistics: analysing 2 x 2 tables of frequencies.Aust N Z J Surg. 1994 Nov;64(11):780-7. doi: 10.1111/j.1445-2197.1994.tb04539.x. Aust N Z J Surg. 1994. PMID: 7945088 Review.
Cited by
-
Relationship between health literacy and health-related quality of life in Korean adults with chronic diseases.PLoS One. 2024 Apr 18;19(4):e0301894. doi: 10.1371/journal.pone.0301894. eCollection 2024. PLoS One. 2024. PMID: 38635779 Free PMC article.
-
Influence of biopsychosocial factors on self-reported anxiety/depression symptoms among first-generation immigrant population in the U.S.BMC Public Health. 2024 Mar 15;24(1):819. doi: 10.1186/s12889-024-18336-w. BMC Public Health. 2024. PMID: 38491362 Free PMC article.
-
Factors Associated with Military Sexual Trauma (MST) Disclosure During VA Screening Among Women Veterans.J Gen Intern Med. 2023 Nov;38(14):3188-3197. doi: 10.1007/s11606-023-08257-6. Epub 2023 Jun 8. J Gen Intern Med. 2023. PMID: 37291361 Free PMC article.
-
Food insecurity and its impact on substance use and suicidal behaviours among school-going adolescents in Africa: evidence from the Global School-Based Student Health Survey.Eur Child Adolesc Psychiatry. 2024 Feb;33(2):467-480. doi: 10.1007/s00787-023-02168-x. Epub 2023 Mar 2. Eur Child Adolesc Psychiatry. 2024. PMID: 36859592
-
Determinants of losses in the tuberculosis infection cascade of care among children and adolescent contacts of pulmonary tuberculosis cases: A Brazilian multi-centre longitudinal study.Lancet Reg Health Am. 2022 Nov;15:100358. doi: 10.1016/j.lana.2022.100358. Epub 2022 Aug 23. Lancet Reg Health Am. 2022. PMID: 36438860 Free PMC article.
References
-
- Agresti A. Categorical Data Analysis. 3. New York: Wiley; 2013.
-
- Amemiya T. Advanced Econometrics. Harvard University Press; 1985.
-
- Aitchison J, Silvey SD. Maximum-likelihood estimation of parameters subject to restraints. Ann Math Stat. 1958;29:813–828.
-
- Bera AK, Bilias Y. Raos score, Neymans C(α) and Silveys LM tests: An essay on historical developments and some new results. Journal of Statistical Planning and Inference. 2001;97:9–44.
-
- Boos DD. On generalized score tests. The American Statistician. 1992;46:327–333.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources

