A rank-sum test for clustered data when the number of subjects in a group within a cluster is informative
- PMID: 26575695
- PMCID: PMC4870168
- DOI: 10.1111/biom.12447
Abstract
The Wilcoxon rank-sum test is a popular nonparametric test for comparing two independent populations (groups). In recent years, there have been renewed attempts to extend the Wilcoxon rank-sum test to clustered data, one of which (Datta and Satten, 2005, Journal of the American Statistical Association 100, 908-915) addresses the issue of informative cluster size, i.e., when the outcomes and the cluster size are correlated. We are faced with a situation where the group-specific marginal distribution in a cluster depends on the number of observations in that group (i.e., the intra-cluster group size). We develop a novel extension of the rank-sum test to handle this situation. We compare the performance of our test with that of the Datta-Satten test and of the naive Wilcoxon rank-sum test. Using a naturally occurring simulation model of informative intra-cluster group size, we show that only our test maintains the correct size. We also compare our test with a classical signed-rank test based on averages of the outcome values in each group, paired by cluster membership. Although this signed-rank test maintains the size, it has lower power than our test. Extensions to multiple-group comparisons and to the case of clusters not having samples from all groups are also discussed. We apply our test to determine whether attachment loss differs between the upper and lower teeth, and between mesial and buccal sites, of periodontal patients.
Keywords: Correlated data; Dental data; Nonparametric tests; Wilcoxon rank-sum test; Within-cluster resampling.
© 2015, The International Biometric Society.
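For orientation, the comparison procedure mentioned in the abstract (a signed-rank test on within-cluster group averages, paired by cluster) can be sketched as follows. This is a minimal illustration only, not the authors' proposed rank-sum test. The function names, the `(cluster, group, value)` record layout, the normal approximation with tie-corrected variance, and the choice to drop clusters missing a group are all assumptions made for the sketch.

```python
import math
from collections import defaultdict


def cluster_mean_differences(records):
    """Per-cluster difference of group means: mean(group 1) - mean(group 0).

    `records` is an iterable of (cluster_id, group, value) with group in {0, 1}.
    Clusters lacking observations from either group are skipped (an assumption
    of this sketch; the abstract treats that case as a separate extension).
    """
    totals = defaultdict(lambda: [[0.0, 0], [0.0, 0]])  # [sum, count] per group
    for cluster, group, value in records:
        totals[cluster][group][0] += value
        totals[cluster][group][1] += 1
    diffs = []
    for cluster in sorted(totals):
        (s0, n0), (s1, n1) = totals[cluster]
        if n0 > 0 and n1 > 0:
            diffs.append(s1 / n1 - s0 / n0)
    return diffs


def signed_rank_test(diffs):
    """Wilcoxon signed-rank test using a normal approximation.

    Zero differences are dropped and tied absolute values receive average
    ranks. Returns (w_plus, z, two_sided_p).
    """
    d = [x for x in diffs if x != 0]
    n = len(d)
    if n == 0:
        raise ValueError("no nonzero paired differences")
    order = sorted(range(n), key=lambda i: abs(d[i]))
    ranks = [0.0] * n
    tie_term = 0.0
    i = 0
    while i < n:
        j = i
        # Extend j over the run of tied absolute differences.
        while j + 1 < n and abs(d[order[j + 1]]) == abs(d[order[i]]):
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = average_rank
        t = j - i + 1
        tie_term += t**3 - t
        i = j + 1
    w_plus = sum(r for r, x in zip(ranks, d) if x > 0)
    mean = n * (n + 1) / 4
    variance = n * (n + 1) * (2 * n + 1) / 24 - tie_term / 48
    z = (w_plus - mean) / math.sqrt(variance)
    p = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal tail probability
    return w_plus, z, p


# Hypothetical toy data: cluster "d" has no group-0 observation and is skipped.
records = [
    ("a", 0, 1.0), ("a", 0, 3.0), ("a", 1, 5.0),
    ("b", 0, 2.0), ("b", 1, 4.0), ("b", 1, 8.0),
    ("c", 0, 6.0), ("c", 1, 5.0),
    ("d", 1, 9.0),
]
diffs = cluster_mean_differences(records)
w_plus, z, p = signed_rank_test(diffs)
```

With only a handful of clusters, as in the toy data, an exact null distribution would be more appropriate than the normal approximation used here.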
References
- Beck JD, Koch GG, Rozier RG, Tudor GE. Prevalence and risk indicators for periodontal attachment loss in a population of older community-dwelling blacks and whites. Journal of Periodontology. 1990;61:521–528.
- Blazer DG, George LK. Established Populations for Epidemiologic Studies of the Elderly, 1996–1997: Piedmont Health Survey of the Elderly, Fourth In-Person Survey [Durham, Warren, Vance, Granville, and Franklin Counties, North Carolina] [Computer file]. ICPSR02744-v1. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor]; 2004.
- Datta S, Satten GA. Rank-sum tests for clustered data. Journal of the American Statistical Association. 2005;100:908–915.
- Datta S, Satten GA. A signed-rank test for clustered data. Biometrics. 2008;64:501–507.
- Hájek J, Šidák Z, Sen PK. Theory of Rank Tests. San Diego, CA: Academic Press; 1999.
