Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Jun;72(2):432-40.
doi: 10.1111/biom.12447. Epub 2015 Nov 17.

A rank-sum test for clustered data when the number of subjects in a group within a cluster is informative

Affiliations
Free PMC article

A rank-sum test for clustered data when the number of subjects in a group within a cluster is informative

Sandipan Dutta et al. Biometrics. 2016 Jun.
Free PMC article

Abstract

The Wilcoxon rank-sum test is a popular nonparametric test for comparing two independent populations (groups). In recent years, there have been renewed attempts in extending the Wilcoxon rank sum test for clustered data, one of which (Datta and Satten, 2005, Journal of the American Statistical Association 100, 908-915) addresses the issue of informative cluster size, i.e., when the outcomes and the cluster size are correlated. We are faced with a situation where the group specific marginal distribution in a cluster depends on the number of observations in that group (i.e., the intra-cluster group size). We develop a novel extension of the rank-sum test for handling this situation. We compare the performance of our test with the Datta-Satten test, as well as the naive Wilcoxon rank sum test. Using a naturally occurring simulation model of informative intra-cluster group size, we show that only our test maintains the correct size. We also compare our test with a classical signed rank test based on averages of the outcome values in each group paired by the cluster membership. While this test maintains the size, it has lower power than our test. Extensions to multiple group comparisons and the case of clusters not having samples from all groups are also discussed. We apply our test to determine whether there are differences in the attachment loss between the upper and lower teeth and between mesial and buccal sites of periodontal patients.

Keywords: Correlated data; Dental data; Nonparametric tests; Wilcoxon rank-sum test; Within-cluster resampling.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Empirical cdf plot of scores at buccal and mesial sites at baseline study
Plot of empirical cumulative distribution functions (ℱ̂3(․)) of attachment scores in buccal and mesial sites at baseline study.
Figure 2
Figure 2. Empirical cdf plot of scores at buccal and mesial sites at 18 months
Plot of empirical cumulative distribution functions (ℱ̂3(․)) of attachment scores in buccal and mesial sites at 18 months.

Similar articles

Cited by

References

    1. Beck JD, Koch GG, Rozier RG, Tudor GE. Prevalence and risk indicators for periodontal attachment loss in a population of older community-dwelling blacks and whites. Journal of Periodontology. 1990;61:521–528. - PubMed
    1. Blazer DG, George LK. ICPSR02744-v1. Inter-university Consortium for Political and Social Research [distributor] Ann Arbor, MI: 2004. Established Populations for Epidemiologic Studies of the Elderly, 1996–1997: Piedmont Health Survey of the Elderly, Fourth In-Person Survey [Durham, Warren, Vance, Granville, and Franklin Counties, North Carolina] [Computer file]
    1. Datta S, Satten GA. Rank-sum tests for clustered data. Journal of the American Statistical Association. 2005;100:908–915.
    1. Datta S, Satten GA. A Signed-rank test for clustered data. Biometrics. 2008;64:501–507. - PubMed
    1. Hájek J, Šidák Z, Sen PK. Theory of Rank Tests. San Diego, CA: Academic Press; 1999.

Publication types