Marginal regression models for clustered count data based on zero-inflated Conway-Maxwell-Poisson distribution with applications

Biometrics. 2016 Jun;72(2):606-18. doi: 10.1111/biom.12436. Epub 2015 Nov 17.

Abstract

Community water fluoridation is an important public health measure to prevent dental caries, but it continues to be somewhat controversial. The Iowa Fluoride Study (IFS) is a longitudinal study on a cohort of Iowa children that began in 1991. The main purposes of this study (http://www.dentistry.uiowa.edu/preventive-fluoride-study) were to quantify fluoride exposures from both dietary and nondietary sources and to associate longitudinal fluoride exposures with dental fluorosis (spots on teeth) and dental caries (cavities). We analyze a subset of the IFS data by a marginal regression model with a zero-inflated version of the Conway-Maxwell-Poisson distribution for count data exhibiting excessive zeros and a wide range of dispersion patterns. In general, we introduce two estimation methods for fitting a ZICMP marginal regression model. Finite sample behaviors of the estimators and the resulting confidence intervals are studied using extensive simulation studies. We apply our methodologies to the dental caries data. Our novel modeling incorporating zero inflation, clustering, and overdispersion sheds some new light on the effect of community water fluoridation and other factors. We also include a second application of our methodology to a genomic (next-generation sequencing) dataset that exhibits underdispersion.

Keywords: Bootstrap; Caries data; Expectation-solution algorithm; Generalized estimating equation; Generalized linear model; Genomics; Iowa Fluoride Study.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Biometry / methods
  • Cluster Analysis
  • Computer Simulation
  • Confidence Intervals
  • Data Interpretation, Statistical*
  • Drinking Water
  • Fluoridation
  • Genomics
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Models, Statistical*
  • Poisson Distribution
  • Regression Analysis*

Substances

  • Drinking Water