Copula-based semiparametric regression method for bivariate data under general interval censoring

Biostatistics. 2021 Apr 10;22(2):315-330. doi: 10.1093/biostatistics/kxz032.


This research is motivated by discovering and underpinning genetic causes for the progression of a bilateral eye disease, age-related macular degeneration (AMD), of which the primary outcomes, progression times to late-AMD, are bivariate and interval-censored due to intermittent assessment times. We propose a novel class of copula-based semiparametric transformation models for bivariate data under general interval censoring, which includes the case 1 interval censoring (current status data) and case 2 interval censoring. Specifically, the joint likelihood is modeled through a two-parameter Archimedean copula, which can flexibly characterize the dependence between the two margins in both tails. The marginal distributions are modeled through semiparametric transformation models using sieves, with the proportional hazards or odds model being a special case. We develop a computationally efficient sieve maximum likelihood estimation procedure for the unknown parameters, together with a generalized score test for the regression parameter(s). For the proposed sieve estimators of finite-dimensional parameters, we establish their asymptotic normality and efficiency. Extensive simulations are conducted to evaluate the performance of the proposed method in finite samples. Finally, we apply our method to a genome-wide analysis of AMD progression using the Age-Related Eye Disease Study data, to successfully identify novel risk variants associated with the disease progression. We also produce predicted joint and conditional progression-free probabilities, for patients with different genetic characteristics.

Keywords: Bivariate; Copula; GWAS; Interval-censored; Semiparametric; Sieve.