Cox regression survival analysis with compositional covariates: Application to modelling mortality risk from 24-h physical activity patterns

Stat Methods Med Res. 2020 May;29(5):1447-1465. doi: 10.1177/0962280219864125. Epub 2019 Jul 25.

Abstract

Survival analysis is commonly conducted in medical and public health research to assess the association of an exposure or intervention with a hard end outcome such as mortality. The Cox (proportional hazards) regression model is probably the most popular statistical tool used in this context. However, when the exposure includes compositional covariables (that is, variables representing a relative makeup such as a nutritional or physical activity behaviour composition), some basic assumptions of the Cox regression model and associated significance tests are violated. Compositional variables involve an intrinsic interplay between one another which precludes results and conclusions based on considering them in isolation as is ordinarily done. In this work, we introduce a formulation of the Cox regression model in terms of log-ratio coordinates which suitably deals with the constraints of compositional covariates, facilitates the use of common statistical inference methods, and allows for scientifically meaningful interpretations. We illustrate its practical application to a public health problem: the estimation of the mortality hazard associated with the composition of daily activity behaviour (physical activity, sitting time and sleep) using data from the U.S. National Health and Nutrition Examination Survey (NHANES).
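As a minimal sketch of the idea of expressing a composition in log-ratio coordinates before it enters a Cox model, the following assumes isometric log-ratio "pivot" coordinates, which are one common choice; the abstract does not state which coordinate system the authors use. The function names, the four-part split of the day and the coefficient values are all hypothetical and chosen only for illustration. They are not estimates from the NHANES analysis.

```python
import math


def closure(parts):
    """Rescale positive parts so they sum to 1 (relative information only)."""
    total = sum(parts)
    return [p / total for p in parts]


def pivot_coordinates(parts):
    """Map a D-part composition to D-1 real-valued pivot (ilr) coordinates.

    Coordinate j contrasts part j with the geometric mean of the parts
    that follow it. All parts must be strictly positive.
    """
    x = closure(parts)
    coords = []
    for j in range(len(x) - 1):
        rest = x[j + 1:]
        log_gmean_rest = sum(math.log(v) for v in rest) / len(rest)
        scale = math.sqrt(len(rest) / (len(rest) + 1))
        coords.append(scale * (math.log(x[j]) - log_gmean_rest))
    return coords


def log_hazard_ratio(betas, parts, reference):
    """Log hazard ratio of `parts` versus `reference` under a Cox model
    h(t | z) = h0(t) * exp(beta . z), with z the pivot coordinates.
    The baseline hazard h0(t) cancels in the ratio.
    """
    z = pivot_coordinates(parts)
    z0 = pivot_coordinates(reference)
    return sum(b * (a - c) for b, a, c in zip(betas, z, z0))


# Hypothetical minutes per day: activity, light activity, sitting, sleep.
reference_day = [60, 540, 480, 360]
# Reallocate 30 minutes from sitting to activity; the total stays 1440.
modified_day = [90, 540, 450, 360]
# Illustrative coefficients only, not fitted values.
betas = [-0.30, 0.05, 0.10]

lhr = log_hazard_ratio(betas, modified_day, reference_day)
print(f"hazard ratio vs reference: {math.exp(lhr):.3f}")
```

Because the coordinates depend only on ratios between parts, rescaling a day from minutes to hours or proportions leaves them unchanged, and a change in one behaviour is always evaluated together with the compensating change in the others.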

Keywords: Cox regression; NHANES; Survival analysis; accelerometry; compositional data; physical activity; sedentary behaviour; time use.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Exercise*
  • Nutrition Surveys
  • Proportional Hazards Models
  • Regression Analysis