Construction of residential histories to estimate long-term environmental exposures in the California Teachers Study cohort

J Expo Sci Environ Epidemiol. 2024 Mar 6. doi: 10.1038/s41370-023-00631-0. Online ahead of print.

Abstract

Environmental epidemiologic studies using geospatial data often estimate exposure at a participant's residence upon enrollment, but mobility during the exposure period can lead to misclassification. We aimed to mitigate this issue by constructing residential histories for participants in the California Teachers Study through follow-up (1995-2018). Address records have been collected from the US Postal Service, LexisNexis, Experian, and California Cancer Registry. We identified records of the same address based on geo-coordinate distance (≤250 m) and street name similarity. We consolidated addresses, prioritizing those confirmed by participants during follow-up questionnaires, and estimating the duration lived at each address using dates associated with records (e.g., date-first-seen). During 23 years of follow-up, about half of participants moved (48%, including 14% out-of-state). We observed greater mobility among younger women, Hispanic/Latino women, and those in metropolitan and lower socioeconomic status areas. The cumulative proportion of in-state movers remaining eligible for analysis was 21%, 32%, and 41% at 5, 10, and 20 years post enrollment, respectively. Using self-reported information collected 10 years after enrollment, we correctly identified 94% of movers and 95% of non-movers as having moved or not moved from their enrollment address. This dataset provides a foundation for estimating long-term environmental exposures in diverse epidemiologic studies in this cohort. IMPACT: Our efforts in constructing residential histories for California Teachers Study participants through follow-up (1995-2018) benefit future environmental epidemiologic studies. Address availability during the exposure period can mitigate misclassification due to residential changes, especially when evaluating long-term exposures and chronic health outcomes. This can reduce differential misclassification among more mobile subgroups, including younger women and those from lower socioeconomic and urban areas. Our approach to consolidating addresses from multiple sources showed high accuracy in comparison to self-reported residential information. The residential dataset produced from this analysis provides a valuable tool for future studies, ultimately enhancing our understanding of environmental health impacts.

Keywords: Analytic methods; Epidemiology; Geospatial analyses; Health studies.