Epigenetic aging clocks are computational models that predict age from DNA methylation data. First-generation clocks were built to make predictions using CpGs whose methylation changes with age. Next-generation clocks were later created using CpGs that relate to both age and health. Since existing next-generation clocks were constructed using blood samples, we sought to develop a next-generation clock optimized for prediction in cheek swabs, which are non-invasive and easy to collect. To do this, we collected MethylationEPIC data as well as lifestyle and health information from 8045 diverse adults. We combined a novel simulated annealing approach, which allowed us to incorporate lifestyle and health factors into training, with CpG filtering, CpG clustering, and clock ensembling to construct CheekAge. This epigenetic aging clock correlates strongly with age, displays high test-retest reproducibility across replicates, and significantly associates with many lifestyle and health factors, such as BMI, smoking status, and alcohol intake. We validated CheekAge in an internal dataset and in multiple publicly available datasets, including samples from patients with progeria or meningioma. In addition to exploring the underlying biology of the data and the clock, we provide a free online tool that allows users to mine our methylomic data and predict epigenetic age.
Keywords: Aging clock; Buccal; Ensemble learning; Epigenetic age; Machine learning; Simulated annealing.
© 2024. The Author(s).
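To make the simulated annealing idea in the abstract concrete, the following is a minimal sketch of annealing-based feature (CpG) selection in Python. It is not the CheekAge training procedure: the data are synthetic, the function names (`fit_error`, `anneal_select`), the cooling schedule, and the objective (in-sample mean absolute error of a least-squares age fit) are all assumptions made for illustration, and the lifestyle and health terms that the authors incorporated into training are omitted here.

```python
import math
import numpy as np


def fit_error(X, y, subset):
    """In-sample mean absolute error of a least-squares fit on the chosen columns."""
    A = np.column_stack([X[:, subset], np.ones(len(y))])  # add an intercept column
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return float(np.mean(np.abs(A @ coef - y)))


def anneal_select(X, y, k, steps=500, t_start=1.0, t_end=0.01, seed=0):
    """Hypothetical helper: pick k columns of X by simulated annealing."""
    n_features = X.shape[1]
    if not 0 < k < n_features:
        raise ValueError("k must be between 1 and n_features - 1")
    rng = np.random.default_rng(seed)

    # Start from a random subset of k features.
    current = list(rng.choice(n_features, size=k, replace=False))
    current_err = fit_error(X, y, current)
    best, best_err = list(current), current_err

    for step in range(steps):
        # Geometric cooling from t_start down to t_end (an assumed schedule).
        temp = t_start * (t_end / t_start) ** (step / max(steps - 1, 1))

        # Propose a neighbor: swap one selected feature for an unselected one.
        outside = [j for j in range(n_features) if j not in current]
        candidate = list(current)
        candidate[rng.integers(k)] = outside[rng.integers(len(outside))]
        cand_err = fit_error(X, y, candidate)

        # Always accept improvements; accept worse moves with Metropolis probability.
        delta = cand_err - current_err
        if delta <= 0 or rng.random() < math.exp(-delta / temp):
            current, current_err = candidate, cand_err
            if current_err < best_err:
                best, best_err = list(current), current_err

    return sorted(int(j) for j in best), best_err


# Synthetic stand-in for methylation data: 200 samples, 30 "CpGs",
# with "age" driven by three of the columns plus noise.
demo_rng = np.random.default_rng(42)
X = demo_rng.normal(size=(200, 30))
age = 50 + 3 * X[:, 2] - 2 * X[:, 7] + 1.5 * X[:, 11] + demo_rng.normal(scale=0.5, size=200)

selected, best_err = anneal_select(X, age, k=3, seed=0)
print("selected columns:", selected, "in-sample MAE:", round(best_err, 3))
```

Accepting occasional worse subsets early, while the temperature is high, lets the search escape local optima that a purely greedy swap procedure would get stuck in. A more faithful objective would presumably add terms for association with lifestyle and health variables, but their form is not given in the abstract.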