Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jul 15;35(14):i474-i483.
doi: 10.1093/bioinformatics/btz320.

Identifying progressive imaging genetic patterns via multi-task sparse canonical correlation analysis: a longitudinal study of the ADNI cohort

Affiliations
Free PMC article

Identifying progressive imaging genetic patterns via multi-task sparse canonical correlation analysis: a longitudinal study of the ADNI cohort

Lei Du et al. Bioinformatics. .
Free PMC article

Abstract

Motivation: Identifying the genetic basis of the brain structure, function and disorder by using the imaging quantitative traits (QTs) as endophenotypes is an important task in brain science. Brain QTs often change over time while the disorder progresses and thus understanding how the genetic factors play roles on the progressive brain QT changes is of great importance and meaning. Most existing imaging genetics methods only analyze the baseline neuroimaging data, and thus those longitudinal imaging data across multiple time points containing important disease progression information are omitted.

Results: We propose a novel temporal imaging genetic model which performs the multi-task sparse canonical correlation analysis (T-MTSCCA). Our model uses longitudinal neuroimaging data to uncover that how single nucleotide polymorphisms (SNPs) play roles on affecting brain QTs over the time. Incorporating the relationship of the longitudinal imaging data and that within SNPs, T-MTSCCA could identify a trajectory of progressive imaging genetic patterns over the time. We propose an efficient algorithm to solve the problem and show its convergence. We evaluate T-MTSCCA on 408 subjects from the Alzheimer's Disease Neuroimaging Initiative database with longitudinal magnetic resonance imaging data and genetic data available. The experimental results show that T-MTSCCA performs either better than or equally to the state-of-the-art methods. In particular, T-MTSCCA could identify higher canonical correlation coefficients and capture clearer canonical weight patterns. This suggests that T-MTSCCA identifies time-consistent and time-dependent SNPs and imaging QTs, which further help understand the genetic basis of the brain QT changes over the time during the disease progression.

Availability and implementation: The software and simulation data are publicly available at https://github.com/dulei323/TMTSCCA.

Supplementary information: Supplementary data are available at Bioinformatics online.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Comparison of the mean canonical correlation coefficients (CCCs) obtained from 10-fold cross-validation trials. The CCCs at all time points are illustrated, where CCC_T1 is calculated between the SNP data and the imaging data at T1, and so forth.
Fig. 2.
Fig. 2.
Heat maps of canonical weights on synthetic data. Rows 1–4: Ground truth, mSCCA, TGSCCA and T-MTSCCA, respectively. In each row, U is on the left panel and V is on the right. Within each panel, there are four canonical weights associating with four time points. For our method, there are four canonical weights corresponding to four time points, i.e. T1, T2, T3, T4, for both U and V. For mSCCA and TGSCCA, the weight matrix U is stacked by the same canonical weight vector u.
Fig. 3.
Fig. 3.
Comparison of the mean canonical correlation coefficients (CCCs) obtained from 10-fold testing trials on ADNI.
Fig. 4.
Fig. 4.
Comparison of canonical weights in terms of SNPs averaged from 10-fold cross-validation trials. Each row corresponds to a SCCA method: (1) mSCCA; (2) TGSCCA and (3) T-MTSCCA. For our method, there are four rows corresponding to four time points, i.e. BL, M6, M12 and M24. For mSCCA and TGSCCA, the four rows are stacked by the same u.
Fig. 5.
Fig. 5.
Comparison of canonical weights in terms of each imaging QTs averaged from 10-fold cross-validation trials. Each row corresponds to a SCCA method: (1) mSCCA; (2) TGSCCA and (3) T-MTSCCA. Within each panel, there are four rows corresponding to four time points of imaging QTs, i.e. BL, M6, M12 and M24.

Similar articles

Cited by

References

    1. Argyriou A. et al. (2006) Multi-task feature learning. Adv. Neural Inform. Proc. Syst., 7341, 48.
    1. Ashburner J., Friston K.J. (2000) Voxel-based morphometry–the methods. Neuroimage, 11, 805–821. - PubMed
    1. Bakkour A. et al. (2013) The effects of aging and Alzheimer’s disease on cerebral cortical anatomy: specificity and differential relationships with cognition. Neuroimage, 76, 332–344. - PMC - PubMed
    1. Carlsson C.M. (2010) Type 2 diabetes mellitus, dyslipidemia, and Alzheimer’s disease. J. Alzheimers Dis., 20, 711–722. - PMC - PubMed
    1. Chen J., Bushman F.D. (2013) Structure-constrained sparse canonical correlation analysis with an application to microbiome data analysis. Biostatistics, 14, 244–258. - PMC - PubMed

Publication types