Chances and challenges of machine learning-based disease classification in genetic association studies illustrated on age-related macular degeneration
- PMID: 32741009
- DOI: 10.1002/gepi.22336
Chances and challenges of machine learning-based disease classification in genetic association studies illustrated on age-related macular degeneration
Abstract
Imaging technology and machine learning algorithms for disease classification set the stage for high-throughput phenotyping and promising new avenues for genome-wide association studies (GWAS). Despite emerging algorithms, there has been no successful application in GWAS so far. We establish machine learning-based phenotyping in genetic association analysis as misclassification problem. To evaluate chances and challenges, we performed a GWAS based on automatically classified age-related macular degeneration (AMD) in UK Biobank (images from 135,500 eyes; 68,400 persons). We quantified misclassification of automatically derived AMD in internal validation data (4,001 eyes; 2,013 persons) and developed a maximum likelihood approach (MLA) to account for it when estimating genetic association. We demonstrate that our MLA guards against bias and artifacts in simulation studies. By combining a GWAS on automatically derived AMD and our MLA in UK Biobank data, we were able to dissect true association (ARMS2/HTRA1, CFH) from artifacts (near HERC2) and identified eye color as associated with the misclassification. On this example, we provide a proof-of-concept that a GWAS using machine learning-derived disease classification yields relevant results and that misclassification needs to be considered in analysis. These findings generalize to other phenotypes and emphasize the utility of genetic data for understanding misclassification structure of machine learning algorithms.
Keywords: UK Biobank; age-related macular degeneration (AMD); genome-wide association study; machine learning-based disease classification; response misclassification.
© 2020 The Authors. Genetic Epidemiology published by Wiley Periodicals LLC.
Similar articles
-
A Deep Phenotype Association Study Reveals Specific Phenotype Associations with Genetic Variants in Age-related Macular Degeneration: Age-Related Eye Disease Study 2 (AREDS2) Report No. 14.Ophthalmology. 2018 Apr;125(4):559-568. doi: 10.1016/j.ophtha.2017.09.023. Epub 2017 Oct 31. Ophthalmology. 2018. PMID: 29096998 Free PMC article. Clinical Trial.
-
Assessment of CFH and HTRA1 polymorphisms in age-related macular degeneration using classic and machine-learning approaches.Ophthalmic Genet. 2020 Dec;41(6):539-547. doi: 10.1080/13816810.2020.1804945. Epub 2020 Aug 24. Ophthalmic Genet. 2020. PMID: 32838591
-
Ongoing controversies and recent insights of the ARMS2-HTRA1 locus in age-related macular degeneration.Exp Eye Res. 2021 Sep;210:108605. doi: 10.1016/j.exer.2021.108605. Epub 2021 Apr 28. Exp Eye Res. 2021. PMID: 33930395 Review.
-
Association Between Perifoveal Drusen Burden Determined by OCT and Genetic Risk in Early and Intermediate Age-Related Macular Degeneration.Invest Ophthalmol Vis Sci. 2019 Oct 1;60(13):4469-4478. doi: 10.1167/iovs.19-27475. Invest Ophthalmol Vis Sci. 2019. PMID: 31658355 Free PMC article.
-
Bringing the age-related macular degeneration high-risk allele age-related maculopathy susceptibility 2 into focus with stem cell technology.Stem Cell Res Ther. 2017 Jun 6;8(1):135. doi: 10.1186/s13287-017-0584-4. Stem Cell Res Ther. 2017. PMID: 28583181 Free PMC article. Review.
Cited by
-
Predicting late-stage age-related macular degeneration by integrating marginally weak SNPs in GWA studies.Front Genet. 2023 Mar 30;14:1075824. doi: 10.3389/fgene.2023.1075824. eCollection 2023. Front Genet. 2023. PMID: 37065470 Free PMC article.
-
Feature Fusion and Detection in Alzheimer's Disease Using a Novel Genetic Multi-Kernel SVM Based on MRI Imaging and Gene Data.Genes (Basel). 2022 May 7;13(5):837. doi: 10.3390/genes13050837. Genes (Basel). 2022. PMID: 35627222 Free PMC article.
-
Genetic Risk Score Analysis Supports a Joint View of Two Classification Systems for Age-Related Macular Degeneration.Invest Ophthalmol Vis Sci. 2023 Sep 1;64(12):31. doi: 10.1167/iovs.64.12.31. Invest Ophthalmol Vis Sci. 2023. PMID: 37721739 Free PMC article.
-
Genome-wide association meta-analysis for early age-related macular degeneration highlights novel loci and insights for advanced disease.BMC Med Genomics. 2020 Aug 26;13(1):120. doi: 10.1186/s12920-020-00760-7. BMC Med Genomics. 2020. PMID: 32843070 Free PMC article.
-
Longitudinal fundus imaging and its genome-wide association analysis provide evidence for a human retinal aging clock.Elife. 2023 Apr 17;12:e82364. doi: 10.7554/eLife.82364. Elife. 2023. PMID: 36975205 Free PMC article.
References
REFERENCES
-
- Brandl, C., Zimmermann, M. E., Günther, F., Barth, T., Olden, M., Schelter, S. C., … Heid, I. M. (2018). On the impact of different approaches to classify age-related macular degeneration: Results from the German AugUR study. Scientific Reports, 8(1), 8675. https://doi.org/10.1038/s41598-018-26629-5
-
- Buniello, A., Macarthur, J. A. L., Cerezo, M., Harris, L. W., Hayhurst, J., Malangone, C., … Parkinson, H. (2019). The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Research, 47(D1), D1005-D1012. https://doi.org/10.1093/nar/gky1120
-
- Burlina, P. M., Joshi, N., Pekala, M., Pacheco, K. D., Freund, D. E., & Bressler, N. M. (2017). Automated grading of age-related macular degeneration from color fundus images using deep convolutional neural networks. JAMA Ophthalmology, 135(11), 1170. https://doi.org/10.1001/jamaophthalmol.2017.3782
-
- Bycroft, C., Freeman, C., Petkova, D., Band, G., Elliott, L. T., Sharp, K., … Marchini, J. (2018). The UK Biobank resource with deep phenotyping and genomic data. Nature, 562(7726), 203-209. https://doi.org/10.1038/s41586-018-0579-z
-
- Carroll, R. J., Ruppert, D., Stefanski, L. A., & Crainiceanu, C. M. (2006). Measurement error in nonlinear models (2nd ed.). Boca Raton, FL: Chapman and Hall/CRC.
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Medical
Miscellaneous
