Phenotype Prediction and Genome-Wide Association Study Using Deep Convolutional Neural Network of Soybean
- PMID: 31824557
- PMCID: PMC6883005
- DOI: 10.3389/fgene.2019.01091
Abstract
Genomic selection uses single-nucleotide polymorphisms (SNPs) to predict quantitative phenotypes so that traits can be enhanced in breeding populations, and it has been widely used to increase breeding efficiency in plants and animals. Existing statistical methods assume a prior distribution for the effects of imputed genotypes, and this assumption may not fit experimental datasets. Deep learning could serve as a powerful tool both to predict quantitative phenotypes without imputation and to discover potentially associated genotype markers efficiently. We propose a deep learning framework that uses convolutional neural networks (CNNs) to predict quantitative traits from SNPs and uses saliency maps to investigate genotype contributions to the trait. Missing SNP values are treated as an additional genotype in the input to the deep learning model. We tested our framework on both simulated data and experimental soybean datasets. The results show that the deep learning model can bypass the imputation of missing values and predict quantitative phenotypes more accurately than other well-known statistical methods currently available. It can also effectively and efficiently identify significant SNP markers and SNP combinations associated with the trait in a genome-wide association study.
Keywords: deep learning; genome-wide association study; genomic selection; genotype contribution; soybean.
Copyright © 2019 Liu, Wang, He, Wang, Joshi and Xu.
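As an illustration of the abstract's statement that missing SNP values are treated as a new genotype, the following is a minimal sketch in plain Python. The abstract does not specify the encoding, so everything here is an assumption: the genotype codes (0, 1, 2 for alternate-allele counts, `None` for a missing call), the four-slot one-hot layout, and the function name `one_hot_genotypes` are hypothetical and are not taken from the authors' implementation.

```python
from typing import List, Optional

# Assumed genotype coding (hypothetical, not from the paper):
# 0, 1, 2 = count of the alternate allele; None = missing call.
CATEGORIES = (0, 1, 2, None)


def one_hot_genotypes(genotypes: List[Optional[int]]) -> List[List[int]]:
    """Encode each SNP call as a 4-slot indicator vector.

    The last slot marks a missing call, so no imputation is needed
    before the data are passed to a model.
    """
    index = {category: i for i, category in enumerate(CATEGORIES)}
    encoded = []
    for call in genotypes:
        if call not in index:
            raise ValueError(f"unexpected genotype code: {call!r}")
        row = [0] * len(CATEGORIES)
        row[index[call]] = 1
        encoded.append(row)
    return encoded


# One individual with four SNPs, the third of which is missing.
sample = [0, 2, None, 1]
matrix = one_hot_genotypes(sample)
for call, row in zip(sample, matrix):
    print(call, row)
```

With an encoding of this kind, a missing call occupies its own input channel instead of being replaced by an imputed value, which is one way a network could learn from the pattern of missingness directly.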
Similar articles
- Accuracy of prediction of simulated polygenic phenotypes and their underlying quantitative trait loci genotypes using real or imputed whole-genome markers in cattle. Genet Sel Evol. 2015 Dec 23;47:99. doi: 10.1186/s12711-015-0179-4. PMID: 26698091 Free PMC article.
- Sparse Convolutional Denoising Autoencoders for Genotype Imputation. Genes (Basel). 2019 Aug 28;10(9):652. doi: 10.3390/genes10090652. PMID: 31466333 Free PMC article.
- Design of a low-density SNP chip for the main Australian sheep breeds and its effect on imputation and genomic prediction accuracy. Anim Genet. 2015 Oct;46(5):544-56. doi: 10.1111/age.12340. Epub 2015 Sep 11. PMID: 26360638
- Genetics of complex traits: prediction of phenotype, identification of causal polymorphisms and genetic architecture. Proc Biol Sci. 2016 Jul 27;283(1835):20160569. doi: 10.1098/rspb.2016.0569. PMID: 27440663 Free PMC article. Review.
- A Guide for Using Deep Learning for Complex Trait Genomic Prediction. Genes (Basel). 2019 Jul 20;10(7):553. doi: 10.3390/genes10070553. PMID: 31330861 Free PMC article. Review.
Cited by
- Review of applications of artificial intelligence (AI) methods in crop research. J Appl Genet. 2024 May;65(2):225-240. doi: 10.1007/s13353-023-00826-z. Epub 2024 Jan 13. PMID: 38216788 Review.
- A joint learning approach for genomic prediction in polyploid grasses. Sci Rep. 2022 Jul 21;12(1):12499. doi: 10.1038/s41598-022-16417-7. PMID: 35864135 Free PMC article.
- Deciphering Pleiotropic Signatures of Regulatory SNPs in Zea mays L. Using Multi-Omics Data and Machine Learning Algorithms. Int J Mol Sci. 2022 May 4;23(9):5121. doi: 10.3390/ijms23095121. PMID: 35563516 Free PMC article.
- A divide-and-conquer approach for genomic prediction in rubber tree using machine learning. Sci Rep. 2022 Oct 26;12(1):18023. doi: 10.1038/s41598-022-20416-z. PMID: 36289298 Free PMC article.
- Genome-wide association study-based prediction of atrial fibrillation using artificial intelligence. Open Heart. 2022 Jan;9(1):e001898. doi: 10.1136/openhrt-2021-001898. PMID: 35086918 Free PMC article.
