This study evaluated the use of multiomics data for the classification of rheumatoid arthritis (RA). Three approaches were compared in terms of prediction accuracy: (1) whole-genome prediction (WGP) using SNP marker information only, (2) whole-methylome prediction (WMP) using methylation profiles only, and (3) whole-genome/methylome prediction (WGMP) combining both omics layers. The numbers of SNPs and methylation sites varied across scenarios, with 1, 10, or 50% of them preselected by one of four approaches: at random, evenly spaced, by lowest p value (from a genome-wide or epigenome-wide association study), or by estimated effect size from a Bayesian ridge regression (BRR) model. To remove the effects of high pairwise linkage disequilibrium (LD), SNPs were also preselected with an LD-pruning method. Five Bayesian regression models were studied for classification: BRR, Bayes-A, Bayes-B, Bayes-C, and the Bayesian LASSO. Adjusting methylation profiles for cellular heterogeneity within whole blood samples had a detrimental effect on the classification ability of the models. Overall, WGMP using the Bayes-B model had the best performance. In particular, the best method for predicting disease risk selected SNPs by LD-pruning, included the 1% of methylation sites selected by BRR, and fitted the most significant SNP as a fixed effect, giving a classification accuracy of 0.975. Our results show that multiomics data can be used to effectively predict the risk of RA and to identify cases at early stages, so that disease progression can be prevented or altered through appropriate interventions.
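The four preselection approaches named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function name `preselect`, its parameters, and the synthetic p values are hypothetical, and it assumes the p values or BRR effect estimates have already been computed elsewhere and are passed in as a plain array.

```python
import numpy as np


def preselect(n_features, fraction, strategy, scores=None, seed=0):
    """Return sorted indices of the preselected features (hypothetical helper).

    strategy: "random", "even", "pvalue" (keep the smallest scores),
    or "effect" (keep the largest absolute scores).
    """
    # Number of features to keep, e.g. 1, 10, or 50% of the total.
    k = max(1, int(round(fraction * n_features)))

    if strategy == "random":
        rng = np.random.default_rng(seed)
        idx = rng.choice(n_features, size=k, replace=False)
    elif strategy == "even":
        # Evenly spaced positions along the marker order.
        idx = (np.arange(k) * n_features) // k
    elif strategy in ("pvalue", "effect"):
        if scores is None:
            raise ValueError("scores are required for this strategy")
        scores = np.asarray(scores, dtype=float)
        if strategy == "pvalue":
            idx = np.argsort(scores)[:k]            # lowest p values first
        else:
            idx = np.argsort(-np.abs(scores))[:k]   # largest |effect| first
    else:
        raise ValueError(f"unknown strategy: {strategy}")

    return np.sort(idx)


if __name__ == "__main__":
    # Synthetic p values stand in for GWAS/EWAS output (assumption).
    rng = np.random.default_rng(1)
    p_values = rng.uniform(size=1000)
    for fraction in (0.01, 0.10, 0.50):
        kept = preselect(1000, fraction, "pvalue", scores=p_values)
        print(fraction, len(kept))
```

LD-pruning is not covered here, since it needs pairwise correlations between SNP genotypes rather than a single score per marker.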
Conflict of interest statement
The authors declare that they have no conflict of interest.