Application of linear and machine learning models to genomic prediction of fatty acid composition in Japanese Black cattle

Anim Sci J. 2023 Jan-Dec;94(1):e13883. doi: 10.1111/asj.13883.

Abstract

We collected 3180 records of oleic acid (C18:1) and monounsaturated fatty acid (MUFA) measured using gas chromatography (GC) and 6960 records of C18:1 and MUFA measured using near-infrared spectroscopy (NIRS) in intermuscular fat samples of Japanese Black cattle. We compared genomic prediction performance for four linear models (genomic best linear unbiased prediction [GBLUP], kinship-adjusted multiple loci [KAML], BayesC, and BayesLASSO) and five machine learning models (Gaussian kernel [GK], deep kernel [DK], random forest [RF], extreme gradient boost [XGB], and convolutional neural network [CNN]). For GC-based C18:1 and MUFA, KAML showed the highest accuracies, followed by BayesC, XGB, DK, GK, and BayesLASSO, with more than 6% gain of accuracy by KAML over GBLUP. Meanwhile, DK had the highest prediction accuracy for NIRS-based C18:1 and MUFA, but the difference in accuracies between DK and KAML was slight. For all traits, accuracies of RF and CNN were lower than those of GBLUP. The KAML extends GBLUP methods, of which marker effects are weighted, and involves only additive genetic effects; whereas machine learning methods capture non-additive genetic effects. Thus, KAML is the most suitable method for breeding of fatty acid composition in Japanese Black cattle.

Keywords: Japanese Black cattle; fatty acid composition; genomic prediction; machine learning.

MeSH terms

  • Animals
  • Cattle / genetics
  • Fatty Acids*
  • Fatty Acids, Monounsaturated
  • Genome*
  • Genomics / methods
  • Genotype
  • Machine Learning
  • Models, Genetic
  • Phenotype
  • Polymorphism, Single Nucleotide

Substances

  • Fatty Acids
  • Fatty Acids, Monounsaturated