AUTALASSO: an automatic adaptive LASSO for genome-wide prediction

Patrik Waldmann; Maja Ferenčaković; Gábor Mészáros; Negar Khayatzadeh; Ino Curik; Johann Sölkner

doi:10.1186/s12859-019-2743-3

BMC Bioinformatics. 2019 Apr 2;20(1):167. doi: 10.1186/s12859-019-2743-3.

Authors

Patrik Waldmann¹, Maja Ferenčaković², Gábor Mészáros³, Negar Khayatzadeh³, Ino Curik², Johann Sölkner³

Affiliations

¹ Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, Box 7023, Uppsala, 750 07, Sweden. Patrik.Waldmann@slu.se.
² Department of Animal Science, Faculty of Agriculture, University of Zagreb, Svetosimunska 25, Zagreb, 10000, Croatia.
³ Division of Livestock Sciences, Department of Sustainable Agricultural Systems, University of Natural Resources and Life Sciences Vienna, Gregor Mendel Str. 33, Vienna, A-1180, Austria.

Abstract

Background: Genome-wide prediction has become the method of choice in animal and plant breeding. Prediction of breeding values and phenotypes is routinely performed using large genomic data sets, with the number of markers on the order of several thousands to millions. The number of evaluated individuals is usually smaller, which results in problems where model sparsity is of major concern. The LASSO technique has proven to be very well suited to sparse problems, often providing excellent prediction accuracy. Several computationally efficient LASSO algorithms have been developed, but optimization of hyper-parameters can be demanding.

Results: We have developed a novel automatic adaptive LASSO (AUTALASSO) based on the alternating direction method of multipliers (ADMM) optimization algorithm. The two major hyper-parameters of ADMM are the learning rate and the regularization factor. The learning rate is automatically tuned with line search, and the regularization factor is optimized using golden section search. When evaluated on simulated and real bull data, AUTALASSO provides better prediction accuracy than the adaptive LASSO, LASSO and ridge regression implemented in the popular glmnet software.
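
To make the two ingredients named above concrete, the following is a minimal sketch of a plain LASSO solved with textbook ADMM, combined with a golden section search over the regularization factor. It is not the authors' implementation: it omits the adaptive weights and the line search, uses a fixed ADMM penalty parameter `rho`, and factors a p-by-p matrix, which is only practical for small marker counts. All function names (`soft_threshold`, `lasso_admm`, `golden_section_min`) and the simulated data are assumptions made for illustration.

```python
import numpy as np


def soft_threshold(v, k):
    """Proximal operator of k * ||.||_1 (elementwise shrinkage toward zero)."""
    return np.sign(v) * np.maximum(np.abs(v) - k, 0.0)


def lasso_admm(X, y, lam, rho=1.0, n_iter=500):
    """Minimize 0.5 * ||y - X b||^2 + lam * ||b||_1 with standard ADMM.

    Assumption: rho is fixed (no line search) and p is small enough
    that a p x p Cholesky factor is affordable.
    """
    p = X.shape[1]
    L = np.linalg.cholesky(X.T @ X + rho * np.eye(p))  # factor once, reuse
    Xty = X.T @ y
    z = np.zeros(p)  # sparse copy of the coefficients
    u = np.zeros(p)  # scaled dual variable
    for _ in range(n_iter):
        rhs = Xty + rho * (z - u)
        b = np.linalg.solve(L.T, np.linalg.solve(L, rhs))  # ridge-like step
        z = soft_threshold(b + u, lam / rho)               # shrinkage step
        u = u + b - z                                      # dual update
    return z


def golden_section_min(f, lo, hi, tol=1e-3):
    """Locate a minimizer of f on [lo, hi], assuming f is unimodal there."""
    invphi = (np.sqrt(5.0) - 1.0) / 2.0
    c = hi - invphi * (hi - lo)
    d = lo + invphi * (hi - lo)
    fc, fd = f(c), f(d)
    while hi - lo > tol:
        if fc < fd:  # minimum lies in [lo, d]
            hi, d, fd = d, c, fc
            c = hi - invphi * (hi - lo)
            fc = f(c)
        else:        # minimum lies in [c, hi]
            lo, c, fc = c, d, fd
            d = lo + invphi * (hi - lo)
            fd = f(d)
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    # Simulated data (hypothetical): 120 individuals, 30 markers, 3 causal.
    rng = np.random.default_rng(0)
    X = rng.standard_normal((120, 30))
    beta = np.zeros(30)
    beta[[2, 11, 25]] = [1.5, -2.0, 1.0]
    y = X @ beta + 0.5 * rng.standard_normal(120)
    X_tr, y_tr, X_va, y_va = X[:80], y[:80], X[80:], y[80:]

    def val_mse(log_lam):
        b = lasso_admm(X_tr, y_tr, 10.0 ** log_lam)
        return np.mean((y_va - X_va @ b) ** 2)

    best = golden_section_min(val_mse, -3.0, 3.0, tol=0.05)
    print("selected lambda:", 10.0 ** best)
```

The search is run on the log10 scale of the regularization factor because useful values span several orders of magnitude. Golden section search only guarantees a local minimum when the validation error is not unimodal in that range.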

Conclusions: The AUTALASSO provides a very flexible and computationally efficient approach to genome-wide prediction (GWP), especially when it is important to obtain high prediction accuracy and genetic gain. The AUTALASSO can also perform genome-wide association studies (GWAS) of both additive and dominance effects with smaller prediction error than the ordinary LASSO.

Keywords: GWAS; Genomic selection; Mathematical optimization; Proximal algorithms; Regularization.

MeSH terms

Algorithms*
Animals
Breeding
Cattle
Genome
Genomics / methods*
Software