BulkLMM: Real-time genome scans for multiple quantitative traits using linear mixed models

bioRxiv [Preprint]. 2023 Dec 21:2023.12.20.572698. doi: 10.1101/2023.12.20.572698.

Abstract

Genetic studies often collect data using high-throughput phenotyping. That has led to the need for fast genomewide scans for large number of traits using linear mixed models (LMMs). Computing the scans one by one on each trait is time consuming. We have developed new algorithms for performing genome scans on a large number of quantitative traits using LMMs, BulkLMM, that speeds up the computation by orders of magnitude compared to one trait at a time scans. On a mouse BXD Liver Proteome data with more than 35,000 traits and 7,000 markers, BulkLMM completed in a few seconds. We use vectorized, multi-threaded operations and regularization to improve optimization, and numerical approximations to speed up the computations. Our software implementation in the Julia programming language also provides permutation testing for LMMs and is available at https://github.com/senresearch/BulkLMM.jl.

Keywords: Computing; Genome Scan; Julia; Linear Mixed Models; Parallel.

Publication types

  • Preprint