EWASex: an efficient R-package to predict sex in epigenome-wide association studies

Bioinformatics. 2020 Dec 11:btaa949. doi: 10.1093/bioinformatics/btaa949. Online ahead of print.

Abstract

Summary: Epigenome-Wide Association Study (EWAS) has become a powerful approach to identify epigenetic variations associated with diseases or health traits. Sex is an important variable to include in EWAS to ensure unbiased data processing and statistical analysis. We introduce the R-package EWASex, which allows for fast and highly accurate sex-estimation using DNA methylation data on a small set of CpG sites located on the X-chromosome under stable X-chromosome inactivation in females.

Results: We demonstrate that EWASex outperforms the current state of the art tools by using different EWAS datasets. With EWASex, we offer an efficient way to predict and to verify sex that can be easily implemented in any EWAS using blood samples or even other tissue types. It comes with pre-trained weights to work without prior sex labels and without requiring access to RAW data, which is a necessity for all currently available methods.

Availability and implementation: The EWASex R-package along with tutorials, documentation and source code are available at https://github.com/Silver-Hawk/EWASex.

Supplementary information: Supplementary data are available at Bioinformatics online.