WGBSSuite: simulating whole-genome bisulphite sequencing data and benchmarking differential DNA methylation analysis tools

Bioinformatics. 2015 Jul 15;31(14):2371-3. doi: 10.1093/bioinformatics/btv114. Epub 2015 Mar 15.

Abstract

Motivation: As the number of studies looking at differences between DNA methylation increases, there is a growing demand to develop and benchmark statistical methods to analyse these data. To date no objective approach for the comparison of these methods has been developed and as such it remains difficult to assess which analysis tool is most appropriate for a given experiment. As a result, there is an unmet need for a DNA methylation data simulator that can accurately reproduce a wide range of experimental setups, and can be routinely used to compare the performance of different statistical models.

Results: We have developed WGBSSuite, a flexible stochastic simulation tool that generates single-base resolution DNA methylation data genome-wide. Several simulator parameters can be derived directly from real datasets provided by the user in order to mimic real case scenarios. Thus, it is possible to choose the most appropriate statistical analysis tool for a given simulated design. To show the usefulness of our simulator, we also report a benchmark of commonly used methods for differential methylation analysis.

Availability and implementation: WGBS code and documentation are available under GNU licence at http://www.wgbssuite.org.uk/

Contact: : owen.rackham@imperial.ac.uk or l.bottolo@imperial.ac.uk

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Benchmarking*
  • Computer Simulation*
  • DNA Methylation*
  • Genome, Human
  • Humans
  • Models, Statistical*
  • Sequence Analysis, DNA / methods*
  • Software*
  • Stochastic Processes
  • Sulfites / chemistry*

Substances

  • Sulfites
  • hydrogen sulfite