Genet Epidemiol. 2015 Jan;39(1):2-10.
doi: 10.1002/gepi.21876. Epub 2014 Dec 13.

Genetic data simulators and their applications: an overview

Free PMC article

Bo Peng et al. Genet Epidemiol. 2015 Jan.

Abstract

Computer simulations have played an indispensable role in the development and applications of statistical models and methods for genetic studies across multiple disciplines. The need to simulate complex evolutionary scenarios and pseudo-datasets for various studies has fueled the development of dozens of computer programs with varying reliability, performance, and application areas. To help researchers compare and choose the most appropriate simulators for their studies, we have created the genetic simulation resources (GSR) website, which allows authors of simulation software to register their applications and describe them with more than 160 defined attributes. This article summarizes the properties of 93 simulators currently registered at GSR and provides an overview of the development and applications of genetic simulators. Unlike other review articles that address technical issues or compare simulators for particular application areas, we focus on software development, maintenance, and features of simulators, often from a historical perspective. Publications that cite these simulators are used to summarize both the applications of genetic simulations and the utilization of simulators.

Keywords: genetic simulation; genetic simulation resources; software.

Figures

Figure 1
Number of simulators that simulate each type of genetic sequence, by year. X-axis: year of initial release of the simulator. Y-axis: number of simulators. A package that simulates multiple types of sequences is displayed in multiple groups.
Figure 2
Number of newly developed simulators that use each simulation approach, by year. X-axis: year of initial release of the simulator. Y-axis: number of simulators. A simulator that uses multiple simulation methods is displayed in each applicable group, as well as in the “multiple methods” group.
Figure 3
Number of evolutionary features provided by simulators using different simulation methods. Y-axis: number of features provided. X-axis: number of packages.
Figure 4
Distribution of publications by discipline. Number of publications that cite the simulators catalogued in GSR, grouped by discipline as categorized by Web of Science.
Figure 5
Distribution of simulation methods for publications that cite the 93 catalogued simulators, from 1990 to 2013. We group simulators by the simulation methods they use and count the number of publications that cite simulators in each group. Articles that cite more than one simulator, or a simulator using multiple methods, are counted multiple times.
