Using uniformat and gene[rate] to Analyze Data with Ambiguities in Population Genetics

José Manuel Nunes

doi:10.4137/EBO.S32415

Using uniformat and gene[rate] to Analyze Data with Ambiguities in Population Genetics

Evol Bioinform Online. 2016 Feb 21;11(Suppl 2):19-26. doi: 10.4137/EBO.S32415. eCollection 2015.

Author

José Manuel Nunes¹

Affiliation

¹ Department of Genetics and Evolution, Anthropology Unit, University of Geneva, Geneva, Switzerland.

Abstract

Some genetic systems frequently present ambiguous data that cannot be straightforwardly analyzed with common methods of population genetics. Two possibilities arise to analyze such data: one is the arbitrary simplification of the data and the other is the development of methods adapted to such ambiguous data. In this article, we present an attempt at such a development, the uniformat grammar and The gene[rate] tools, highlighting the specific aspects and the adaptations required to analyze ambiguous nominal data in population genetics.

Keywords: EM algorithm; Hardy-Weinberg; ambiguous genetic data; data manipulation; frequency estimation; linkage disequilibrium.