Using uniformat and gene[rate] to Analyze Data with Ambiguities in Population Genetics

Evol Bioinform Online. 2016 Feb 21;11(Suppl 2):19-26. doi: 10.4137/EBO.S32415. eCollection 2015.

Abstract

Some genetic systems frequently yield ambiguous data that cannot be analyzed directly with the common methods of population genetics. There are two ways to handle such data: simplify them arbitrarily, or develop methods adapted to the ambiguity. In this article, we present an attempt at the latter, the uniformat grammar and the gene[rate] tools, and highlight the specific aspects and adaptations required to analyze ambiguous nominal data in population genetics.

Keywords: EM algorithm; Hardy-Weinberg; ambiguous genetic data; data manipulation; frequency estimation; linkage disequilibrium.