Explaining microbial phenotypes on a genomic scale: GWAS for microbes

Brief Funct Genomics. 2013 Jul;12(4):366-80. doi: 10.1093/bfgp/elt008. Epub 2013 Apr 26.


There is an increasing availability of complete or draft genome sequences for microbial organisms. These data form a potentially valuable resource for genotype-phenotype association and gene function prediction, provided that phenotypes are consistently annotated for all the sequenced strains. In this review, we address the requirements for successful gene-trait matching. We outline a basic protocol for microbial functional genomics, including genome assembly, annotation of genotypes (including single nucleotide polymorphisms, orthologous groups and prophages), data pre-processing, genotype-phenotype association, visualization and interpretation of results. The methodologies for association described herein can be applied to other data types, opening up possibilities to analyze transcriptome-phenotype associations, and correlate microbial population structure or activity, as measured by metagenomics, to environmental parameters.

Keywords: functional genomics; genome-wide association studies; genotype–phenotype association; microbial genomics; random forest.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Genetic Association Studies
  • Genome-Wide Association Study / methods*
  • Genomics / methods*