Computation of median gene clusters

Sebastian Böcker; Katharina Jahn; Julia Mixtacki; Jens Stoye

doi:10.1089/cmb.2009.0098

Computation of median gene clusters

J Comput Biol. 2009 Aug;16(8):1085-99. doi: 10.1089/cmb.2009.0098.

Authors

Sebastian Böcker¹, Katharina Jahn, Julia Mixtacki, Jens Stoye

Affiliation

¹ Institut für Informatik, Friedrich-Schiller-Universität Jena , Jena, Germany.

PMID: 19689215
DOI: 10.1089/cmb.2009.0098

Abstract

Whole genome comparison based on gene order has become a popular approach in comparative genomics. An important task in this field is the detection of gene clusters, i.e., sets of genes that occur co-localized in several genomes. For most applications, it is preferable to extend this definition to allow for small deviations in the gene content of the cluster occurrences. However, relaxing the equality constraint increases the computational complexity of gene cluster detection drastically. Existing approaches deal with this problem by using simplifying constraints on the cluster definition and/or allowing only pairwise genome comparison. In this article, we introduce a cluster concept named median gene clusters that improves over existing models, present efficient algorithms for their computation and show experimental results on the detection of approximate gene clusters in multiple genomes.

MeSH terms

Algorithms*
Genomics / methods*
Multigene Family*