Statistical inference of chromosomal homology based on gene colinearity and applications to Arabidopsis and rice

BMC Bioinformatics. 2006 Oct 12;7:447. doi: 10.1186/1471-2105-7-447.

Abstract

Background: The identification of chromosomal homology will shed light on such mysteries of genome evolution as DNA duplication, rearrangement and loss. Several approaches have been developed to detect chromosomal homology based on gene synteny or colinearity. However, the previously reported implementations lack statistical inferences which are essential to reveal actual homologies.

Results: In this study, we present a statistical approach to detect homologous chromosomal segments based on gene colinearity. We implement this approach in a software package ColinearScan to detect putative colinear regions using a dynamic programming algorithm. Statistical models are proposed to estimate proper parameter values and evaluate the significance of putative homologous regions. Statistical inference, high computational efficiency and flexibility of input data type are three key features of our approach.

Conclusion: We apply ColinearScan to the Arabidopsis and rice genomes to detect duplicated regions within each species and homologous fragments between these two species. We find many more homologous chromosomal segments in the rice genome than previously reported. We also find many small colinear segments between rice and Arabidopsis genomes.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Arabidopsis / genetics*
  • Base Sequence
  • Chromosome Mapping / methods*
  • Chromosomes, Plant / genetics*
  • Computer Simulation
  • Data Interpretation, Statistical
  • Genome, Plant / genetics*
  • Models, Genetic
  • Models, Statistical
  • Molecular Sequence Data
  • Multigene Family / genetics*
  • Oryza / genetics*
  • Sequence Homology, Nucleic Acid
  • Synteny / genetics*