An improved scoring scheme for predicting glycan structures from gene expression data

Genome Inform. 2007:18:237-46.

Abstract

The prediction of glycan structures from gene expression of glycosyltransferases (GTs) is a challenging new area in computational biology because the biosynthesis of glycan chains is under the control of GT expression. In this paper we developed a new method for predicting glycan structures from gene expression data. There are two main original aspects of the proposed method. First, we proposed to increase the number of predictable glycan structure candidates by estimating missing glycans from a global glycan structure map, which enables us to predict new glycan structures that are not stored in the database. Second, we proposed a more general scoring scheme based on real-valued gene expression intensity rather than converting it into binary information. In the result we applied the proposed method to predicting cancer-specific glycan structures from gene expression profiles for patients of acute lymphocytic leukemia (ALL) and acute myelocytic leukemia (AML). We confirmed that several of the predicted glycan structures successfully correspond to known cancer-specific glycan structures according to the literature, and our method outperforms the previous methods at a statistically significant level.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Carbohydrate Conformation*
  • Gene Expression Profiling*
  • Humans
  • Leukemia, Myeloid, Acute / genetics
  • Oligonucleotide Array Sequence Analysis
  • Polysaccharides / chemistry*
  • Precursor Cell Lymphoblastic Leukemia-Lymphoma / genetics

Substances

  • Polysaccharides