Creation and implications of a phenome-genome network

Nat Biotechnol. 2006 Jan;24(1):55-62. doi: 10.1038/nbt1150.


Although gene and protein measurements are increasing in quantity and comprehensiveness, they do not characterize a sample's entire phenotype in an environmental or experimental context. Here we comprehensively consider associations between components of phenotype, genotype and environment to identify genes that may govern phenotype and responses to the environment. Context from the annotations of gene expression data sets in the Gene Expression Omnibus is represented using the Unified Medical Language System, a compendium of biomedical vocabularies with nearly 1-million concepts. After showing how data sets can be clustered by annotative concepts, we find a network of relations between phenotypic, disease, environmental and experimental contexts as well as genes with differential expression associated with these concepts. We identify novel genes related to concepts such as aging. Comprehensively identifying genes related to phenotype and environment is a step toward the Human Phenome Project.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Aging / physiology
  • Animals
  • Artificial Intelligence
  • Cluster Analysis
  • Databases, Protein*
  • Environment
  • Gene Expression Regulation / physiology*
  • Genomics / methods*
  • Mice
  • Mice, Inbred C57BL
  • Models, Biological*
  • Muscle Proteins / physiology*
  • Muscle, Skeletal / physiology*
  • Natural Language Processing
  • Pattern Recognition, Automated / methods
  • Protein Interaction Mapping / methods
  • Signal Transduction / physiology*
  • Unified Medical Language System


  • Muscle Proteins