Prioritisation of candidate Single Amino Acid Polymorphisms using one-class learning machines

Int J Comput Biol Drug Des. 2011;4(4):316-31. doi: 10.1504/IJCBDD.2011.044446. Epub 2011 Dec 24.

Abstract

Recent advancements of the next-generation sequencing technology have enabled the direct sequencing of rare genetic variants in both case and control individuals. Although there have been a few statistical methods for uncovering potential associations between multiple rare variants and human inherited diseases, most of these methods require computational approaches to filter out non-functional variants for the purpose of maximising the statistical power. To tackle this problem, we formulate the detection of genetic variants that are associated with a specific type of disease from the perspective of one-class novelty learning. We focus on a typical type of genetic variants called Single Amino Acid Polymorphisms (SAAPs), and we take advantages of a feature selection mechanism and two one-class learning methods to prioritise candidate SAAPs. Systematic validation demonstrates that the proposed model is effective in recovering disease-associated SAAPs.

Publication types

  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Amino Acid Sequence
  • Amino Acids / genetics*
  • Genetic Variation*
  • Humans
  • Models, Theoretical
  • Neural Networks, Computer
  • Polymorphism, Genetic*
  • Principal Component Analysis
  • Support Vector Machine

Substances

  • Amino Acids