Analysis of correlated mutations in HIV-1 protease using spectral clustering

Bioinformatics. 2008 May 15;24(10):1243-50. doi: 10.1093/bioinformatics/btn110. Epub 2008 Mar 28.

Abstract

Motivation: The ability of human immunodeficiency virus-1 (HIV-1) protease to develop mutations that confer multi-drug resistance (MDR) has been a major obstacle in designing rational therapies against HIV. Resistance is usually imparted by a cooperative mechanism that can be elucidated by a covariance analysis of sequence data. Identification of such correlated substitutions of amino acids may be obscured by evolutionary noise.

Results: HIV-1 protease sequences from patients subjected to different specific treatments (set 1), and from untreated patients (set 2) were subjected to sequence covariance analysis by evaluating the mutual information (MI) between all residue pairs. Spectral clustering of the resulting covariance matrices disclosed two distinctive clusters of correlated residues: the first, observed in set 1 but absent in set 2, contained residues involved in MDR acquisition; and the second, included those residues differentiated in the various HIV-1 protease subtypes, shortly referred to as the phylogenetic cluster. The MDR cluster occupies sites close to the central symmetry axis of the enzyme, which overlap with the global hinge region identified from coarse-grained normal-mode analysis of the enzyme structure. The phylogenetic cluster, on the other hand, occupies solvent-exposed and highly mobile regions. This study demonstrates (i) the possibility of distinguishing between the correlated substitutions resulting from neutral mutations and those induced by MDR upon appropriate clustering analysis of sequence covariance data and (ii) a connection between global dynamics and functional substitution of amino acids.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Amino Acid Substitution
  • Base Sequence
  • Cluster Analysis*
  • DNA Mutational Analysis / methods*
  • Evolution, Molecular*
  • HIV Protease / chemistry*
  • HIV Protease / genetics*
  • Molecular Sequence Data
  • Mutation
  • Sequence Alignment / methods*
  • Sequence Analysis, DNA / methods*
  • Statistics as Topic

Substances

  • HIV Protease
  • p16 protease, Human immunodeficiency virus 1