Matched Molecular Pair Analysis on Large Melting Point Datasets: A Big Data Perspective

ChemMedChem. 2018 Mar 20;13(6):599-606. doi: 10.1002/cmdc.201700303. Epub 2017 Aug 23.

Abstract

A matched molecular pair (MMP) analysis was used to examine the change in melting point (MP) between pairs of similar molecules in a set of ∼275k compounds. We found many cases in which the change in MP (ΔMP) of compounds correlates with changes in functional groups. In line with the results of a previous study, correlations between ΔMP and simple molecular descriptors, such as the number of hydrogen bond donors, were identified. In using a larger dataset, covering a wider chemical space and range of melting points, we observed that this method remains stable and scales well with larger datasets. This MMP-based method could find use as a simple privacy-preserving technique to analyze large proprietary databases and share findings between participating research groups.

Keywords: OCHEM; general solubility equation; matched molecular pairs; melting points.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Big Data*
  • Databases, Chemical
  • Datasets as Topic*
  • Hydrogen Bonding
  • Models, Chemical*
  • Molecular Structure
  • Quantitative Structure-Activity Relationship
  • Solubility
  • Transition Temperature*