Identification of "known unknowns" utilizing accurate mass data and chemical abstracts service databases

J Am Soc Mass Spectrom. 2011 Feb;22(2):348-59. doi: 10.1007/s13361-010-0034-3. Epub 2011 Jan 28.

Abstract

In many cases, an unknown to an investigator is actually known in the chemical literature. We refer to these types of compounds as "known unknowns." Chemical Abstracts Service (CAS) Registry is a particularly good source of these substances as it contains over 54 million entries. Accurate mass measurements can be used to query the CAS Registry by either molecular formulae or average molecular weights. Searching the database by the web-based version of SciFinder is the preferred approach when molecular formulae are available. However, if a definitive molecular formula cannot be ascertained, searching the database with STN Express by average molecular weights is a viable alternative. The results from either approach are refined by employing the number of associated references or minimal sample history as orthogonal filters. These approaches were shown to be successful in identifying "known unknowns" noted in LC-MS and even GC-MS analyses in our laboratory. In addition, they were demonstrated in the identification of a variety of compounds of interest to others.

MeSH terms

  • Databases, Factual*
  • Mass Spectrometry*
  • Models, Chemical*
  • Molecular Weight
  • Pesticides
  • Pharmaceutical Preparations
  • Polymers

Substances

  • Pesticides
  • Pharmaceutical Preparations
  • Polymers