Accurate geometrical restraints for Watson-Crick base pairs

Acta Crystallogr B Struct Sci Cryst Eng Mater. 2019 Apr 1;75(Pt 2):235-245. doi: 10.1107/S2052520619002002. Epub 2019 Mar 27.

Abstract

Geometrical restraints provide key structural information for the determination of biomolecular structures at lower resolution by experimental methods such as crystallography or cryo-electron microscopy. In this work, restraint targets for nucleic acids bases are derived from three different sources and compared: small-molecule crystal structures in the Cambridge Structural Database (CSD), ultrahigh-resolution structures in the Protein Data Bank (PDB) and quantum-mechanical (QM) calculations. The best parameters are those based on CSD structures. After over two decades, the standard library of Parkinson et al. [(1996), Acta Cryst. D52, 57-64] is still valid, but improvements are possible with the use of the current CSD database. The CSD-derived geometry is fully compatible with Watson-Crick base pairs, as comparisons with QM results for isolated and paired bases clearly show that the CSD targets closely correspond to proper base pairing. While the QM results are capable of distinguishing between single and paired bases, their level of accuracy is, on average, nearly two times lower than for the CSD-derived targets when gauged by root-mean-square deviations from ultrahigh-resolution structures in the PDB. Nevertheless, the accuracy of QM results appears sufficient to provide stereochemical targets for synthetic base pairs where no reliable experimental structural information is available. To enable future tests for this approach, QM calculations are provided for isocytosine, isoguanine and the iCiG base pair.

Keywords: Cambridge Structural Database (CSD); Protein Data Bank (PDB); canonical Watson–Crick base pairs; isocytosine (iC); isoguanine (iG); nucleobase geometry; quantum-mechanical calculations; stereochemical restraints; ultrahigh resolution.