Density functional theory calculations of the geometry and vibrational frequencies. Equilibrium geometries of the investigated G·C base pairs and transition states (TSs) of their mutual conformational or tautomeric transformations, as well as their harmonic vibrational frequencies have been calculated, using Gaussian’09 program package [32] at the B3LYP/6-311++G(d,p) level of theory [33-37], which approved itself successfully for the calculations of the similar systems and processes and shown acceptable level of accuracy and adequacy of the obtained results [37, 38]. A scaling factor that is equal to 0.9668 has been applied in the present work for the correction of the harmonic frequencies for all complexes [30, 31, 39].
We have confirmed local minima and TSs, localized by Synchronous Transit-guided Quasi-Newton method [40], on the potential energy landscape by the absence or presence, respectively, of one imaginary frequency in the vibrational spectra of the complexes. All reaction pathways have been reliably confirmed by providing intrinsic reaction coordinate (IRC) calculations [40] from each TS in the forward and reverse directions at the B3LYP/6-311++G(d,p) level of theory.
All calculations have been performed in the continuum with ε=1, that adequately reflects the processes occurring in real biological systems without deprivation of the structurally-functional properties of the bases in the composition of the DNA or RNA molecules and satisfactorily models the substantially hydrophobic recognition pocket of the DNA-polymerase machinery as a part of the replisome [41, 42].
Single point energy calculations. We continued geometry optimizations with electronic energy calculations as the single point calculations at the MP2/6-311++G(2df,pd) level of theory [43, 44].
The Gibbs free energy G for all structures was obtained in the following way:
G=Eel+Ecorr, (1)
where Eel – electronic energy and Ecorr – thermal correction.
QTAIM analysis. Bader's quantum theory of Atoms in Molecules (QTAIM) [45] was applied to analyse the electron density distribution, using program package AIMAll [46]. The presence of the bond critical point (BCP), namely the so-called (3,-1) BCP, and a bond path between the donor and acceptor of the H-bond, as well as the positive value of the Laplacian at this BCP (Δρ>0), were considered as criteria for the formation of the H-bond [47-50]. Wave functions were obtained at the B3LYP/6-311++G(d,p) level of theory, used for geometry optimisation.
The atomic numbering scheme for the DNA bases is conventional and rare tautomeric forms of the G and C bases are marked by an asterisk (*) [14].
Obtained results and their discussion.
Obtained results are presented on Fig. 1 and in Table 1, which detailed discussion are outlined below.
Altogether, it was established five novel routes of the conformationally-tautomeric transformations of the G·C base pairs – 1. G*·C*(WC)↔G*t·C(rwН)↑↔G*N7·C*(rwH)↑↔G*N7·C*(wH)↑↔G*t·C*O2(wH)↑↔G*t·C*O2(rwH)↑, 2. G*·C*(rWC)↔G*t·C*O2(rwWC)↓/↔G*t·C*O2(wH)↑, 3. G*·C*t(rWC)↔G*t·C*tO2(wН)↓↔
G*t·C*tO2(rwH)↓/↔G*t·C*tO2(rwWC)↓↔G*t·C*tO2(wWC)↓, 4. G*t·C*(rH)↔G*·C*O2(wWC)↑/
↔G*·C*O2(wH)↓ and 5. G*t·C*t(rH)↔G*·C*tO2(wWC)↑/
↔G*·C*tO2(rwH)↓↔G*·C*tO2(wH)↓ (Fig. 1 and Table 1).
General feature of these reactions of the transformations of the WC/H base pairs is that they start from the rotations of the G·C base pairs around the intermolecular H-bonds, which is further followed by the proton transfer (PT) along the intermolecular neighboring H-bonds (Fig. 1 and Table 1).
So, let’s analyze step-by-step revealed pathways in more details.
- The G*·C*(WC)↔G*·C*(rwWC/H)↔G*t·C(rwWC/H)↔G*t·C(rwН)↑↔G*N7·C*(rwH)↑↔
G*N7·C*(wH)↑↔G*t·C*O2(wH)↑↔G*t·C*O2(rwH)↑ reaction pathway leads to the formation of the wobble (wH) or reverse wobble (rwH) base pairs, which bind from the Hoogsteen side, and consists from seven stages (Fig. 1a). This conformationally-tautomeric transformation starts from the rotation of the bases within the Lowdin G*·C*(WC) base pair around the upper (G)O6H…N4(C) H-bond, leading to the formation of the G*·C*(rwWC/H) base pair - G*·C*(WC)↔G*·C*(rwWC/H).
Further, formed G*·C*(rwWC/H) base pair transforms through the proton transfer along the intermolecular O6H…N4 and N3H…O6 H-bonds into the G*·C*(rwWC/H) base pair - G*·C*(rwWC/H)↔G*t·C(rwWC/H), which is accompanied by the changing of the orientation of the O6H hydroxyl group of the G base from cis- to trans-.
Formed G*t·C(rwWC/H) base pair transforms through the mutual rotation of the bases around the middle intermolecular (G)O6H…N3(C) H-bond - G*t·C(rwWC/H)↔G*t·C(rwН)↑. Novel reverse wobble Hoogsteen G*t·C(rwН)↑ base pair launches cascade of the interconversions between the wobble Hoogsteen (G*N7·C*(wH)↑, G*t·C*O2(wH)↑) and reverse wobble Hoogsteen base pairs (G*t·C(rwН)↑, G*N7·C*(rwH)↑, G*t·C*O2(rwH)↑), occurring through the proton transfer and rotations of the bases - G*t·C(rwН)↑↔G*N7·C*(rwH)↑↔G*N7·C*(wH)↑↔G*t·C*O2(wH)↑↔G*t·C*O2(rwH)↑ (Fig. 1a).
Notably, that during this G*·C*(WC)↔G*t·C*O2(rwH)↑ transformation dipole moment µ of the complex varies in the wide range of values – 2.36-11.12 D, reaching its maximum at the TSs – TSG*·C*(rwWC/H)↔G*t·C(rwWC/H) (µ=10.48 D), TSG*t·C(rwWC/H)↔G*t·C(rwH)↑ (µ=8.84 D), TSG*t·C(rwH)↑↔G*N7·C*(rwH)↑ (µ=11.12 D) and TSG*N7·C*(rwH)↑↔G*N7·C*(H)↑ (µ=9.46 D), indicating their high polarization.
- The G*·C*(rWC)↔G*·C*(wWC/H)↔G*t·C*O2(wWC/H)↔G*t·C*O2(rwWC)↓/
↔G*t·C*O2(wH)↑ reaction pathway starts from the rotation of the bases within the reverse Lowdin G*·C*(rWC) base pair and further splits into two pathways – G*t·C*O2(wWC/H)↔G*t·C*O2(rwWC)↓ and G*t·C*O2(wWC/H)↔G*t·C*O2(wH)↑, leading to the formations of the reverse wobble Watson-Crick G*t·C*O2(rwWC)↓ and wobble Hoogsteen G*t·C*O2(wH)↑ base pairs, accordingly (Fig. 1b).
This division occurs at the so-called bifurcation point – G*t·C*O2(wWC/H), which arises after the proton transfer – G*·C*(wWC/H)↔G*t·C*O2(wWC/H), accompanied by cis→trans-transition of the O6H hydroxyl group of the G* base. Following the 1st route – G*t·C*O2(wWC/H)↔G*t·C*O2(rwWC)↓ – this bifurcation point transforms into the reverse wobble Watson-Crick G*t·C*O2(rwWC)↓ base pair, while following the 2nd route – it comes to the wobble Hoogsteen G*t·C*O2(wH)↑ base pair. Notably, that both formed G*t·C*O2(rwWC)↓ and G*t·C*O2(wH)↑ base pairs are almost iso-energetical, since their relative Gibbs free energies consist ΔG=16.83 and 16.02 kcal·mol-1, accordingly.
The minimum values of the dipole moment µ of these complexes is achieved for the G*t·C*O2(rwWC)↓ base pair (µ=2.25 D), while the maximum value – for the TSG*t·C*O2(wWC/H)↔G*t·C*O2(wH)↑ (µ=7.55 D).
- The G*·C*t(rWC)↔G*·C*t(wWC/H)↔G*t·C*tO2(wWC/H)↔G*t·C*tO2(wН)↓↔
G*t·C*tO2(rwH)↓/↔G*t·C*tO2(rwWC)↓↔G*t·C*tO2(wWC)↓ reaction pathway leads to the G*t·C*tO2(rwH)↓ and G*t·C*tO2(wWC)↓ base pairs, accordingly (Fig. 1c). It proceeds through the bifurcation point – G*t·C*tO2(wWC/H) base pair, which arises after the G*·C*t(wWC/H)↔G*t·C*tO2(wWC/H) transformation through the proton transfer in the G*·C*t(wWC/H) base pair.
This G*t·C*tO2(wWC/H) base pair launches two separate routes of the cascade transformations through the rotations of the bases around the intermolecular H-bonds, leading to the shifting of the base on the right down:
- G*t·C*tO2(wWC/H)↔G*t·C*tO2(wН)↓↔G*t·C*tO2(rwH)↓;
- G*t·C*tO2(wWC/H)↔G*t·C*tO2(rwWC)↓↔G*t·C*tO2(wWC)↓.
Maximum values of the dipole moment µ are achieved at the G*t·C*tO2(wН)↓ (µ=7.64 D), G*t·C*tO2(rwWC)↓ (µ=4.88 D), G*t·C*tO2(wWC)↓ (µ=5.26) base pairs and TSG*t·C*tO2(wН)↓↔G*t·C*tO2(rwН)↓ (µ=8.04 D) (Fig. 1c).
- Joint feature of the G*t·C*(rH)↔G*t·C*(wWC/H)↔G*·C*O2(wWC/H)↔G*·C*O2(wWC)↑
/↔G*·C*O2(wH)↓ (Fig. 1d) and G*t·C*t(rH)↔G*t·C*t(wWC/H)↔G*·C*tO2(wWC/H)↔G*·C*tO2(wWC)↑
/↔G*·C*tO2(rwH)↓↔G*·C*tO2(wH)↓ (Fig. 1e) transformations of the reverse Hoogsteen G*t·C*(rH) and G*t·C*t(rH) base pairs, respectively, is that they both start from the mutual rotations of the bases around the upper (G)O6H…O2(C) H-bond and further continue with the proton transfer along the intermolecular N3H…O6 and O6H…O2 H-bonds, leading to the so-called bifurcation point – G*·C*O2(wWC/H) and G*·C*tO2(wWC/H) base pairs, respectively.
After that, reaching the so-called bifurcation point, each of these reactions divides into two routes. The 1st route – G*·C*O2(wWC/H)↔G*·C*O2(wWC)↑ and G*·C*tO2(wWC/H)↔G*·C*tO2(wWC)↑, respectively, occurs through the mutual rotations of the bases around the (G)O6H…N3(C) H-bond and leads to the formation of the G*·C*O2(wWC)↑ and G*·C*tO2(wWC)↑ base pairs, respectively (Figs. 1d and 1e). The 2nd route interconnects the formed G*·C*O2(wWC/H) and G*·C*tO2(wWC/H) base pairs with various base pairs through the cascade of the rotational transformations – G*·C*O2(wWC/H)↔G*·C*O2(wH)↓ and G*·C*tO2(wWC/H)↔G*·C*tO2(rwH)↓↔G*·C*tO2(wH)↓, respectively (Figs. 1d and 1e).
Dipole moments reach their maximum values (6.00-7.27 D) at the TSs (Figs. 1d and 1e).
Altogether, these conformationally-tautomeric transformations are accompanied by the re-arrangement of the intermolecular AH…B H-bonds at the base pairs and TSs of their interconversions, reaching their maximal H…B distances (2.505-2.598 Å) at the unusual G*·C*tO2(rwH)↓ and G*·C*tO2(wH)↓ base pairs (Fig. 1e).