Atomic Structures of the M Olecular Components in DNA and RNA Based on Bond Lengths As Sums of Atomic Radii

1 Atomic St ructures of the M olecular Components in DNA and RNA based on Bond L engths as Sums of Atomic Radii Raji Heyrovská (Institute of Biophysics, Academy of Sciences of the Czech Republic) E-mail: [email protected] Abstract The author’s interpretation in recent years of bond lengths as sums of the relevant atomic/ionic radii has been extended here to the bonds in the skeletal structures of adenine, guanine, thymine, cytosine, uracil, ribose, deoxyribose and phosphoric acid. On examining the bond length data in the literature, it has been found that the averages of the bond lengths are close to the sums of the corresponding atomic covalent radii of carbon (tetravalent), nitrogen (trivalent), oxygen (divalent), hydrogen (monovalent) and phosphorus (pentavalent). Thus, the conventional molecular structures have been resolved here (for the first time) into probable atomic structures. 1. Introduction The interpretation of the bond lengths in the molecules constituting DNA and RNA have been of interest ever since the discovery [1] of the molecular structure of nucleic acids. This work was motivated by the earlier findings [2a,b] that the inter-atomic distance between any two atoms or ions is, in general, the sum of the radii of the atoms (and or ions) constituting the chemical bond, whether the bond is partially or fully ionic or covalent. It 2 was shown in [3] that the lengths of the hydrogen bonds in the Watson- Crick [1] base pairs (A, T and C, G) of nucleic acids as well as in many other inorganic and biochemical compounds are sums of the ionic or covalent radii of the hydrogen donor and acceptor atoms or ions and of the transient proton involved in the hydrogen bonding. Thus, it was shown [2, 3] that, in general, bond lengths between any two atoms (or ions) can be considered as sum of the radii of the atoms (or ions) constituting the bond. In the case of covalent bonds in general, Pauling [4] considered the bond lengths as sums of the covalent radii of the constituent atoms. However, Pauling [4] did not extend this to the purines (A and G) and pyrimidines (T, C and U). This article shows (for the first time) that the existing data on the inter- atomic distances in adenine(A), guanine (G), thymine (T) (in DNA), cytosine (C), uracil (U) (in RNA), ribose (in RNA), deoxyribose (in DNA) and phosphoric acid can be interpreted as sums of the appropriate covalent radii of carbon, nitrogen, hydrogen, oxygen and phosphorus. 2. Bond lengths as sums of the relevant atomic covalent r adii (this work) Pauling [4] observed that in all the bases (A, T, C, G and U), the CC bond length is around 1.40 Å and that the CN bond length has values around 1.32 Å and 1.36 Å. On examining the existing bond length data [4 -7], it is found that in fact the CC, CN, CH, NH, CO, OH and PO bonds that constitute the structures of the above molecules have some essentially fixed bond lengths, and that the CC and CN bonds have more values than those suggested in [4]. 3 For the interpretation here of the bond lengths, the atomic covalent radii, Rcov defined in [4] as half the inter-atomic distance between two atoms of the same kind, have been used. Note that C has two types of single bonds as pointed out in [4]: the aliphatic single bonds with a covalent radius of 0.77 Å (as in diamond), and the aromatic single bonds with a covalent radius of 0.72 Å (as in graphite). The covalent double bond radius of C, also defined [4] as half the CC double bond length, is 0.67 Å. The above three radii of C are distinguished here using the denotations as shown in Fig. 1: C(I): 0.77 Å, C(II): 0.72 Å and C(III): 0.67 Å. Similarly, Rcov for N(I) is half the NN covalent single bond length, for N(II) it is half the covalent NN double bond length, for H it is half the HH covalent single bond distance, for O(I) it is half the OO covalent single bond length and for O(II) it is half the OO covalent double bond length. Rcov for P was obtained by subtracting Rcov of O(I) from the known [4, 7] PO(I) bond length. An interesting observation from the data in [4 - 7] is e.g., that the CN bond length of 1.34 Å in the aromatic ring in adenine is the same irrespective of whether it pertains to a single or a double bond. This is similar to the finding in [4] that in the case of benzene, all the six CC bonds are of length 1.39 (+/- ) 0.01 Å with equal bond order of 1.5 (due to resonance, [4]), although represented by three alternating single and double bonds. The CC bond length in benzene (with bond order 1.5) was interpreted in [2] as the sum (0.72 + 0.77 Å) of the radii for C(II) and C(III). Similarly, in adenine, the CN bond distances [4 -7] for N1-C2 (single bond), C2-N3 (double bond), N3-C4 (single bond), and C6-N1 (double bond) (see column 2, Table 1) in the aromatic ring all have the same value of 1.34 (+/-) 4 0.02 Å. This is interpreted here as the sum of the covalent radii (Rcov) of C(II) and N(II)], which is equal to (0.72 + 0.62 Å). 3. Comparison of the bond length data with the sums of covalent radii The existing bond length data in [4 - 7] and the sums R(sum) of the relevant atomic covalent radii (given in Fig. 1) are assembled in Table 1 for A, G, T and C and in Table 2 for U, ribose, deoxyribose and phosphoric acid. In Table 3, the sums of the atomic radii, R(sum) (in column 2) are compared with the average values (in column 3) of the bond lengths in [4- 7] (given in Tables 1 and 2). Fig. 2 shows a plot of the bond lengths versus R(sum). It can be seen that of the twelve bond lengths, R(sum) is close (+/- 0.03) to the average values [4 -7] for ten of them, and only two values deviate by 0.04 and 0.07 Å from R(sum), due to the higher values in [6, 7] than in [4, 5]. Therefore, it is concluded here that it is reasonable to interpret the bond lengths in the purines (A and G), pyrimidines (T, C and U), sugars (ribose and deoxyribose) and phosphoric acid, as sums of the appropriate covalent radii (Rcov) of the atoms involved. Thus, the molecular structures conventionally written as in [1, 4 – 7] have now been resolved into atomic structures based on the individual atomic radii. See Fig. 3 for thymine (T) and adenine (A), Fig. 4 for cytosine (C) and guanine (G) and Fig. 5 for phosphoric acid, deoxyribose, uracil (U) and ribose. For these molecules, while confirming Pauling’s [4] two CN bonds of lengths 1.32 and 1.36 (+/- 0.02 Å) and one CC bond length of 1.40 Å, the sums of the atomic radii show here (see Table 3) that actually there are 1) 5 five CN bonds (1.29, 1.34, 1.37, 1.42 and 1.47 Å), 2) three CC bonds (1.39, 1.49 and 1.54 Å), 3) two CO bonds (1.27 and 1.44 Å) and 4) two PO bonds (1.52 and 1.59 Å). The bond lengths of C, O and N with H are: O(I) - H (1.04 Å), N(I) - H (1.07 Å), C(III) - H (1.04 Å), C(II) - H (1.09 Å) and C(I), H (1.14 Å), (not given in Table 1 - 3). Note that the increase in the CH bond length in going from aromatic to aliphatic C is attributed by Pauling [4] to the change in the radius of H, whereas here it is attributed to those of C. Thus, the sums of the covalent radii of the individual atoms (see Fig. 1) constituting the bonds are shown to be fair representations of the average bond lengths [4 -7] (see Table 3) of all the individual molecular components of DNA and RNA. This has enabled the establishing of the most probable atomic structures as shown in Figs. 2 – 5. During the formation of the Watson-Crick base pairs [1, 3] by hydrogen bonding, the donor to acceptor bonds become partially ionic (electrostatic) by the sharing of the transient proton as described in [3]. Acknowledgments The author is grateful for the grants, AVOZ50040507 of the Academy of Sciences of the Czech Republic and LC06035 of the Ministry of Education, Youth and Sports of the Czech Republic. 6 References [1] J. D. Watson and F. H. C. Crick, Nature (London), 171 (1953) 737. [2] R. Heyrovska, a) Mol. Phys., 103 (2005) 877 and b) International Joint meeting of ECS, USA and Japanese, Korean and Australian Societies, Honolulu, Hawaii, October 2004, Vol. 2004 - 2, Extd. Abs. C2-0551. http://www.electrochem.org/dl/ma/206/pdfs/0551.pdf [3] R. Heyrovska, Chem. Phys. Lett., 432 (2006) 348, and the literature cited therein. [4] L. Pauling, The Nature of the Chemical Bond, Cornell Univ. Press, Ithaca, New York (1960). [5] A. L. Lehninger, Biochemistry, Worth Publishers, Inc., New York (1975). [6] M.

Atomic Structures of the M Olecular Components in DNA and RNA Based on Bond Lengths As Sums of Atomic Radii

The Dependence of the Sio Bond Length on Structural Parameters in Coesite, the Silica Polymorphs, and the Clathrasils

The Nafure of Silicon-Oxygen Bonds

Bond Distances and Bond Orders in Binuclear Metal Complexes of the First Row Transition Metals Titanium Through Zinc

Bond-Length Distributions for Ions Bonded to Oxygen: Results for the Lanthanides and Actinides and Discussion of the F-Block Contraction

Chapter 5 the Covalent Bond 1

CH 101Fall 2018 Discussion #13 Chapter 10 Your Name: TF's Name: D

The Electronic Structure of Molecules an Experiment Using Molecular Orbital Theory

Bond Length and Radii Variations in Fluoride and Oxide Molecules and Crystals

Bond Lengths and Bond Valences of Ions Bonded to Oxygen

Atomic Structures of Graphene, Benzene and Methane with Bond Lengths As

Assessing Phosphine−Chalcogen Bond Energetics from Calculations Samuel R

Chapter 9 2010