Integrative Molecular Medicine

Review Article ISSN: 2056-6360

Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life Kozo Nagano* Nagano Research Institute of Molecular Biology, 4-8-24 Higiriyama, Kohnan-ku, Yokohama 233-0015, Japan

Abstract Origin of life on earth must be started from a story explaining how ribonucleic acids (RNAs) had been accumulated in azoic era. Since the world of nonazoic era could have been full of water, a single molecule of RNA should have been in danger of hydrolysis rather than in a tendency of progressive synthesis. It necessarily requires coevolution of proteins that had been playing a role of protecting RNA from hydrolysis. The present article describes a possibility of a large-scale of confined organic phosphates and methane hydrates under high pressure and temperature deeply hidden into the mantle of the earth. In such conditions, a polymeric form of D- can be made as a left-handed helix connected by base pairs with neighboring strands. In the lattice of left-handed helices of RNA, a lot of spaces or gaps allowed accumulation of various amino acids, from the smallest one such as glycine to not very large as tryptophan. At the 3’ end of a polymeric RNA strand, aminoacylation could also occur. The other essential substances to the origin of life, such as adenine-triphosphates, could have been accumulated together.

Introduction are mainly reduced metals and metal oxides. Surface temperature of primordial earth would have been pretty high by thermal energy Fossils have shown that biochemically advanced microbial life produced by collisions of comets and meteorites, but would have had existed on earth over 3 billion years ago. Since the age of earth is been cooled down when its core became occupied by components believed to be about 4.6 billion years, it would be natural to think that of meteorites and its surface was covered by those from comets. The prebiotic molecules of nucleic acids and proteins had been accumulated above process in evolution of earth should be referred to as the first either shortly after or at the same time of the birth of the earth, the stage. The second stage in the history of earth started from temperature surface of which at that time was magmatic ocean covered by a thick elevation in its core by accumulation of thermal energy originated from layer of vaporized water molecules and various gaseous molecules, disintegration of uranium atoms. At this stage, the surface temperature rather rich in hydrogen molecules. Glycine, α-amino-isobutyric acid, would have been as cool as that of comets. As temperature in the core alanine, glutamic acid, β-alanine, isovaline, aspartic acid, and leucine of the earth was elevated, water vapour would have been concentrated were observed in the Murchison meteorite [1]. Although they reported in the atmosphere of the ancient earth, while its surface would have the relative abundance of L-enantiomer, it might not be the cause been rather a magmatic ocean composed of exposed hot mantle, on of the origin of the prevailance of L-enantiomer on the present-day which hot rain would have been falling to make a lot of boiling lakes. earth [2,3]. In the Yamato-91198 meteorite, serine, threonine, proline, The boiling lakes might have grown up to form a boiling ocean. It valine, isoleucine, sarcosine, α-aminobutyric acid, β-aminobutyric should be referred to as the third stage. The fourth stage in the history acid, γ-aminobutyric acid, α-aminoisobutyric acid, α-aminoadiopic of earth would be in the cooling process. When the first signs of the acid, various aliphatic carbonic acid, and aliphatic as well as aromatic ancient life existed over 3 billion years ago, the evolution of RNA had hydrocarbons were observed. However, no sign of nucleic acids has already reached a ripening period. The basic hypothesis adopted in the been observed in meteorites except some purines and pyrimidines. present article is that the evolution of RNA started from the end of the Some of them might have been synthesized by a shock of thunder, first stage and continued until the end of the third stage. while some of them, such as nucleic acids in particular, might have been destroyed on a boiled magmatic ocean. Accordingly, it remains Accumulation of organic phosphates and aromatic an etigma how nucleic acid had been formed using D- as its amines main component among many other candidates. The present article tries to present a hypothesis that solves the problem of multiple A mineralogical form of the mixture of water and methane at a difficulties to reach the origin of life from the origin of earth. very low temperature under a high pressure would have been that Evolution of earth The origin of earth has been supposed to be a gathering center of Correspondence to: Kozo Nagano, Nagano Research Institute of Molecular meteorites and comets. A core of comet is usually composed of ice water Biology, 4-8-24 Higiriyama, Kohnan-ku, Yokohama 233-0015, Japan, E-mail: and dry ice (carbon dioxide) as well as methane, ammonia, silicates, [email protected] and phosphates. On the other hand, the components of meteorites Received: April 10, 2017; Accepted: April 21, 2017; Published: April 25, 2017

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 1-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

of methane hydrate that is known to be very inflammable in contact oxygen atoms. Accordingly, various amino acids without phosphorus with oxygen gas. This phenomenon would mean that methyl radical atom would have been accumulated under the first layer of petroleum is easily formed by reacting with an oxygen atom. Since the primeval oil and fatty acids. The hydrogen-bonding network of methane hydrate atmosphere is supposed to be rich in hydrogen molecules, the most would have been in the second layer mixed with ammonia, sulfate (or of the reactive oxygen atoms would have been reserved in a form of mercaptan), phosphin, amino acids, alkylamines, etc., and made a ferrous oxide brought about by meteorites. Accordingly, one of the direct contact with ferrous oxide and silicates. Methanol radical as a conceivable starting events of abiogenic biological substance synthesis reaction product of methyl radical and water would have been trapped would have been a contact of the methane hydrate and ferrous oxide inside the hydrogen-bonding network and would have resulted in in the deep center of the gaseous earth under a high pressure and a formation of glyceride and phosphoglyceride. Glyceride would have low temperature. Disintegration of uranium atoms might have played been a starting material for various synthesis including various a role in producing oxygen radical, which might also be an origin of . The reaction between phosphoglyceride and fatty acids the formation of hot mantle and core of the earth by concentration would have formed a phospholipid as a single layer between the first of melted reduced iron in the center of the earth. The possible various petroleum oil layer and the second or third layers rich in hydrogen- chain reactions from the methyl radical would have produced various bonding capacities. Accumulation of polyphosphate might have been aliphatic and aromatic chains and fatty acids. Specific gravities of such important at this stage, because the leading role of organic reactions in organic compounds synthesized abiogenically on the primeval earth are evolution of both earth and life would be shifting gradually from the supposed to be heavier than that of methane hydrate that carries much radical reaction described above to polymerization by dehydration of higher content of hydrogen atoms. Most of organic compounds brought amino acids and phosphosugars through reaction intermediates such about by meteorites would have been mixed with the newly synthesized as sugar triphosphates. ones. Those were precipitated deeply in order of specific gravity at the It is natural to suppose that the high concentration of water vapor bottom layer on the mantle of the earth. Since the mantle were not kept the surface temperature not too high from that of boiling water. well separated enough from the core, those organic compounds of If the temperature was not too high, there could be enough amount approximately uniform composition had an intimate contact with the of organic compounds covering the surface of the magmatic ocean, rocks of metallic meteorites containing ferrous oxide and uranium as where polymerization of small organic compounds by dehydration well as silicates, sulfates, and phosphates. The first organic compound reaction would have been prevailing, phosphorylation of polymerized synthesized in this way would have been methanol, which could have organic compounds could have been processed in the melted solution occupied the lattice position of water molecule in the crystalline form of phosphate and silicate on the magma. The reason why sulfur atoms of methane hydrate. Condensation reaction between two methanol were not involved in the process of the polymerization reaction could molecules with an oxygen radical would have produced a molecule of be due to the boiling and destructive temperature for sulfurylated diose that is a simplest form of sugar containing two carbon atoms. compounds compared to that of phosphorylated compounds as well A similar condensation between one molecule of diose and methanol as to the delicately adjusted surface temperature of the earth. It is, would have produced a without isomers. This is called glycerol. Phosphorylation of glycerol produce glycerophosphoric acid, which however, important to note that a primitive sulfur-oxidizing bacteria has three isomers. Triose molecules were synthesized just below the was found at submarine thermal springs on the Galapagos rift [4]. It interface with the methane hydrate layer, but deposited deeply in the utilizes chemosynthesis to derive entire energy supply from reactions bottom of the mantle. Further evolution of organic compounds from between the sea water and the rocks at high temperatures, rather triose could be independent of the presence of the methane hydrate. than photosynthesis. At the era of Origins of Life, sulfur oxidizing It is, however, important to assume that the main chemical reactions bacteria could have played a big role, but Corliss, et al. (1979) did not were promoted by oxygen radicals. A similar condensation reaction describe anything about how the ancestral bacteria obtained energy- between two could have produced , and a simultaneous rich phosphate compounds and amino acids from the hot spring condensation of three trioses with two oxygen radicals would have water. There could be a possibility of sulfur-containing RNA instead produced glycerols. Such a mixture of hexoses and glycerols could of normal phosphor-containing RNA. As described in the preceding have made a contact with an energy-rich polyphosphoric acids in the section, accumulation of phosphoglyceride must have occurred under rocks. Polyphosphates could also be formed inside a rock probably by the petroleum oil and phospholipids layers before evolution of RNA. intervention of oxygen radicals. It would have been a considerable period The reaction between phosphoglyceride and methyl radical in the before accumulation of phosphohexoses and phosphoglycerols were methane hydrate network by oxygen atoms supplied from ferrous completed. Such reactions could have proceeded at a low temperature. oxides would have produced various phosphorylated pentoses and As the temperature of the mantle started elevating, dehydration hexoses. Water molecules in the hydrogen-bonding network would reactions between phosphohexoses and between phosphoglycerols be replaced by those phosphorylated as well as various amino and fatty acids gradually started syntheses of polyphosphohexoses acids and alkylamines. On the other hand, Martin, Baross, Kelley and and phospholipids, both of which were separated from each other in Russell [5] found that the chemistry of the H2-CO2 redox couple in accordance with their physicochemical characters. hydrothermal vents showed a striking parallelism with that of the core energy metabolic reactions of some modern prokaryotic autotrophs. Accumulation of phospho-D-ribose polymer as left- Thus, there could be another question about what mechanism initiated handed single-stranded helices the chemistry of life in such an environment as the hydrothermal vent areas. The basic hypothesis of the present article tries to throw a light One of the points in the present hypothesis is that the primeval on that point. earth would have been covered by aliphatic and aromatic compounds including their derivatives such as fatty acids, because the densities of Condensation of glycerophosphoric acid with diose leads to nitrogen-, sulfur-, and phosphorus-containing organic compounds are various isomers of ribose, among which phosphor-D-ribose is the supposed to be larger than those containing only hydrogen, carbon and most prospective. On the other hand, condensation reactions with

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 2-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

carbon radicals and ammonia could have accumulated aromatic ring compounds such as uracil, thymidyl, cytocyl, adenyl, guanosyl radicals etc, among which some hydroxyl ring compounds such as dihydrouracyl would have been included. Joyce (1989) has shown various courses of producing adenine, diaminopurine, hypoxanthine, guanine, and xanthine, starting from hydrogen cyanide (HCN), and passing through diaminomaleonitrile (DAMN) [6]. He has also shown that condensation of glycoaldehyde (HC=OCH2OH) with hydrogen carbon monoxide (H2CO) produces aldotetrose (), , , ribose, , , (these are pentoses), and , , , tagalose, , , , , , , , (these are hexoses). The possible reactions of the phospho-D-ribose with uracil radical, which is the Figure 1. Stereoview of 6-fold left-handed single-stranded helix of 5’-phospho-D-ribose. most predominant species among the aromatic compounds mentioned above, could have reached the goal of uridine. One of the biggest merit of 5’-phospho-D-ribose to the other conceivable sugars and sugar phosphates is that it can geometrically be extended infinitely as a left- handed single-stranded helix. The uracil bases planted at C1’ position allows a considerable stability for hydrogen-bonding capability with the neighboring left-handed single-stranded helices. Even if ideal Watson- Crick base pairs of adenine-uracil (A-U) and guanine-cytosine (G-C) or a wobble pair of guanine-uracil (G-U) are not available, uracil- uracil (U-U) base pair guarantees a relative stability. Since some tRNA molecules of animaria mithocondria contains a considerable amount of U-U base pairs in their double stranded helical regions [7]. The reason why the present author emphasizes very much about uridine 5’-phosphate is that it could have a possibility of infinitely large lattice formation with hydrogen-bonded U-U base pairs and left-handed Figure 2. Stereoview of three 6-fold left-handed single-stranded helices of 5’-phospho- D-ribose hydrogen bonded by U-U base pairs, which are shown in a different direction single-stranded helices shown in Figure 1. The distance between two from that of Figure 1. uracil bases is 7.2 .

If this is 3.6 Å, which is usually the best distance for base stacking, the ring of D-ribose has a fatal near-contact with the neighboring D-ribose ring atÅ ± 6th position on the left-handed single-stranded helix. In that sense, the distance of 7.2 between the two uracil bases would mean that there is a possibility of intercalate another aromatic bases such as adenine, guanine,Å etc, between the layer of uracil-uracil hydrogen-bonding network. Figure 2 shows a part of such an infinite helix surrounded by three similar infinite helices stabilized by making U-U base pairs. Such a network of polyuridine chain protects itself from being hydrolyzed by hot water. Figure 3 shows that adenine triphosphate (ATP) can be stacked between the layer of uracil bases. Similar stacking of guanine triphosphate (GTP), and the other nucleotide polyphosphates could occur in the Figure 3. Stereoview of the contact area of the three infinite helices intercalating one cavity to save the material, even though simple polymerization of molecule of adenine triphosphate (ATP) which is drawn in green. such materials are not favorable in order to grow up the polyuridine layer. Joyce (1989) listed up the coenzymes in the present-day living organisms that resemble nucleotides, such as UDP-glucose, CDP- diacylglycerol, S-adenosylmethionine, flavin mononucleotide, thiamine pyrophosphate, nicotinamide, etc.. Those are all capable of being saved in the cavity [6]. The components of cell membranes such as phospholipids are also the competent candidates to be saved in similar way. Even amino acids are capable of being important bulking materials if their side chains are not too big. Polymerization of such small amino acids could have occurred by simple dehydration reaction. Figure 4 shows a case of L-amino acids polymerization growing up in the hole of cavity, even if it is not a right-handed α–helix.

The layers of the 6-fold symmetrical left-handed helices of Figure 4. Stereoview of polymerized peptides, amino acid sequence of which are Gly- polyuridines cannot be estimated about their thickness. Their Pro- Gly-Asp-Ala-Ala-Ser-Val. The conformation of the peptides in blue is not α–helix.

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 3-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

outside surface would have been in contact with hot water vapor. Accordingly, it would have been covered by a thick skin of layers rich in phospholipids just like a big cell membrane, below which another thick layer of methane hydrates, ammonia, D-, and L-amino acids, and the other components in a state of chaotic mixture. The movement of the outside skin area would be supposed to be so violent that any regular sophisticated structure such as anticodon loop could not have a chance to evolve. In contrast, another contact surface with mantle, which contain crystalline structures of silicate, metal oxides and also variety of concave and convex structure, would have fitted to grow a special structure of polyuridines. Figure 5 shows that an amino acid glycine, which is drawn in pink, bonded with its carboxyl end to 3’-end of polyuridine at its bottom region of the left-handed single-stranded Figure 7. Stereoview showing a dipeptide Gly-Pro is bound at 3’-O of polyuridine helix. This is an ancestral form of transfer RNA (tRNA) and is going to terminus on the long helix in brown which is separated from the other part of long helix polymerize with proline for further polymerization of L-amino acids. (just one turn) in yellow. The nucleotides in magenta is a part of anticodon loop of ordinary tRNA from U31 to A39 (U-OMC-U-C-C-A-MIA-A-A), the anticodon of which is for Gly. N-terminal end of glycine in pink is bonded to carboxyl end of proline The rest of polyuridine in helix is drawn in orange for P atom, red for O, and white for C. in blue. This kind of stereo-specific chemical reaction could never have evolved on the outside surface in contact with hot water but was possible on the surface of silicates. Figure 6 also shows that the largest side chain of tryptophan among the essential 20 kinds of L-amino acids could be accommodated in this environment. As the temperature and the pressure gradually lowered at the bottom of the polyuridine layers in contact with the mantle rocks, the aminoacylation between 3’-end of uridine and the carboxyl end of amino acids became harder without a supply of energy transfer from the reaction of ATP to ADP

+ H3PO4. Figure 7 shows a similar state to that of Figure 8 but ATP not Figure 8. This picture corresponds to Figure 7 showing the anticodon in violet. Besides, yet inserted, while Figure 8 such a change by inserting the structure of ATP molecule is drawn in gold. The reason why ATP sits in contact with 3’-O of uridine ATP between the transition state of dipeptide Gly-Pro bound at 3’-O and carboxyl end of Gly is that an extra amount of energy is needed for aminoacylation during the course of lowering temperature and pressure of the environment.

of polyuridine terminus on the long helix in brown which is separated from the other part of long helix (just one turn) in yellow. The nucleotides in magenta is a part of anticodon loop of ordinary tRNA from U31 to A39 (U-OMC-U-C-C-A-MIA-A-A), the anticodon of which is for Gly. It is, however, important to notice that polymerization of L-amino acids seems far more favorable than that of D-amino acids. Only one D-amino acid in a consecutive L-amino acid peptide could be involved, but does not seem favorable. Such a predilection for L-amino acids becomes much clearer in the ancestral form of amino acylation as shown in the following stereo pictures showing from glycine (Gly) in Figure 9 to tryptophan (Trp) in Figure 31. In these pictures O4 atom of U33 is not hydrogen-bonded to the phosphate oxygen of 35th position Figure 5. Stereoview of dimerization of glycine and proline. Glycine drawn in pink is that is always the case for all crystallographic results of tRNAs [8]. It is bonded to 3’-end of polyuridine at its bottom region of the left-handed single-stranded known that U33 is highly conserved for all tRNAs. The anticodon loop helix. N-terminal end of proline in blue is bonded to carboxyl end of glycine. holds in its crystal structure a magnesium ion in the pocket, but under a low magnesium concentration of 1 mMol, that is about the same as that of outside the cell, the loop behaves much more flexible than that of crystalline form [9]. This is one of the most important point in the evolution of life for capability of selection for all L-amino acids and glycine in its pocket. At this stage of evolution of life. The nucleotide at the 33th position might not have to be uridine, but U33 seems more favorable for performing a multifunctional role in translation [9]. At the periphery of the polyuridine layers condensation of amino acids with its carboxyl end to 3’-end of a terminal uridine promotes polymerization of small amino acids like glycine, alanine, serine, valine, threonine, and proline. The anticodon loop structure seems to have evolved spontaneously so that it could catch only L-amino acids. It can Figure 6. Stereoview of dimerization of tryptophan and proline. Tryptophan drawn in pink be seen that D-amino acids were very hard to be accommodated in the is bonded to 3’-end of polyuridine at its bottom region of the left-handed single-stranded cavity of the anticodon loop. However, how the respective nucleotide helix. N-terminal end of proline in blue is bonded to carboxyl end of tryptophan. This picture is almost the same as that of Figure 5 except the side chain of tryptophan which is sequence of the anticodons were favorable for the nominated amino the largest among all L-amino acids. acid by the chemical structure of the codons, such as, N6 of A, O6 and

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 4-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

Figure 9. This picture shows an ancestral form of aminoacylation for Gly. The other parts are the same as the preceding picture, except for Cα in orange, Ccarboxyl in blue, and Figure 13. This picture is the same as Figure 9 except for Arg. Cα, Cβ, Cγ, Cδ, and Cζ two carboxyl oxygens in red. This picture is for that of transition state before one of the are drawn in orange, and Nε, NH1 and NH2 are in blue. Anticodon nucleotides in skyblue carboxyl oxygen is removed by dephosphorylation of ATP. are CCU.

Figure 14.This picture is the same as Figure13 for Arg except for anticodon. Cα, Cβ, Cγ, Figure 10. This picture is the same as Figure 9 except for Ala. Cα and Cβ are drawn in Cδ, and Cζ are drawn in orange, and Nε, NH1 and NH2 are in blue. Anticodon nucleotides orange. Anticodon nucleotides in skyblue are CGC. in skyblue are CCG.

Figure 11. This picture is the same as Figure 9 except for Pro. Cα, Cβ, Cγ and Cδ are drawn Figure 15. This picture is the same as Figure 9 except for Val. Cα, Cβ, Cγ1 and Cγ2 are in orange. Anticodon nucleotides in skyblue are CGG. drawn in orange. Anticodon nucleotides in skyblue are CAC.

Figure 12. This picture is the same as Figure 9 except for Ser. Cα and Cβ are drawn in Figure16. This picture is the same as Figure 9 except for Thr. Cα, Cβ and Cγ1 are drawn in orange, and Oγ is in red. Anticodon nucleotides in skyblue are GCU. orange, and Oγ2 is in red. Anticodon nucleotides in skyblue are CGU.

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 5-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

Figure 17. This picture is the same as Figure 9 except for Ser. Cα and Cβ are drawn in Figure 21. This picture is the same as Figure 20 for Leu except for Anticodon. Cα, Cβ, Cγ, orange, and Oγ is in red. Anticodon nucleotides in skyblue are CGA. Cδ1 and Cδ2 are drawn in orange. Anticodon nucleotides in skyblue are CAA.

Figure 18. This picture is the same as Figure 9 except for Met. Cα, Cβ, Cγ and Cε are Figure 22. This picture is the same as Figure 9 except for Glu. Cα, Cβ, Cγ and Cδ are drawn in orange, and Sδ is in green. Anticodon nucleotides in skyblue are CAU. drawn in orange, and Oε1 and Oε2 are in red. Anticodon nucleotides in skyblue are CUC.

Figure 19. This picture is the same as Figure 9 except for Ile. Cα, Cβ, Cγ1, Cδ and Cγ2 are Figure 23. This picture is the same as Figure 9 except for Asp. Cα, Cβ and Cγ are drawn in drawn in orange. Anticodon nucleotides in skyblue are GAU. orange, and Oδ1 and Oδ2 are in red. Anticodon nucleotides in skyblue are GUC.

Figure 20. This picture is the same as Figure 9 except for Leu. Cα, Cβ, Cγ, Cδ1 and Cδ2 Figure 24. This picture is the same as Figure 9 except for Gln. Cα, Cβ, Cγ and Cδ are are drawn in orange. Anticodon nucleotides in skyblue are CAG. drawn in orange, and Oε1 is in red, and Nε2 is in blue. Anticodon nucleotides in skyblue are CUG.

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 6-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

Figure 25. This picture is the same as Figure 9 except for Asn. Cα, Cβ and Cγ are drawn in Figure 29. This picture is the same as Figure 9 except for Tyr. Cα, Cβ, Cγ, Cδ1, Cδ2, Cε1, orange, and Oδ1 is in red and Nδ2 is in blue. Anticodon nucleotides in skyblue are GUU. Cε2 and Cζ are drawn in orange, and Oη are in red. Anticodon nucleotides in skyblue are GUA

Figure 26. This picture is the same as Figure 9 except for Lys. Cα, Cβ, Cγ, Cδ and Cε are Figure 30. This picture is the same as Figure 9 except for Cys. Cα and Cβ are drawn in drawn in orange, and Nζ is in blue. Anticodon nucleotides in skyblue are CUU. orange, and Sγ is in green. Anticodon nucleotides in skyblue are GCA.

Figure 31. This picture is the same as Figure 9 except for Trp. Cα, Cβ, Cγ, Cδ1, Cδ2, Cε2, Figure 27. This picture is the same as Figure 9 except for His. Cα, Cβ, Cγ, Cδ2 and Cε1 are Cε3, Cζ2, Cζ3 and Cη2 are drawn in orange, and Nε1 are in blue. Anticodon nucleotides drawn in orange, and Nδ1 and Nε2 are in blue. Anticodon nucleotides in skyblue are GUG. in skyblue are CCA

N2 of G, N4 of C, and O4 of U. Accordingly, there could be a lot of mistakes in selecting amino acid by the nucleotide combination of anticodon. In order to enhance the accuracy of amino acid selection by the combination of anticodon, we must wait a long time before evolution of aminoacyl tRNA synthethase as well as tRNA itself. As is well known, the length of tRNA has become evolved to about 76 nucleotides of clover-leaf shape. The total three-dimensional structure of tRNAs are referred to as the L-shaped form, the waist region of which is particularly important for amino acid type recognition. On the other hand, the evolution of aminoacyl tRNA synthethase seemed to have started to make a dimer of a couple of synthetase and tRNA. Because of the elongation of the distance between the 3’-end and the anticodon Figure 28. This picture is the same as Figure 9 except for Phe. Cα, Cβ, Cγ, Cδ1, Cδ2, Cε1, loop of a tRNA during the course of evolution, the 1 to 1 recognition Cε2 and Cζ are drawn in orange. Anticodon nucleotides in skyblue are GAA of anticodon with its 3’-end of tRNA became more and more difficult,

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 7-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

so that a pair-wise recognition of anticodon loop with its 3’ end of the partner tRNA might have been more easier to recognize. The evolution of aminoacyl-tRNA synthetase (aaRS) has led to two classes, Class I and Class II, [10,11] based on structural sequence features of the catalytic core. Each aaRS recognizes a single amino acid and attaches it to the 3’- end of a tRNA whose anticodon matches a codon for that amino acid. According to Ibba and S ll [12], however, there exists many exceptions to these rules, and many aaRS additionally rely on various proofreading or editing mechanisms ӧto ensure the translation to the genetic code. In order to charge a tRNA, the amino acid must be first activated with ATP by the aaRS to form an aminoacyl-adenylate intermediate and then transferred onto the terminal adenosine of the tRNA [12]. After aminoacylation, two tRNAs are able to be arranged by a Figure 33. Stereoview of 6.5-fold symmetrical left-handed single-stranded polyuridine. This structure is also capable of base-pairing with the adjacent layer. The cavity in the layer short sequence of nucleotides so that a dipeptide of Pro-Gly may be is a little wider than that of 6-fold symmetrical one. preferable for supporting the total structure of the system such as a part of aminoacyl tRNA synthetase. Figure 32 shows the simplest case of protein synthesis showing an ancestral form of mRNA and synthesized protein Pro-Gly. Accuracy of protein synthesis in accordance with the prescription of mRNA could be best for Gly and Pro at the first stage of evolution of life. As accuracy of aminoacylation is enhanced, a variety of peptide becomes ready for preparation by use of a special set of mRNA prepared for a necessity of growth of nucleotide layer. Since the number of L-amino acids was increasing, the cavity of the 6-fold symmetrical layer became a little too narrow, so that the next Figure 34. This picture is the same as Figure 33, but seen from a different direction in stage of more compact and a little wider in cavity might have been parallel with the axis of that of the left-handed helix. useful. Figures 33 and 34 show two stereo views of 6.5 fold symmetrical left-handed single-stranded polyuridine. In this case, there is no need of worrying about the short contact between the nucleotides of two nearest neighbors, and the stacking base distance is also all right. However, a lot of pyrophosphonucleotides and phospholipids are excluded. Figure 35 shows that a peptide composed of L-amino acids having rather small side chains such as Ala, Ser, Val, Pro and Gly can be accommodated inside the cavity of the left-handed helical cylinder, but those having large side chains could have a tendency to break the regularity. Arg residues in particular tends to disrupt the regular shape, particularly because the helical cylinder has a consecutive line of phosphate groups inside, resulting in a familiar shape of right-handed double-stranded polyuridine helices. Since such a role of Arg residues were so important, Arg occupies a comparably large part in the Table Figure 35. Stereoview of α–helical structure of peptides drawn in blue and composed of L-amno acids, sequence of which is Pro Gly Asp Ala Ala Ser Val. of the genetic code in comparison with that of Ser, if we are allowed to think that Gly, Ala, and Pro were comparatively important to the other Similarly, Leu and Val residues seem to have been more important in amino acids in the earlier stage of evolution of L-amino acid proteins. playing a role of making hydrophobic side of α–helices and β–sheets. The roles of Glu, Asp, His and Lys could have been evolving as players of catalytic activities. It is well known that some particular form of RNAs could have a catalytic activity [13-15]. However, there seems to be a handicap of flexibility in the 3D-structure of RNA, so that the evolution of proteins having Glu, Asp, His and Lys residues became more and more important during the course of the origin of life. Also, important to consider the double-stranded right-handed RNA helices, it is immediately related with the importance of the canonical G-C and A-U base pairs and G-U wobble base pair. The growing of double- stranded regions of RNA made it possible of the evolutions of tRNA, mRNA and ribosomes of both large and small subunits. Evolution of ribosome

Figure 32. Stereoview of two adjacent anticodon loops, A-site tRNA is drawn in magenta The significance of the ribosome in the origins of life consists in the and P-site tRNA in green. Anticodon nucleotides at the A-site are drawn in skyblue, while movement of mRNA between two subunits of the large and small RNA those at the P-site in orange. Messenger RNA for A-site is in blue, while that for P-site is fragments. At the beginning of the origin of life, when no protein factors in gold. The rest of the nucleotides for A-site tRNA is in brown, while that for P-site tRNA is in yellow. The peptides at A-site and P-site are Gly and Pro, respectively, and drawn in such as elongation factors Tu and G as well as rRNA binding proteins dark blue. existed, there was no energetical barriers between A and P sites, and

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 8-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

also between P and E sites. Brownian motions of the conformational as shown in Figure47. The reason why some bases are unpaired in this changes performed the translocation of the ancestral forms of tRNAs model is that the corresponding nucleotides in Eukarya, as shown in accompanying splitting and uniting of RNA fragments. On the other Figure 44, are G-G unfavorable pairs. hand, the structures of the present-day ribosome are too big. Figure 36 shows the secondary structure of Esherichia coli 16S rRNA, which U33-extended and U33-folded forms of the anticodon consists of 1542 nucleotides and 21 binding proteins, and Figures 37 loop of tRNA and 38 show 5’-half and 3’-half of the secondary structures of E. coli 23S Uridine 33 at the 5’-side of the anticodon loop of tRNA is universally rRNA, which consist 2904 nucleotides and 30 binding proteins. There conserved except for cytidine 33 in some initiator formylmethionyl- are 5 different families in the living organisms in the present world. tRNAs. The uridine base of U33 is folded inside the anticodon loop of Those are (a) Eubacteria, (b) Archaea, (c) Chloroplast, (d) Eukarya, tRNAs observed so far by X-ray crystallography, and hydrogen-bonded and (e) Mitochondria. The secondary structures of 52 organisms in (a), to a phosphate oxygen atom of 36th nucleotide of the anticodon [8]. 25 organisms in (b), 27 organisms in (c), 44 organisms in (d) and 52 This structure is referred to as U33-folded form in thisarticle. This organisms in (e) were analyzed, and the references of those sequences conformation of the codon-anticodon heix of U33-folded form does were listed in Nagano and Nagano [7]. The analyses have shown that not seem stable enough. The U33 base can be expected to stick out or to the simplest secondary structures of the ribosome are mitochondrial be exposed if the conformation of the codon-anticodon helix assumes rRNAs of Trypanosoma brucei, 9S small subunit rRNA of which is that of an ideal A-form RNA. Such a structure could be a little bit shown in Figure 39, and 12S large subunit rRNA of which is shown in unstable because of the exposed U33 base compared to the hydrogen- Figure 40 [7]. It is said that the smallest rRNA of mitochondrial protista bonded U33 base of the U33-folded form. And yet, the stability of the Trypanosoma brucei is stabilized by binding comparatively more U33-extended form heavily depends on the stability of the codon- proteins. However, at the period of the ancestral ribosome, evolution of anticodon helix, and seems to play an important role in enhancing the binding proteins is supposed to be immature, and so, the ribosome was accuracy of translation. The relative stability between the U33-folded supposed to stick to the silicate rocks at the bottom of the volcanic hot and extended forms must also depend on the concentration of Mg2+ spring, where the concentration of alkaline metals including Mg2+ ions of the buffer solution used in the experiments. It is said that the Mg2+ are relatively low [4], and temperature and pressure were very high. concentration inside muscle cells is about 1 mMol that is about the The Brownian motion of the ancestral ribosome were suppressed by same as that outside the cells. On the other hand, a Mg2+ concentration the high pressure instead of binding proteins. used for crystallization of whole ribosome is 10 mMol or much higher It has been known that the first step of codon-anticodon recognition in some cases. The conformation of the anticodon loop of tRNAs in only expels the ternary complex of noncognate aa-tRNAs with accuracy the living cell must be more dynamic. The difference in the positively of about 200/10,000 [16]. The mechanism of kinetic proofreading at the charged ions between the living ribosomes and the ribosome crystals step of GTP hydrolysis was proposed to expel the ternary complex of are mainly compensated by K+, if the environment of ribosomes are near-cognate aa-tRNA to enhance the accuracy up to 5/10,000 [17]. about the same as the muscle cells. A recent result of X-ray structure Here, the ternary complex denotes a complex of aa-tRNA, elongation of the decoding site has shown that the G-U base pairs of near-cognate factor Tu (EF Tu), and GTP. It was found, however, the accuracy is codon-anticodon interactions at the first position (between mRNA/ 2+ Leu reduced by the used of high Mg concentration [18]. Thompson, et al. Phe and tRNA2 ) and at the second position (between mRNA/Cys [19] has observed that incorporation of near-cognate aa-tRNA into A and tRNATyr) cannot be discriminated from the cognate G-C and A-U site sharply increases from 6 mMol of Mg2+ concentration and reaches a base pairs, respectively [23]. The sharp increase of the near-cognate plateau at 12 mMol Mg2+. The present author tries to explain how such GTPase reaction between poly(U)-programmed ribosomes and Leu2 a phenomenon could be realized on the 3D structure of the ribosome. ternary complexes from 6 mMol to 12 mMol (Figure 1 [19]) strongly suggests that deprotonation of N3 and protonation of O4 of uridine The 12 nucleotide sequence of 23S rRNA, AGUACGAGAGGA, is base of G-U pair cannot be distinguished from G-C pair [23], and the longest universally conserved among all known rRNAs [20]. The that the conformational change of the anticodon loops from the U33- present author has found that the region of nucleotides 1064-1077 extended to U33-folded form of the ribosomes do not discriminate of 16S rRNA in E. coli sequence is complementary with the region of the near-cognate Leu -tRNA from the cognate Phe-tRNA. Figure 48 nucleotides 2653-2666 of 23S rRNA, that is called sarcin-ricin loop 2 shows a different 3D structure for the near-cognate G-U pair at the (SRL) [7]. This relationship holds for all nucleotide sequences listed first position of the codon-anticodon helix on the basis of the model in the Figure 41 for Eubacteria, Figure 42 for Archaea, Figure 43 for building technique of the present work, in which the U33 base, which Chloroplast, Figure 44 for Eukarya, and Figure 45 for Mitochondria. is drawn in magenta, sticks out from the anticodon loop, although a Since the region 2653-2666 of 23S rRNA is called GTPase binding site, Mg2+ hexahydrate still binds between OP2 atom of YG37 and O2 atom and the nucleotides at the positions 1068-1073 of 16S rRNA constitute of O2’ methylated OMC32. The U33-extended form of the anticodon the 5’-side of secondary structure, helix 35 of 16S rRNA (h35), and loop would destabilize the loop structure. Besides, deprotonation those of 2653-2658 and 2663-2666 of 23S rRNA are the head region from N1-H of guanine base and N3-H of uridine base resulting in the of helix 95 (H95), the relationship can be summarized as shown in movement of the proton to the O4 atom of the uridine base might be Figure 46, in which SpcF denotes spectinomycin footprint site [21]. preferable under the Mg2+ excess conditions. Figure 1 of Demeshkina, The residue number 2660 indicates the sequence of α–sarcin cleavage et al. [23] also shows that P/A-kink of the mRNA is widely open as the site [22]. The left-most structure in Figure 46 plays an important role kinks obtained by the other X-ray structures of the ribosomes [24,25] in GTP hydrolysis, while another structure on the second from the and supported by binding two clusters of Mg2+ hexahydrate. On the rightmost side seemed to be that of transition state, but finally turned other hand, Nagano (2015) demonstrated a different type of P/A- and out to be the binding site of GTPase at the moment of GTP hydrolysis. E/P-kinks, as shown in Figure 49, each of which is supported by one Although such a long helix has been observed by neither X-ray analysis cluster of Mg2+ hexahydrate. The difference of the kink structures of nor cryo-EM, a model building technique can show how it looks like, Nagano [9] from those in the X-ray structures can be rationalized on

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 9-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

tRNA . . 60 mA U C.. . . .C. .G. . . . G C A A U A A . . .G C .G.U.G.U.C. 1090 U. U. .Ψ.T m G. C. 50 G. C. 37G. C. 1100 1110 U G.C . 1130 . . .C.A A. 720 U A.35 .G C 38 112039 .A.G. . .G...... U. .G. . .. .U. .. . C C G U ACG AGC CC UUA UC CUUUG U C C C G G C. 700 .UA GA. . . G C G. G. 710 A. G.23a ...... A...... G. UG C UCG. .A G GG GU. .A G GAA A C. .A AG G G C C G G AU.G.C A UC UG G A GG AA U GC A U. G. A. U. .C 1140 A. . A 1070 G. G. 1150 U G ...... A 36 . U. C. G.1190 G A. . .G.G C G. . . .U GG A C C U UA AG A. .G C G.G A U . C. U A U G U . . U. A. 1180 A U 690 680 . G. 1080 A. G . . 23 G C 730 G.U. G. C. . . .C G. A. G. 1160 670 G. C U U. . . .C G. . C. G. C. G. . G .A G. C U. A.A G. . G. C. U A.40 G 790 1060C. G. . G. . C. . . G U C. . G U 740 .A G A. . . A U G . . U U 1030 G. C.1200 . . .A G . . 1170 A. .A A. . U. A. .G.U. U. A. A.U G A U C A. . A. .C U. C. 34 . .C U. A. G. C. C. G. C . G C . . .A 22G C . . A C G.33a G. U . . A U . A . 660 C. G. . . 1020 U. . U. G. .G C. .A...... G A C G U. A. A. G.800 G. U G A G A A U . . . 3'M .A U ...... C. 1050 .G C C G A. . A G.A C U U U U G.A.A.G. C. 1040 . . . 780 A. .A G. G. U. U A . 1010 C. U. G C . . .GC G 33 A G. . . G. C 750 . .C A. G. C 1210 . A. 1000 . . . .A U G.G G. .C A U. G . . U. . C. A U. . G G. C. .U U. .U 630 U. . C. G A. .A . . U G. C. . .C . .A A C. 640 .U.A. 650 . .A 840.C .A C... .U.C G .A.A C.C.U.G.G.G U.G.C.A.U.C.U.G.A C.U .G.G.C.A.A.G.C 24U. A. .C .G A GU . . 620 C C. G. C. .G .G 990 U. A...... G. G.C . . .U .C U.C G GG C C C. .GU GU AGA C U GA..UUG UU UG C. .U C A. 770 C. G. 810 . .G G. C. A G. G 760 .. . . G G C 32 C. .A 600 590 G. . . A G C C G. . 26.U .U 850 . . ..A . G. A U A. . .G 980 U. A. UA 21 20 A . .C. U.G. .A U . . . . .C. C G A. C. A. .G G .C C U U. C. G.1220 G. G. 610 580 C. G . . .G .C .A . G. .G C A . .U A A C G. C. .A .. . C. A . . G. U .G .G AA G. A. 820 830.G U G. A.C.. .A .A C. G...... G .C C. . C G. 1270 .C G. A. G U C G A C U U C. 970 31.A.A .U 1260 G. C. A.U.A A. . G. . . .G . .U. A. C. . . . .G G.530 A U. .C.A.G.C.U.G.A.A. G.A 860 .C G C .U . . .U A.G G. A.A A C. G . .A .C.U A A U G C. . C. . 470 520 A. .U G. C. .U. G. .G . A. G U. . C. C. . .A 570 C G 880 U. G. A. 960 A. 1250 G . U G. U. C .A . . 25 870 C. U. C. . .A . C.. ..G A. U. G. U. G. C. C. . G U G. C. A.A.A. A. CA A. U 480 . A U A . G .U A .G U. A. 19.G C. 26a C A . . C. C. 1280 460 .U. .C .U.U G C G. C.1230 A A. . . .A . 18. . G. U. . U G .U. .G C. G. U G 890 950 U. G. .U . .. A C ...... A G.A.AA 17 .G .A . C. G. .C G.G.G.A.G.U A C G.G.C.C G A. U. . U .G C.G.U U. .A C. 41C. . 1290 .G 510U. A. .U ... . 900 30C. G. G .G G.A.A . A. .U C.U.C. . .U U G G.AA G C 1240 . C G .A.A.C G 540 . AA..A . . C. . 450 .. . C. 490 U. . 560 A. A .A U..C.A. . G G. AG G. C. . A. 910 27 . A A U.. .U G. C. C G.G. A 29 .G G G C G. G. C. G. . A. 940 .U . C. C. C G .G.G G G A. . G. . A. 2 920 28 930 .C ..A. . . G. A. U. . .G. .C . A .C .A .A.G . U A. U A.C. A. G. 20 U G.A.A.U.U.G .A.C.G G.G.G.C.C.C.G . .G C . 430 . . C. G. C 1340 G. .A 440 U .A . . . .U.C . . . .G.G.U.A.C...... 1380 U. .G 1300 16 . .A. . U G A A G C .A .A CUC U. .A C. .U G U U CC GG GC. . . . A. G. . . G. U A A. .U . 550 .A ..G . A C C U U. G. C. .G C.C U .G...... 500 . .U . A .A.G.U U. . .A C. 1390 G. U. U . A C. GGUU G UAC A. .G U U U .G U G G. . .G.C ..A.. A 10 A. 1510 C. A. A. U. .C.C.G.G U.A.U.G .U .G . A .C A G 30 . 1 C G . G . A. 1350 G. U . . . .G A . . U.A.A.C.C.G.U.A.G.G.G. U.A.A.G.U A. A G. A.. ..G 400 .C UC . 1400 C A. G. . U. . U 420 AG.AA 410 .C.G . 3 G ...... G. 1330 U. . . .C .G .G . A G UU GG C.GU CC. .A C. U. .A U.G . .C U .G C. . A 1370 G. G. A. 390 . .GG . C A.1500 1520 C C G. . 15.. .C C .C 40 U . A G 1530 45 . U. . .AG.C.C.U.GAU.G.C .A.G .A . C . . 43 A. G C. G.1310 A. . .G . .U C. . . .. CG 4 A. C. G. A. G. . 42 U. A. C..C G.G.G.U. A.C G. .U. . C. A U G G U A C. A. .G C. U. G . C. . 380 A C.370 .. . C. A U. .A .AU U. A. U. U. C. U. . .A 50 C. G. A. . .A1360 .G U. G. A. 5'end A. A A G C U. G. A A. . . . G. G. . A C. U. G C C. G. . .C 360 . . A 1490 C C.A A U. A. 1410 . U. . 1320 5' G C.5 C. G. U. .C.G . .A .C G. C .A . A U A G . U. G . . . . C C. .14 C. G. U. G. . G. 350 U.1540 C. A . .A.G C. . 60 70 G. U. 340 U. . .G C A .G.A A.A. 6 80 G C U. 330 G A...... G...... C. . . .C G UC AC G U CA G G AA G A AG U. G. A. A .A.C.. .13 .A . .C G ...A. C.G.G . U.G C.A .G.U.C U.U.U.C.U.U.C. U A. G. A. G U CC G . .. . .G. G 3'end .. AG 90 G. U. G. .C.A.A GG.U.C 1480 A.G.U . G. 100 1420 U. A. 300 320 A G. C. . G .A. .A. C . G. U. .G G.G. . .G C . .G .G . . A U . 310 .A G .C A G. U. 3'm A. . . C.A. .C .U G U C.U. G.C G.6a 44U. A. G . . A. 110 U G G U.C. G. . . . 12 G. U G. U. 290 .A . U A C. G. . . A U .C . . .A . . .C .G U. G. U A. U. .C .G 120 240A. U. 1430.A U.1470 .U .G G C .A C. 280.A .A . . G. . . .U A. U. . A. A.C G A A C G. U. C. G. . . .C .A C. G.7 G. C. .G .G .C G. U. A. 11 .C A A .G G. . . U. U. .A130 G. U. .A .A 250 230 U. A. .U.G G C G. C. .U U. .U. .C .A . .G 1440 .U G .G A. .C A . .C G. C. . 270.A .G G. C.1460 .C .U G. U. C G . C G . . .U G . . U. G. .G.C .G U. A. G. .U G. G. A. U.140 A C. . .G 260 C G . A A.AU 10 220 . . 150 8 A. . .G...... G.C G.A. .G. . .G.. .A.A...... G. 210 C. GG CC U CUU G GG A U CU A CUG A.160 G. U. U ...... A .U A. .U.C C. .G G GGAG. C. .C C..UA. .G A UGGC.A. 1450 G A.G A. G A A.U.C U. . . 200 A. U 180 U C . 170 A. A. A. A. C. .C C. G. A. U. 9.G C. 190 A G .A.C. Figure 36. Secondary structure of E. coli rRNA following the layout style of Gutell (1993). 16S rRNA along with TѰC-stem-loop of yeast Phe-tRNAPhe. 5’, C, 3’M, and 3’m represent 5’ domain for the region of 1-556, central domain for the region of 557-918, 3’-major domain for the region of 919-1396, and 3’-minor domain for the region of 1397-1542, respectively. Small numerals denote the residue numbers of the nucleotides indicated by the respective lines. The helices are numbered by large numbers as Brimacombe (1995).

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 10-32

Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

.A. U A 1070 1080 . . 1090 .C. . . . .U A ...... A.G A G.C.C.AUC.A.U G.AAA G C G U. A A 1510 ...... C.G. . .A 1500 58 . .A. 1520 GA U.U C G G U..G U A . . C U A U .U...... A G... .G G. .. . . U G G C A .A C C GG A A AA UC A GCU GCGU G . 43 . . 44 1100 A. 1060 A U . .G. C.C.U.U.U.U.G.G.U . C.G.G. .U.G.C.G A . . A C G...... C G A A U G A . . 1480 1470 A. U. 1050 C. G. 1490 U G . . . . .A.C A G U.C. U A G G 1110 1460 ...... A. U C 1530 AC C C. G. . .U.G. . 57 . . 59. U U C.C.C.G.G G.A.G.G.C.A.C U. G. U. G. A. G ...... G C.G . . .G G G C C. A.GU C G U G.GC 1040 A. . C A.G.C G. . A. .G 1450 . A 1540 G. C. G. C.G U A G. C. . . 42G U U. A. . . G. C.1550 U. G. 1120 1440 . . .U AA. U..G C A. A . G. U. . . AG. C. C. G.U A.U G C 1030 .C G. . A. .A . . .A G . . . . 56G C. .A A A . U. U. . .A G . . A. C. .G A. 1390 UG CU A .G U.1130 . . . .U G G. C. .G .G 1560 G. . 1430 A C. 53A. U.1400 . U U U. C G . ..G . . . . .A C G .U G A .A U. .G C . . 1380..A A. 1410 . . . 1420 . A. .A 1370 .A. .G A C. G.C GA. .G C. .A. . . .G. .G. . .G. U. . U.U.G.G.U.G.U.U.A.C.U A.G.G.G.G 1020 A. .G.C A. G G C U G U C G G A. .U . . . .G.G ...... A.A.C.U.A.C.A.A.U.G.G U.C.U.C.C .G U .A A U.C G AG.C C G.G.A.A.C G G. .C C. . .A ...... A A..U C C.A U A A. GA. ..A G G . 1140 1360 A . . 54 1590 . A A . 52 1350 G. C.1600 C. A. .G .C . . 51 . . 1570 G U G . .C G U A. U. .C.G 1580 .U .C . . .G U C. A A .A .U .A .A C . U C .G . .C . . G. C. 1010 .U G 1220 .G .G 1330 1340 U . .C . . 50 . C. . . .G .U .A U C.G . G.G.G.C.A.G.G.G .A A A C 1150 .A 1230 A A C . .G ...... C. . U .U U U.G C...C C U G U C C U. A C U. A. . A A U 1610 . C G .G 1320 . G. . . G. 1620 .A 41 .A.G .C .G C. .A U. 1310 G.G. . .G C. .A G. . . .G AA. .U . C. 59a U UA. G. C. G. G. G.A 1000 A. A. . C. G C G A . .C .U 1210 G...... C U. .A .G U. .A 49A. U. C A C. G. .G C. G. 1170 .C .C .G C. 1630 G. C. .G 46C. G. . . A . . . . .A 1240 .A A A. .A G.G.U.AGA. A U. .U 1180 .A.G U. A. . C. .G A. G. A. .. . .G C G 1160 .U A. . .U . .G G 1300 .C C A U.AA . . .C UG C U C . .A III G. C. G .G . . G. . G .A 1640 .C.A . 45 U A 1280 G C.1290 . 60 A. .A. 990 C. G. A .U . . . C. G. 980 A . U 1200 C G G. C. .. C. .A.C G .C . . . G C. G. AG. C. .A.G G. U A 47C. . 48 C G. . . 1250 G C. G. C. G. . .U A.G.U. 1270 . . C G . . 40 .G G .G G . . . . .U.A.A.A U C .A U. C G .A.C G A G 1190 C . . .A . A. . A. .G U A. G.G.G.G.A.G A. .A 970 A .A 960 39 ...... U .U . . . .G .C C C C U C. . 1260 .A.C. . . ..G.UCGU G. . .A GUCC G U. C U.G. . A. . ..C.A.C . 36 . A C . . .G.G.C GG . A 810 U. C. G. . . U C.G.U G A . . A C U. 950 . A 820 G 670 .G G. C . . .U.G .C. .A 26 G II . 940 A .G C C A. .C . A .G . . . .A . G. .U 580 G A .G . . . G. U. C. A. U. . G .C C G C. U. . G C. A. .G U. A. 590 .G .U . U. G A. .C .C A. A . .A A. U. A .G 84037 U . A.800 A . .A . U. . . . U. A. U. .C .U G. U. G C A . . .G . .G . G. U. A. .U A U A 830 G.A U .G .C . . .C G. U. U. U. A. C 570 G 90 G .C .G 680 27G. C. . .. 930 . C.U.U .A.A.A .G U .U A U. .U . .C U A. U. . A. G. A.A A. 790 .U . 32 U. . . U 660 C. G. G. A .G . .A G U. U. A. . .C 850 G. U. U. G C A . . A. . . . 38.G .U A G G. . U. A. C. .G C.G.U. .G .C . C C A C. . A. U.100 . .C A . U. G. 600 A. C . .G.G.35a G. A. .A .C .G A. .U U . .G C. U. .C. 920 A. G G G. . A A. G. .G C. U. . . .U . .. G C A. A. .A.A.610 560 .C . .A G. U G. C.A A AU G 780 ...... G.A .G.A. . . I . .U C A. . . G. G U U. G G U U C C . .G C G A.A.C.A.U G.A A 10 U. 80 G. C. .A . . . U. A .U .U . . A. A .G U G . .650 . . .G. . . . . 530 G. C. U . C. G. .C .. U. A. .C U G.G.G.C G .C C A. .G G A U .C .G 1 . A.2900 A 860 G 690 G G.G .A .G C. G. U. U. G. G. G. C. . A. . . . . G C. 7G C U A G C A.U.U C U G.A A. 28 620 G. .G A. U.2 . C. 3'end . .110 140 ...... G. . .A U G. U .C G .A U C. G C 640 .A 25 U . A. . . .G . .U. 160 A. C .C G.C. 770 . .33 C. G. .G .G 520 G C 20 5'end U A A U. C. . A. A U . .C . . . A .G G. A . . . . A. 29.C G. U U. A . .U .A . . 910 AA U C G U .G .A 540 . U. G U. .A U . . . . A.. .G 630 550 .C G. C. U . .C .C .G C G U G A A ..C 70 A. . . . U G. . . . . G G .G 510 C. G. .G G.A A. . .A .A U .G U.870 U G . C...... C G 6..C .U 8 .A G. C A. . . . U. . ..G AACC . . . .U G . .G .C . 9 U .U 170 U. .U .C G A U A U G...A U U.C . .C A. U. 120 .U .A . .A A. U. A . A. AA . . .UG C5 .G C. G .U A . . G. A .G G. .. . .A.U.G .. .C A. . G. C. .G C. AA.A.G. .. 60 . C G A C A U. .U U. G. A U.C G U.A. .C 10 C. G. .A A. 500 G .A.A 130 U .G 760 G .. .. A. .G A. .C G. . C. .C G.700 .A... . A . A 50 U 150 G. C. C. .G . A. .G . G .G G G G G A G. U. . G. A C. . 490 C. ..A C G 30 .A 900 A. A. .C .C U.34 G. .C.C.C.C.A.A.G A . . C. . . 35 730 A. G G. C. .G A A G G. U. 480 G.180 C. G. . C. .G G 24 .A.G C.3 .C A . A G. . .A C. A. .C G. 880 . U 470.A 40 G. A. U . A. A U.. .4. . A. U G . A 740 . G G G C A G.U.C.A.G .A.G 250 . . U . G. A . . . C. 190 U G . A G. . G A G.G ...... G. . 11 . . A.A . . C.C..G U CA G U C..U C. A. C C.G....G.A.A. . C G . U. U. G. . 23 A C G .G GGG C U... . . A. U.G.C C. G. 440 A. . .. .C ... GAA. A U .G C. G. U A G C . G.CCC.C A. . . 750 A. A. .A . . A. GA. .A .. .G C. A. . .U A. U. 460 .C A. 430 A. . .U A. 210 . . U CU..C G U U. U. C C. U ..A. A 890 C. .A A. U. 710 . U. A. .G .C.A.G G GA 200 C. .U G. A. A.G. C.22 A .C . . . C 720 U. G. A G. C. 420 . 260 G. C G A. . .G G. U. 14 .U . 240 . GG A 310 .C G. C. . .U 450 410 G. C. .C .A. .G .C A . . A-site finger A G C A.A.G G. .A.A. .G . . G A. U A . . C G . G C G.. .C 13 . . . C U .U.G C A .A C G .A. . .A C U. .A . . A . U...... A.G U G .U A G A. U. A A 220 . . A C C. G C . 230 G. A12 19 . . A. . C. . . C. G. ... C A 300 . . 400 G. 270 A GA ..A G C G. .U . U. A. 280 290 A C. 21 .C .C U. .A. .G G. .G.U . G.A.G.C C.A.G.U.G.U.G.U.G.U.G.U.U G.U.G 390 .U . .G G. A. 320 ..A. .C.C .CG A...... GU U . .G U. . . C UC G. . . .G UC GU AC ACG U AA. C.A C. .U . G G A.G.U.A G A G U A. .A G. .AG 380 350 A.A.A U. .C C 16 360 18 340 G.C G.A U.G..A. 370 . . C C. G. 20.C G. C. U. G.. ..G A CA 330 Figure 37. Secondary structure of E. coli rRNA following the layout style of Gutell (1993). 5’ half of 23S rRNA. I, II and III represent domain numbers for the regions of 1-561, 562-1269 and 1270-1646, respectively. Numerals are as in Figure 36.

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 11-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

1870 .A .C . .C.C. G. .A G. A. C. G. C. G. .G C.G A U A . . . . .A G. C.2150 U. A. 2140 G U G . . U. . U. G. G. C. G C 68 1880 . . G. U. 78U. A. G. C. . . 1860. . . A..A G U. .A A. .G.A.U G U U G.A. G. G. A.1810 U U G...... U. . .G C. G. U A A U. A.A GC U U. A. A. U. U. G.1890 2130 C. G. C. G. 1850 G. C. G. C.2160 A C. . . . A.G C . . . .A G C G. A. A. . . A.G G A . 2120 G A. . UA...... A. A. C. G. C. .U G G U G G 1800 C G G C C. 1730 . . . T=5-Methyl U G C.C.A.C.C . U. C. G. . . . C. .C. .A P=Pseudo U U A U. G. G U C. G. . C 77 . .U . . . A.C. G. . . . U. .U C. C. U A 1820 G. UA. . U U A.A.A.G C . . . A. .P 2110 A. U. 2180 . G. 66G C U AA 1900 .A C G . . 1840 . . .P A G. U. 2170 . . U. G. .G C 1910 G . U A U A63 G. . . .C 1920 . . . . C. G. . .C .U A CC G .G . .G.G U. A. U. . . .C G C. U. A. G. C C G G . G. 1790 . G. . . . .U 69 C. G. G. . 65 .G U. 1830 .G C . G 1720 U. . .G A .A...... C.A G ...... U C. 1930 G. U. G .A A A A C A A C G C C C. A G. C. 1740 U.70 . .A.G. . U. U...... A.A.G.G U C. 2100 G U A. U. U..U U U G U.C. A.U G C G G. G. . . . . A A...... U. G.2190 .G G G. G.U G. A. U.A U. U U C C. . .A C 2310 . A 1780 C G.198067 . U U.A.A U. A. C. . U. . . A. 76 U. .A . .A .A A. . G. 1940 A. U. G U AU A 1970 . .C C. U . . . . 1770 G. U. G C. . . .U A. . C. G. G. C. A U U G A C G C. A U . . . . 1960 G. G. . . 2210 84U. A. 1710 G C C. G. C. 1950 A. U. 2200 . . C G . . G...... U ...... U.A . . U A 1750 64G. C. C. G. .G C U A A CG U GACC CG . 2300 C G 1700 . . G C 71 U. U. U .A . . .G. C G . . A . G.C.G .U.U.G.G.G.C. . U A. A. A. . . . U. A. U. . . C. . C U . G. G .G A. G A . G 2220 . .G. A. 62 . U. .A 1760 .C G. 2090 A. U.G.. .. 79 A G.2320 G G G.G.C.A.C C . . .G C G A C A . C. G A.U .A.C .C A G. . . .U .C...... C 1990 A U2230 A. U. .G A. U. .C. C.C G U G. . .A U.C...... U .A U G...... G G .A U. .G C. U A G . A. G. U A A A A C 1670 A G 2270 . .G.G U 1690 A.U . . U. U. . C 1680 G. U. G.A. C. G.A. . U. G. A. G.U 2330 G. C. C G A. .A G. C. A. .A 75. . . . G C U .A U. .G U. P-LoopA. .A . . A. C. A. .G. U. .C U. A.2340 G C G. G G. C. G. 83 . . .G . C G U. G. U A U. G. 2250 G. C. . . 2290 G C . . G U A81 . . G C. 2080 A. U.2240 U. .80 . . G U . 61 G C G. . . 1660 G. C. 2000 U. A. C. . . .A U. A U. 2260 C. G.2280 .A G. 2350 G. C. C. G. . . . 82.G A...... U .G C U A G .C ...... GA.. . U G. C. G. .A U U A C CUGC G CG A. U A. U. .A.A...... C 2360 . U A U. .U G . .G ACG.A.G.C GC.G.G .A.C G. . . A. C G G A. U. A. . . . . 2370 .. A. 2430 G C U.C. .C.U.A A A. U. 86 IV GA G. C. C. G. . . G. C. . . C G G. ..A C G G. U. 2380 C. G.87 1650 A. U. . . C A . . .G C. G C 2010 74A U. 2420U. G. U. .A . . . A. G A...G .U.U A.G . . A. U. 2390 A A U G . . .C 88 G A A. 2070 A. U. C. . A. 2440 C. G.2400 U. .G C. U. U. G. U. G. G. C. G. U. A. 2030 C. G. G. C. A.2020 G.A. C. G. .A U. .U. A. G C. G C. G. A. . 72..G . . . 2410 G .A U. .CU ..A . A C G A. .G. .A .G .C.G U U . U C . .A .G A.2450 .UG 2060 A A G. . . 2040 A C. U. . A A . 2470 A 2050 . G 2460 .. . 73 . . . . . C.AA. . .U. C.C.C.G.C.G.G.C A A.G.A C G.G G.C.U .G.A.U.A.C.C.G.C.C G.AG U...... C V G G G U G C C G U C U. .C C. C GG. .UU GUG GC G G. .C U.A..A . A.U CENTRAL . U C..G U . . C 2620 C A 2490 89 A U U. . . . 2480 A. G 2610 U 2800 A U. . . LOOP C . G . . G. C. .C . C 10 A. C. U G. . G. G. 98 . . G C. VIG. 2630 A. UC .U 2500 .C . . . 2600 .G.U .G .G. . 2810 U. A. . C. G .C ..G. .C G A. . 99 . G.U C C. A. . .A .AA A. . .G .A .C G U. . . . .G.U.C C U. G. .AG . .C U . . . . .U.A 1.A .U G C. G. A A C. A.94 G. .. G 2590 U U G G. U. G 2890 2820 . 2790 U. A. .CU .U .U .C G...... A A. U. C. U G 2580 G. C. .U .A .A A A C G U U G G. G. U.G. 93 U. G. .G .A A. . . A. .A A. 2660 C .U U G.C.A.G.C. . 2780 .G G. C. G. .G . 2900 G. A.G C G.2640 . . A. .C G.2830 .U .G.C. .U. .A C G G. C. 2510 U. A.G.A C.G. A. G.U . . . A U 5'end U. A. A. 100 G. 2650 .C .. G A . . U C. .C AG ..G C U A. . G. . .C.U . . .C G.C . A 2770 .C G G. .G . G C .. U. . C.U . .AG SRL CAU A 3'end 2880.C G . . G. ..U G . . . AA G 2670 2570 G. U. A. . A. .U G. G. C.U 95 G. C.90 U. G. 2760 ..G. A. U A G C .A.C GA A.. . . . 2520 2530 . .2840 . . .C . A G C. .A.G C. 91 . . G C .C U A.A.G . .G. .A G. G A...... G A. . . .U ..GU .. C. . A UCC U GGGGC U A. C. G. A U C A AU G C. A...... 2730 . A 2680 .UG GG. AC CCU G G..UA .C G.101 C G 2740 U. . . . U.U. A. A .A G . . . 97 A U C A U. A. 2540 U . G A.A.A . . 2560 A U .A. .U .C . . 2870 .C G.. 2750 A . U C. G. A.G U G. A. . C G 2550 . .U C G. 92. . G.U. .A.A . G. .G C. U G 2850 C. U.G C. U. . . 96 .G .U . . G. C. A. . U U G U.G U. 2690 C. G. . C. 2720G. A-Loop G. C. C. G. U A G. . . C. G. 2860 A. .G C. G.C .G U. U. . U. CA. G C. . 2710 G U. . C. G. A 2700 U. . . A U G . . . A.C C Figure 38. Secondary structure of E. coli rRNA following the layout style of Gutell (1993). 3’ half of 23S rRNA. IV, V and VI represent domain numbers for the regions of 1647-2014, 2015-2627 and 2628-2904, respectively. The α–sarcin-ricin stem-loop region (SRL), A-site finger, P-loop, and A-loop are in the nucleotide ranges from 2653 to 2667, from 879 to 898, from 2251 to 2253, and from 2552 to 2556, respectively. Numerals are as in Figure 36.

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 12-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

350 .U.U. A. A. C. G. C. G. 340 .A U.22 . .U...... C A GU UUUAAA . 360 .A. . U.A U...... U.C A. . .A. . UUAUUUUU. .AAA . .G AA.UA. U UA A. .U UA. . . . . 330 21 A 320 U. U .A370 U. .U . A. C A. U . . A. . A. A. . .UAA. AA A. . U. 390 U. A. A UA. U.A20 . U. A. .U A. U. 400 .G .A 310 C. . . U A 420.G .U CA. A. .A380 . . . .U . G. .A . U. U. A .U A. .U . U U. 24 .U 430 U UA A. U. .A .G . U.A. . .A U. .A .U G. UA A. .U .A 26 300 U. .U .U .G C .U .A .U . .C .A .U 240 G. A .U .U U .U. .U .G.U.U. . . A .C GC. G. . 410.U 440 G AA. . . .C . U. G. .UAA. .G G. U. A. A. .A 510 . 290 G U U. U. . A. . U . .19 .A A. C.C. .AA .U A. U. G. 200 A G U UG A U . . U. . . .A.U C. . . UA. U. U U 250 .A 450 A.G 460 . . . U. A. A. A . . .A C...... AAA U . 230 A. U. U A.U.G.A.U.A G.A.UUA.U U. A. U. . A. . A G. C. U...... U500 U. G. . . . A.17 AU. UACUAU. U. UG UG. .G U U ACU. U. 18G. U. A . A. .A . . A 210 U A 280 . . 480 AA 470 A U. G. 35 . U. . . C.U.A 27 UA. A .A.A.A U . A . A. U. U. . . .U A . 190 . A. G. .U U. A. 520 AU. U. . A.C. A U . 2 490 U G A C U. G. . C. . . .AAA. . U. 220 A. 20 A.A . A . A. U.U.U.A A.U A. A. . G. C. 270 U. U. . A . . . .C A . U.U.G.U...... U U U A UU .U.A . U. . . . .AAAU UA. . . .AA A . 3 .A .G . . . GGAA AAG . 260 .U . .U . .U .UGG.U.C.AA . 530 .U .AU A .A U 540 28 A . .U . .U U 10 . 590 38 180 . A.AA . A. .U . 1 A U . U . .AU C U . . A.A.U.A.U.A.U.U.U.A. .U .U . U. A. U. 600 .A.A . A. U A...... A.A . .A 30 U. A. UUAUAUAAA.AA .A . . U A. G U A.A . UU . . 610 . .UU A. U. G. 45 A . .U U1 3' end . AU 4 550 U. A. . G C 170 U. A. 40 5' end . U A C. . . . C .A U. . A G C. G. . . A. U. 580 A. U. 44C G A U . . . . 5 5' C. C. G. U. .A G U A. .A 3'm . . A. U. U A. 50 . 560 U. A. U. U. A. G. C. U. A. 160 U. A. U. A. G. U. U. A. 570 G. U.7 A. U. G U. . . . UAA U. U. A. .A U. G. A. 60 A. .A U. A. .U G 150 A. . U. C. G. G U . . .G. U. 130 U. U. A. U. 140 . 70 80 . . .U. . . .U U. . . . .U.A UA...... 8. . . .G.A. . .UUU. U. AGA UUGU UUUUUGAAA UU U. A. .U.U.U. . A.A.U.A. . . .A.A.G.G.A.U.U.U.U. . .A.A. . . .G G U UU AU AA UAU 120 . . 100 90 10 U. G. 9A. U. A. U. .U A. A A .A.U. 110

Figure 39. Secondary structure of 9S small subunit mitochondrial rRNA of Trypanosoma brucei. Residue number 1 is 1189th U EMBL database accession number X02547. Residues 493- 506 UAUUAUUUAUAUUA in light blue are supposed to base-pair with the red residues in the SRL region of large subunit rRNA in Figure 40. Residues 515-519 UGUGU in green are supposed to base-pair with the T-loop of tRNAs. Large numbers 1, 2, 3, … represent helix numbers corresponding to those for E. coli secondary structure (Brimacombe, 1995), but some are missing (e.g. 6). 5’, C, and 3’m denote 5’-domain (the region of 1-556 in E. coli 16S rRNA), central domain (557-918 in E. coli 16S rRNA), and 3’-minor domain (1397-1542 in E. coli 16S rRNA), respectively,but 3’-major domain (919-1396 in E. coli 16S rRNA) is missing.

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 13-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

720 .A. .A.U. A. A. .C U. 700 A A . .A 370 . . UG U . 350 360 .U A. . . .U.GA. . . . .U.U.U. . .A. A.A. . . . .A. . .G. . .U. . .U .U A CAA AA U UAA C U UU A A. A. A A. A. . . A. 69. .U . . .U. .G.U.U. . . .U.U. .A A.UU. G. .A. .AA. .C .A U. .A U UUA UCU U. .U .U. U A U .A. A A C . A. U. 380 A. U. 68. . . G. 340 . 330 U. A. U A .A U. AA43 U 44 . . . .U .U730 A. . 66U A690 .U C C .A .U A . . . . . 710 .G . .A UUCU C 740 U. U . . . U. A. 390 . . . . .A.A.G.A. AU ...... A. . 680 UAA U. AGUAAUUUGUUA C. 750 .UG. A. U. A. U. A. . U. . C ...... A. A. 320 U. A A. U. 67 770 C UUGUGAAAUUAU.AAA U U .A .A U . . A U .U 760 U. A. 890 930 . . U. . 70 . A C. . A. U. .U.U. . .A. U U A A. U. .A. .A U. A. U. 780 C. G. . . . .U U. U. U. U. A. 880 A U. A. U. .U AA A. A. 650 . A . G. .A 460 A. .A A. A. U A. . . . U. G. U. . . . U A. U. G G A U U. A. G A . 400 .U .U . . 670 A U A. U. U. A. A. U. A .U 640A. U. . . 64 42 . . U .A U.A .G U. P- Loop .U G. U AA A. . .A. .C.U A . 75 .G. . . U .U A. U. U . A. .C A. G. G. A. C. .U . A .G. .A G. U A A. U. . U. A. U. A. U. A. A U. 910 G. C. . . 450.U . .G U A G C 790 . A U.81 310 A. U. U .AU . . U. G. . . A. U. A.80 . U G . .G . . G. 63 U. A. A U 900 U A. A. 940 . . .U GUU U. A. . . . 920 U A A .G 470 . . U. A. 600 660 .U A. C G A U. U. A. . . G. . .U U. U. 590 U G U. . . . .C A . . . . C G A. 410 . . U . A. . UAUU .U A U. U. U. G A . A. . . . A. A. A. U. .A A U . UAUU .A .A U A . . C. . . . .A . .A A .A .A .A . . 520 .U GUAA. A . . A . . . .G. . U. 510A U A. . 870 . . .A. . . U C G AU AA ..U .G . . 49U. A. 630 U. A. U. A. AU U . C. .C .A .U G. . . .A U. A. U A420 440 . A. A .AAA U.C. U. A. U. A. 960 . . U. .A . A. G. A U. 41U A. . .U A A. .A U...... A. U.61 U. A. . A . G A. . U. A AUA AU . .U A. 480 . A. U. U. A. 800 C. G. 970 A. U950 .A .A . U. A. 580 A U610 U A . G. U. U. .UG . . . . 810 74.C G.U. A. U. G A. 46U. G. .A. A U III IV .U A . . .A. A. U. . U U. . . . UU A. U. U. .G C.88 U A .U .U 45 A. U. . A A. U. 620 G. .A U. . 300 . . U .A G. . C G C. G. C U.U.U U A . . U. A. G. U. A. . . .A A. U .A490 A U530 47 U. G U U. A. . 430 A. . . . G U. U. A. 48 . . .G U. . U 500 . A U.U...... U A.U .UA. .C G. G. U UAU. . . A.U.A U U U G C. .U A. G.A.U.A.U . A.A.C.U. U. . . A. U U 72 U A. C. U A . U 560 570 A. .U.U . 860 U A . . . . . 210 . .A U A. .A . . .A . . . U.AUA. .UUGA. U A .AAA C. G. 220 . . .U . 550 .U . .A 980 AU .U U 820 UU 830 .C G.U. . U. A. . .G . .U U. . .U.U.AU 540 A.U. .U. 26 A A .G A. .GA U 36 U. A U.G. . . A A...... U.A. A . C A. .U.GAUA. U. U. A.U. . U . .A . A U U .A AC.A. . A 840 850 A. . . .A U. . U. UA. . 73...... C 290 39 A. U . U U. U. 150 II A.A.A.U.U.A.AAA.U.AGGG A. 990. . . . . 200GU A. U. A G. . . . . AAA AA2. 80 . A .. . A. . . .U...... GCAUU . 240 U A U A GA A. V UUUAAUU UAU. . U. .A .U A . U G. .C A.. . . 160 . AUA U.G.U.A.A. . . .A . . U. . . 180A U U. 1100 U. . AUA A .U A. A A . . .U CENTRAL G . . . A U. .A U. 32 A U U. U. . 1000 .C A U . U. .G .U 190 . . 27 . U U89 C U. A. G U U. 1090 G. . . A. . 230 U. . U. . A. A . A LOOP U A. . . G. A. UA . U. . .U .UUA 37 U. A. A. U. . U U .G A . .U VI A. 1110 . .U.U . 270 . UG .A U. A. . .AG.A C. .U .A A A. 140 U. . . U.. A . 1080 .C . .A. .A A. . U. . . A A U. U . .G .UA A. .A U. U. U . A. G A. U. .AA . .C A .U .G 250 . . .U . . A. A. A. A. . . G 1070 . .U .A A. UUA170 .G U. . I . . . . .A.AU. 10 C. A9. 4 . UU U. . .C C. .U .G G G. . . .A U.A A A U A U G. C U. UG . UG. .G 1010 . U .U .U A U A . . . A. 20 .A A. 93 G. U. U. G. 38 .U A. U. A. 50 U. .U 1120 U. . .U A. . A. 1130 .G C. .G A. U .G .U110 100.A U.2 U. U. .U U . . .GC. U U U. C 130 . .U A. . AA A.C SRL . . . . U A U. U. 1 U A U. A. U. . .U. A. U A U .A .U . 60 . . A A G . .A . . G U .U A. U. U. U...... GG 1060 G C . A U A U. . U. .AU UA . . 260 A. . . . A. A. A1 U.A . . . 1140 . .A U .A U. U. A. U. U. . A.GU .UU . A U C. 5' end . .UU G U 25. . U. A. 40U. . A . . . . A- si t e f i nger .U .U . A A. U 95 CUU A1020 .U .A 90 .UA U. U. U. . . U .U A. 3' end 1150 A. U . . A. A. U. U A.A. 30 . UA 120 U. C . . A. U. . . G. . U GU. . C. G G 1050 U. A. UU. G. . .G C. . . UA CA A. U. 70 . U. U G. A. . .A.G . U. AA U 3 .A . . U. G. A. U. 1030 A. U. C. G. 80.U A. U A U. . . . U.AA 1040.A U. C. .U U.U.G A- Loop

Figure 40. Secondary structure of 12S large subunit mitochondrial rRNA of Trypanosoma brucei. Residue number 1 is 11th U of X02547. Residues 1130-1143 AAGUACGCAAGGAU in red are supposed to base-pair with the light blue residues of small subunit rRNA in Figure 39. Large numerals as in Figure 39. I, II, III, IV, V, and VI represent domain numbers for the regions corresponding to those of 1-561, 562-1269, 1270-1646, 1647-2014, 2015-2627, and 2628-2904 in E. coli 23S rRNA, respectively. The α-sarcin/ricin stemloop region (SRL), A site finger, P-loop, and A-loop correspond to those inE. coli 23S sequence in the nucleotide regions from 2653 to 2667, from 879 to 898, from 2251 to 2253, and from 2552 to 2556, respectively. Central loop is the highly-conserved nucleotide region surrounding tRNAs on the ribosome. the basis of the low Mg2+ concentration described above. In contrast base at the decoding site of 16S rRNA, the secondary structure of which to Figure 48, Figures 50 and 51 show the structural difference in the is shown in Figure 36. Such base pairs are long-range interactions in A-site codon-anticodon helix between the cognate case in Figure 50 contrast to the hydrogen-bonded U33 base inside the anticodon loop and the near-cognate case in Figure 51. In these two structures, the so far observed in X-ray structures. Accordingly, a transition between bases of both U33 and OMC32 as well as YG37 are exposed in order U33-folded and U33-extended forms would be more frequent in the to represent a delicate difference between the cognate and near- 2+ cognate codon-anticodon helices. As a result of the difference, a Mg2+ physiological condition of the Mg concentration. Although the G-C hexahydrate between OP2 atom of YG37 and O2 atom of OMC32 is pair between C1399 and G1504 has been predicted [14,15] because expelled, giving rise to a slight difference in the interactions between of its high conservation, it was not confirmed in the crystal structure the bases of U33 and A1396. The interactions of A1492 and A1493 to of 2HGR (accession number in PDB database [26]). On the other the minor groove of the codon-anticodon helix can be achieved almost hand, G1401-C1501 pair as well as the base stacking of C1400 to the the same as those in the X-ray structures [23], but the interaction of first base of the anticodon of P-site tRNA [27]was confirmed. It has G530 is rather difficult without a conformational change in the 530 been known that both adenines and cytidines in the decoding site are loop due to the big conformational change in the anticodon stem-loop (ASL) structure as well as the big shift in the location of h45, which will universally conserved. Accordingly, it was quite natural to think that be described later. The near-cognate G-U base pair at the first bases of U33 base of the U33-extended form of P-site tRNA could be base- the A-site codon in Figure 51 is quite the same as that of Figure 48, but paired with A1398. It necessarily leads to the assignment of A1396 to resulting in a delicate but recognizable difference from the cognate case the base-pairing partner of the U33-extended form of aa-tRNA at the in Figure 50, while the structure shown in Figure 48 cannot be achieved recognition mode of A site (T site). It seemed possible for either A1502 because of its big structural difference from the corresponding one in or A1503 with U33 of the U33-extended form of deacylated tRNA at Demeshkina et al. [23]. the E site. The result of the model building of Nagano (2015) has shown The exposed base of U33 in the U33-extended form is supposed to be that A1502 is most favorable, while A1503 seems a little too far from supported by base-pairing with a certain universally conserved adenine the U33 of E-site tRNA.

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 14-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

Eubacteria 23SrRNA 16SrRNA tRNA SRL 5'h35 h37L 3'h35 5G7 5A7 5X7 Y Aaeol UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 38 6 0 0 Tmari UUGUACGAGAGGAC GUCAGCUCGUGCCG UUAAGU CGAGC 31 15 0 0 Tther UAGUACGAGAGGAC GUCAGCUCGUGCCG UUAAGU CGAGC 6 3 0 1 Dradi UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 28 19 0 0 Tpall UAGUACGAGAGGAC GUCAGCUCGUGCCG UUAAGU CGAGC 27 18 0 0 Bburg UAGUACGAGAGGAC GUCAGCUCGUGCUG UUAAGU CGAGC 22 11 0 0 Gobsc UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 0 0 0 0 Pbras UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 0 0 0 0 Ctrac UAGUACGAGAGGAC GUCAGCUCGUGCCG UUAAGU CGAGC 23 14 0 0 Cpneu UAGUACGAGAGGAC GUCAGCUCGUGCCG UUAAGU CGAGC 24 14 0 0 Climi UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 0 0 0 0 Bfrag GUCAGCUCGUGUCG UUAAGU CGAGC 1 0 0 0 Atume UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 3 1 0 0 Ccres UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 44 7 0 0 Rbell GAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 0 0 0 0 Rprow UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 20 13 0 0 Bjapo UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 1 1 0 0 Rpalu UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 1 1 0 0 Rspha UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 1 2 0 0 Rcaps UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 0 0 0 0 Pdeni UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 0 0 0 0 Nmeni UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 53 5 0 0 Bcepa UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 2 0 0 0 Ecoli UAGUACGAG AGGAC GUCAGCUCGUGUUG UUAAGU CGAGC 78 25 0 0 Hinfl UAGUACGAGAGGAC GUCAGCUCGUGUUG UUAAGU CGAGC 40 16 1 0 Paeru UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 58 6 0 0 Pstut UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 2 0 0 0 B.APS UAGUACGAGAGGAC GUCAGCUCGUGUUG UUAAGU CGAGC 13 18 0 0 Pmult GUCAGCUCGUGUUG UUAAGU CGAGC 40 17 0 0

Ramyl UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 0 0 0 0 Vchol UAGUACGAGAGGAC GUCAGCUCGUGUUG UUAAGU CGAGC 72 25 0 1 Xfast UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 47 3 0 0 Saura UAGUACGAGAGGAC 2 0 0 0 Mxant GUCAGCUCGUGUCG UUAAGU CGAGC 0 1 0 0 Cjeju UA GUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 30 13 0 0 Hpylo UA GUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 23 11 2 0 S6803 UA GUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 26 13 2 0 S6301 UA GUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 3 0 0 0 Mlute UA GUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 0 0 0 0 Mlepr UA GUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 33 12 0 0 Mtube UA GUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 33 12 0 0

Slivi UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 10 5 1 0

Scoel UAGUACGAGAGGAC GUCAGCUCGUGUCG CGAGC 0 1 0 0 UUAAGU Sambo UA GUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 0 1 0 0 Sgris UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 0 2 0 0 Bsubt UA GUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 71 15 0 0 Bhalo UA GUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 64 14 0 0 Saure UA GUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 52 8 2 0 Uurea UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 21 6 3 0 Cbotu UA GUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 2 0 0 0 Mpneu UA GUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 25 9 3 0 Mgeni UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 18 15 3 0

Figure 41. Nucleotide sequence comparisons of two highly conserved regions from large subunit- and small subunit-rRNAs as well as the numbers of four typical sequence patterns of tRNA of respective species. (a) Eubacteria.

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 15-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

Archaea 23SrRNA 16SrRNA tRNA SRL 5'h35 h37L 3'h35 5G7 5A7 5X7 Y Aflug UAGUACGAGAGGAA GUCAGUUCGUGUAC UUAAGU CGAGC 23 23 0 0 Hhalo UAGUACGAGAGGAA GUCAGCUCGUACCG UUAAGU CGAGC 13 32 2 0 Hmari UAGUACGAGAGGAA GUCAGCUCGUACCG UUAAGU CGAGC 1 1 0 0 Hmorr UAGUACGAGAGGAA GUCAGCUCGUACCG UUAAGU CGAGC 0 0 0 0 Hvolc GUCAGCUCGUACCG UUAAGU CGAGC 39 8 0 0 Nmaga UAGUACGAGAGGAA GUCAGCUCGUACCG UUAAGU CGAGC 0 1 0 0 Mther CAGUACGAGAGGAA GUCAGCUCGUACCG UUAAGU CGAGC 13 26 0 0 Mjann UAGUACGAGAGGAA GUCAGCUCGUACCG UUAAGU CGAGC 8 29 0 0 Mmari GUCAGCUCGUACUG UUAAGU CGAGC 14 23 1 0 Mvann UAGUACGAGAGGAA 6 10 0 0 Mhung UAGUACGAGAGGAA GUCAGUUCGUACUG UUUAGU CGAGC 0 0 0 0 Pabys UAGUACGAGAGGAA GUCAGCUCGUACCG UUAAGU CGAGC 9 37 0 0 Pfuri UAGUACGAGAGGAA GUCAGCUCGUACCG UUAAGU CGAGC 8 37 0 0 Phori UAGUACGAGAGGAA GUCAGCUCGUACCG UUAAGU CGAGC 10 37 0 0 Tcele CAGUACGAGAGGAA GUCAGCUCGUACCG UUAAGU CGAGC 2 2 0 0 Tacid UAGUACGAGAGGAA GUCAGCUCGUACCG UUAAGU CGAGC 16 18 11 0 Tvolc UAGUACGAGAGGAA GUCAGCUCGUACCG UUAAGU CGAGC 17 17 11 0 Apern UAGUACGAGAGGAA GCCAGCUCGUGCCG UUAAGU CGAGC 13 36 1 0 Dmobi CAGUACGAGAGGAA GCCAGCUCGUGCUG UUAAGU CGAGC 0 0 0 0 Paero CAGUACGAGAGGAA GUCAGCUCGUGCCG UUAAGU CGAGC 13 28 0 1 Ssolf AAGUACGAGAGGAA GCCAGCUCGUGUUG UUAAGU CGAGC 11 35 0 0 Stoko UAGUACGAGAGGAA GCCAGCUCGUGUUG UUAAGU CGAGC 2 44 0 0 Sacid UAGUACGAGAGGAA GCCAGCUCGUGUUG UUAAGU CGAGC 0 0 0 0 Tpend CAGUACGAG AGGAA GCCGGCUCGUGCCG UUAAGU CGAGC 1 1 0 0 Ttena GUCAGCUCGUGUCG UUAAGU CGAGC 3 2 0 0

Figure 42. Nucleotide sequence comparisons of two highly conserved regions from large subunit- and small subunit-rRNAs as well as the numbers of four typical sequence patterns of tRNA of respective species. (b) Archaea.

Chloroplast 23SrRNA 16SrRNA tRNA SRL 5'h35 h37L 3'h35 5G7 5A7 5X7 Y Pfalc UAGUACGAGAGGAU GUCAGUUCGUGCUG UUUAGU CGAAC 1 7 0 0 Pberg UAGUACGAGAGGAU GUCAGUUCGUGCUG UUAAGU CGAAC 5 12 1 1 Cpara UUGUACGAGAGGAU GUCAGCUCGUGUCG UUAAGU CGAGC 15 16 1 0 Crein UAGUACGAGAGGAC GUCAGCUCGUGCUG UUAAGU CGAGC 5 6 1 0 Ceuga UAGUACGAGAGGAU GUCAGCUCGUGCGU UUAAGU CGAGC 1 1 0 0 Chumi UAGUACGAGAGGAC GUCAGCUCGUGCGU UUAAGU CGAGC 0 0 0 0 Celli UUGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 7 0 1 0 Cvulg UAGUACGAGAGGAC GUCAGCUCGUGUCU UUAAGU CGAGC 22 10 1 0 Gthet UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 14 15 1 0 Olute GUCAGCUCGUGUCG UUAAGU CGAGC 1 1 0 0 Osine UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 15 13 1 0 Along UAGUACGAGAGGAC GUCAGCUCGUAUCG UUUAGU CGAGC 9 18 2 0 Egrac UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 23 8 1 0 Plitt CAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 1 1 0 0 Ccald UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 10 18 2 0 Ppalm UAGUACGAGAGGAC GUCAGCUCGUGUCG UUAAGU CGAGC 0 1 1 0 Camer UAGUACGAGAGGAC GUCAGCUCGUGCCG UUAAGU CGAGC 0 0 0 0 Ainca UAGUACGAGAGGAC GUCAGCUCGUGCCG UUAAGU CGAGC 2 2 0 0 Pthun UAGUACGAGAGGAC GUCAGCUCGUGCCG UUAAGU CGAGC 16 15 0 0 Evirg UAGUACGAGAGGAC GUCAGCUCGUGCCG UUAAGU CGAGC 9 7 1 0 Ntaba UAGUACGAGAGGAC GUCAGCUCGUGCCG UUAAGU CGAGC 21 15 1 0 Taest UAGUACGAGAGGAC GUCAGCUCGUGCCG UUAAGU CGAGC 16 13 1 0 Osati UAGUACGAGAGGAC GUCAGCUCGUGCCG UUAAGU CGAGC 21 16 2 0 Psati UAGUACGAG AGGAC GUCAGCUCGUGCCG UUAAGU CGAGC 6 9 1 0 Soler UAGUACGAGAGGAC GUCAGCUCGUGCCG UUAAGU CGAGC 15 15 1 0 Zmays UAGUACGAGAGGAC GUCAGCUCGUGCCG UUAAGU CGAGC 19 14 1 0 Mpoly UAGUACGAGAGGAC GUCAGCUCGUGCCG UUAAGU CGAGC 18 16 1 0 Figure 43. Nucleotide sequence comparisons of two highly conserved regions from large subunit- and small subunit-rRNAs as well as the numbers of four typical sequence patterns of tRNA of respective species.(c) Chloroplast.

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 16-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

Eukarya 23S-like rRNA 16S-like rRNA tRNA SRL 5'h35 h37L 3'h35 5G7 5A7 5X7 Y Hinfl CCUAGUUCGUGGUG CUUAUU CGAGC 0 0 0 0 Garde CAGUACGAGAGGAA CCCAGCCCGUGGCG UUCACU CGAGC 0 0 0 0 Gmuri CAGUACGAGAGGAA CACAGCCCGUGGCU UUGACU CGAGC 0 0 0 0 Ginte CAGUACGAGAGGAA CCCAGCCCGUGGCG UCCAUU CGAGC 0 0 0 0 Ecuni UAGUACGAGAGGAA UUAAAUGGAUGGCG UUAAGU GAUGU 42 4 0 0 Pfalc UGGUACGAGAGGAU UUUAGUUCGUGAAU UUAAUU CGAAC 6 13 4 0 Tparv UAGUACGAGAGGAA CUUAGUUGGUGGAG UUAAUU CGAAC 0 0 0 0 Tgond UAGUACGAGAGGAA CUUAGUUGGUGGAG UUAAUU CGAAC 2 4 1 0 Ddisc CUUAGUUGGUGGAG UCAAUU CGGAC 19 1 2 0 Celli CAGUACGAGAGGAA CUUAGUUGGUGGGU UUGAUU CGAAC 15 0 0 0 Tther UAGUACGAGAGGAA CUUAGUUGGUGGAG UUAAUU CGAAC 3 0 0 0 Pmica CAGUACGAGAGGAC CUUAGUUGGUGGAG UUAAUU CGAAC 0 0 0 0 Egrac UAGUACGAGAGGAA 0 0 1 0 Ppoly CAGUACGAGAGGAA CUUAGUUCGUGGAU UUUAUU CGAGC 1 0 0 0 Dirid CAGUACGAGAGGAA CUUAGUUCGUGGAU UCUAUU CGAGC 0 0 0 0 Pmega CAGUACGAGAGGAA CUUAGUUGGUGGAG UUAAUU CGAAC 1 0 0 0 Ehist CAGUACGAGAGGAA CUUAGUUGGUGGAG UUAAUU CGAAC 2 0 0 0 Cfasc UAGUACGAGAGGAA UUUUGUUGGUUUUA UUAUGGA CCAGU 2 0 0 0 Tfoet GGCCGUUGGUGGUG GUUGAU CGAGC 0 0 0 0 Tbruc UAGUACGAGAGGAA ACUGGUGCGUCAGA UUAGACC CGAAC 16 3 0 1 Pcari UAGUACGAGAGGAA CUUAGUUGGUGAAG UUAAUU CGAAC 0 0 0 0 Scere UAGUACGAGAGGAA CUCAGUUGGUGGAG UUAAUU CGAAC 196 70 5 2 Spomb UAGUACGAGAGGAA CUUAGUUGGUGGAG UUAAUU CGAAC 18 6 2 0 Cneof UAGUACGAG AGGAA CUUAGUUGGUGGAG UUAAUU CGAAC 0 0 0 0 Calbi UAGUACGAGAGGAA CUUAGUUGGUGGAG UUAAUU CGAAC 12 0 0 1 Mrace UAGUACGAGAGGAA CUUAGUUCGUGGAG UUAAUU CGAAC 0 0 0 0 Athal UAGUACGAGAGGAA CUUAGUUGGUGGAG UUAAUU CGAAC 489 80 47 4 Bnapu UAGUACGAGAGGAA 3 0 0 0 Fanan UAGUACGAGAGGAA CUUAGUUGGUGGAG UUAAUU CGAAC 0 0 0 0

Lescu UAGUACGAGAGGAA CUUAGUUGGUGGAG UUAAUU CGAAC 0 0 0 0 Osati UAGUACGAGAGGAA CUUAGUUGGUGGAG UUAAUU CGAAC 17 6 3 0 Salba UAGUACGAGAGGAA CUUAGUUGGUGGAG UUAAUU CGAAC 0 0 0 0 Celeg UAGUACGAGAGGAA CUUAGUUGGUGGAG UUUAUU CGAGC 421 139 19 6 Mxant CAGUACGAGAGGAA CUUAGUUCGUGGAA UUAAUU CGAAC 0 0 0 0 Dmela UAG UACGAGAGGAA CUUAGUUCGUGGAG UUAAUU CGAAC 159 106 18 2 Pmage CAG UACGAGAGGAA CUUAGUUGGUGGAG UUAAUU CGAAC 0 0 0 0 Spurp CAG UACGAGAGGAA CUUAGUUGGUGGAG UUAAUU CGAAC 0 0 0 0 Hmomu CAG UACGAGAGGAA CUUAGUUGGUGGAG UUAAUU CGAAC 0 0 0 0 Hsapi CAG UACGAGAGGAA CUUAGUUGGUGGAG UUAAUU CGAAC 278 153 57 23 Mmusc CAG UACGAGAGGAA CUUAGUUGGUGGAG UUAAUU CGAAC 9 5 1 0 Xlaev CAG UACGAGAGGAA CUUAGUUGGUGGAG UUAAUU CGAAC 6 4 2 0

Figure 44. Nucleotide sequence comparisons of two highly conserved regions from large subunit- and small subunit-rRNAs as well as the numbers of four typical sequence patterns of tRNA of respective species. (d) Eukarya. A-P and P-E tRNA docking pairs in the pre- and post- cytidine residues in T-loop of tRNAs are C56 and C61. (However, there translocational states exists an exceptional case in human mitochondrial Ser-tRNA missing totally its D-loop. How we can interpret its translational movement? The universal rule of nucleotide sequence complementarities That is an important question). In contrast to the X-ray structure of the (Figure 46) was first noticed by a finding that oligonucleotide elbow region of the isolated L-form tRNA, in which G19 is paired with fragments such as TѰCG or UUCG inhibited the binding of aa-tRNA C56 as a Watson-Crick type, but G18 is hydrogen-bonded to Ѱ55 in at the A site [28]. It was found that such a highly conserved sequence an irregular fashion [8], the universally conserved guanines, G18 and exists on the 3’-side of h35 of 16S rRNA, and that the other side of G19, of aa-tRNA at the T site are base-paired with the other universally h35 is complementary to the conserved loophead sequence of SRL conserved cytidines, C56 and C61, of P-site tRNA, respectively, and in 23S rRNA [29]. In addition to the above finding, it was also found also G18 and G19 of P-site tRNA with C56 and C61 of E-site tRNA, that various tRNA fragments containing D-loop inhibited aa-tRNA respectively. Such a big difference in the shape of tRNA between the binding at the A site [28]. This strongly suggested that the T-loop of isolated L-form of X-ray structure [8,21,26] and the docking model of aa-tRNA making base-pairs with the 3’-side of h35 and the D-loop Nagano et al. [29] as well as Nagano [9] comes from a big difference in of aa-tRNA making base-pairs with the T-loop of P-site tRNA are an the Mg2+ concentration, as mentioned above. essential necessary requirement for the aa-tRNA entering the A site. The results of X-ray structures of the whole ribosome have shown It is also important to note that there exists a pair of highly conserved that two conserved regions of 16S rRNA, the ridge containing G1338 guanines at the almost all tRNAs (although their relative locations are and A1339 near h29 and the head of 790 loop, constitute barriers that slightly different from one another), while that the highly-conserved

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 17-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

Mitochondria 23S-likerRNA 16S-likerRNA tRNA SRL 5'h35 h37L 3'h35 5G7 5A7 5X7 Y (i) Protista Tbruc AAGUACGCAAGGAU UAUUAUUUAUAUUA ------UGUGU 0 0 0 2 Pfalc GAGUACGUAAGGAA AUGAGAGUUCACCG ------UGAGC 4 7 4 0 Ceuga GAGUACGAAAGGAC GGAAGUCCGAGUAG CCCUCG CGGAC 2 2 0 0 Crein GAGUACGAAAGGAC AAAAGUUCGAGCGG CCGGUC UGAAC 3 0 0 0 Pwick UUGUACGAGAGGAC GUCGGUUCGUGCCG UUAAUU CGAAC 17 9 0 0 Paure CAGUACGAGAGGAC GUCGGUUCGUGUUU UUAAGU CGAAC 2 1 0 0 Tther UAGUACGAGAGGAC GUCAGUUCGUGCUG UUAAGU CGAAU 7 0 1 0 Tpyri UAGUACGAGAGGAC GUCAGUUCGUGCUG UUAAGU CGAAU 7 0 1 0 Ppoly UAGUACGAAAGGAC GUAAGUUCGUGCCG UUCAUU CGAAC 2 1 1 1 Acast UUGUACGAGAGGAU GGCAGUUCGUGUUG UUAAUU CGGAC 16 0 0 0 Ppurp UAGUACGAGAGGAU GUCAGUCCGUGCCG UUAAUU CGGAC 12 12 0 0 Ccris UAGUACGAGAGGAC GUCAGUCCGUGCUG UUCAUU CGGAC 19 4 0 0 Plitt UAGUACGAGAGGAC ACCAGUUCUAGUAA UUACGA CAAAC 4 20 0 0 (ii) Fungi Pcana UAGUACGCAAGGAU UUCAGUUCGUGCUG UUAGUU CGAAC 14 10 1 0 Scere UAGUACGCAAGGAC UUUAGUUCGUGCUG UUAAGU CGAAC 11 15 1 0 Anidu UCGUACGAAAGGAG UUUAGUUAAUGUCG UUAAGU CGAAA 9 9 6 2 (iii) Plantae Mpoly UAGUACGAGAGGAC GUCAGCUCGUGUCU UUAAGU CGAGC 13 14 0 0 Obert UAGUACGAGAGGAC GUCAGCUCGUGUCG UCAAGU CGAGC 9 5 0 0 Taest UAGUACGAGAGGAC GUCAGCUCGUGUCG UCAAGU CGAGC 13 4 0 0 Zmays UAGUACGAGAGGAC GUCAGCUCGUGUCG UCAAGU CGAGC 7 6 0 0 (iv) Animalia Plivi CAGUACGAAAGGAC AUCAGCUUGUAUAC CUACU- UUGGC 0 4 15 4 Aquad UAGUACGAAAGGAC AUCUGUUCUGUUAU UUGGA- UAAAU 0 0 5 17 Spurp UAGUACGAAAGGAC GGACGUCAGAUCAA GUUGG- UGAGC 0 2 9 11 Dmela UAGUACGAAAGGAC ACCUGUUUUUUAAU -UCC-- ACGAU 0 0 3 22 Amell UUGUACGAAAGGAC AGAUGUUUGUCGAU AUAGUC ACGAU 0 0 1 21

Lmigr UAGUACCAAAGGAC AUUUGUUUGUAUAU ------CAGAA 0 2 2 23 Apect UAGUACGAAAGGAC AGGAGCUUGCCAUC UAACC- CGAAA 2 7 8 9 Clacu UAGUACGAAAGGAA GUCAGCUUACCCUG -CAAU- UAAGC 1 2 15 4 Ccarp UAGUACGAAAGGAU GUCAGCUUACCCUG AAUAAA UAAGC 0 3 18 2 Gmorh CAGUACGAAAGGAC GUCAGCUUACCCUG -AAA-- UAAGC 3 1 21 3 Pdoll UA GUACGAAAGGAC CCUAGUUUGUUUCC UUCAUG CAGAU 1 1 14 6 Xlaev UAG UACGAAAGGAC GUAGGCUUAAUGAU CAAUC- CAGGU 1 5 11 6 Eeuro UA GUACGAAAGGAC UGUAGCUCGUAGUU ACAGUU UAAGU 1 3 5 15 Ramer UA GUACGAAAGGAC UUGUGCUCGUAGUU CUUAGG UAAGU 2 1 13 6 Ggall UA GUACGAAAGGAC UUGUGCUCGUAGUU CAUGUG UGGGA 2 2 8 10 Amiss UA GUACGAAAGGAC CGUCGCAAGCCCGU AGGGAA CGCGC 1 0 15 18 Mmusc CAG UACGAAAGGAC UUCAGCAAACCCUA -UUAA- UAAGC 1 3 6 12

Ecaba CAGUACGAAAGGAC UUCAGCAAACCCUA -CCGA- UAAGC 1 3 10 9

Btaur CAGUACGAAAGGAC UAAGC 2 0 8 12 UUCAGCAAACCCUA -AAAA- Hsapi CU GUACGAAAGGAC UUCAGCAAACCCUG -ACAA- UAAGC 5 1 8 14 Acoer UAGUACGAAAGGAC CGUUGUCUUCAUGA UACAAA UGAAU 0 0 0 21 Fhepa UA GUACGAAAGGAU GUUUGUUAGUGUAU GAUUUG UUAGU 0 0 0 22 Asuum AAG UACGAAAGGA- UUAUUUUAUUCUU- GUAAU- UUAGA 0 0 0 22 Celeg UAGUACGAAAGGA- UUAUUUUAUUCUU- -UAU-- UAAAA 0 0 0 22

Figure 45. Nucleotide sequence comparisons of two highly conserved regions from large subunit- and small subunit-rRNAs as well as the numbers of four typical sequence patterns of tRNA of respective species. (e) Mitochondria.

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 18-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

Eubacteria (Escherichia coli) rRNAs and tRNA (in standard form) A 56 2660 or . . . .C G p5. .A. . .A A. G U G T-loop U. p6. 3'. 5'. SRL G .A . . p7 . U U. . 3' C. G. GTPase- C. G. . .U . G C G. C. 61 A. U. binding site A..G. . . h37 76 A. G. C. n4. o4. G. C. U. A. 37 n3 o3 C. . 3' 5' . G. C. .1103 .. . . G. A. . . G C. .C v2. . n2 o2 C . C. G. A. C .U G A. .3' R. vz . . . A. G...... k7 .v1 .n1 o1. t1. G. U. U C. G. U A C G A G C k5 ...... G. C. . . j5.j4.j3j2.j1 b7.b6.b5 b4 b3 b2 b1 A. U. C G G .U. . .U.C. k4 A. U. . . + . G C G. + ...... a1 = + G C U A U. G. A .i5.i4.i3.i2.i1. .a7 a6 a5 a4 a3 a2 .5' G. C...... k3. C G or H95 C G A. U. 1068 C .k1 h1. c1. C. G. . . A . . h36 h35 . N .c2 1 G U .A U. U. e1. d1. A. U. 5' 3' . . A G e2 d2 U. G. . . . SpcF G. 33 . . U. G. G U e3. d3. G U C. G. 5' or or . . U A A .e4 d4.G orA orU A U 5'.G A.3' 16SrRNA gy f1 . . . . U G . f2. D-loop . . g2. . 5' 3' 23SrRNA fx g1.G .G . 18 tRNA

Figure 46. Universal rule of complementarities between 23S and 16S rRNAs and between 16S rRNA and tRNA.

block a translocational movement of P-site bound tRNA toward E site, if we assume that such a movement is just a simple-minded translation from the crystallographic P site to the crystallographic E site. Transition from the X-ray structure to the predicted one of Nagano [9] would start from the U33-extended form of aa-tRNA by interaction between U33 (aa-tRNA) and A1396, which is exposed on the X-ray structure, as well as the base-pairing of GT CG (or GT CA) of aa-tRNA with CGAGC1107. The effector region in the body of elongation factors, both EF-Tu and EG-G, would collideѰ with theѰ long helix composed of h30, h32, and h34 [14,15], resulting in accomplishment of the long- range interactions between the SRL-region of 23S rRNA and the 5’- side of h35 of 16S rRNA. Under the condition of fixing the location of Figure 47. Stereoview of main-chain representation for double-stranded helix formed A76 of both P- and E-site tRNAs, the U33-extended forms make the between SRL-region, U2653-C2666, of 23S rRNA, which is drawn in magenta, and the distance between G18 (P-tRNA) and C56 (E-tRNA) shorter and C56 region G1064-G1077of 16S rRNA in skyblue. The other region in yellow is G2645-C2652 (P-tRNA) closer to G18 (aa-tRNA), and, the twisting makes splitting and C2667-A2675 of H95. Some base pairs in H95, such as 2650-2652/2668-2670 are unfolded. Only main chains and base-pairing bases are drawn for simplicity. Red letters of D-loop from T-loop for both P- and E-site tRNAs at their respective denote the locations of nucleotides, G1064, G1077, G2645, A2675, U2653, and C2667, elbow regions. Such a conformational change for both P- and E-site respectively. tRNAs allows base-pairing between G18 (aa-tRNA) and C56 (P-tRNA) and between G18 (P-tRNA) and C56 (E-tRNA), and successively, G-C pairs between G19 (aa-tRNA) and C61 (P-tRNA) and between G19 (P-tRNA) and C61 (E-tRNA), respectively. Thus, the A-P-E tRNA docking triplet form can be completed, and finally accomplishes the codon-anticodon helix at the E site. This is a series of conformational changes after unfolding of h35 by the collision with the body of elongation factors, EF-Tu and EF-G, ending up with formation of the codon-anticodon helix at the E site. Figure 52 shows that U33 at the P site (green) is base-paired with A1398 (red), U33 at the E site (purple) is base-paired with A1502 (pink), but U33 at the A site (magenta) is not base-paired with A1396 (brown). C1399-G1504 pair is formed and shown in gold. Figure 53 shows that U33 at the P site (green) is base-paired with A1398 (red), U33 at the Figure 48. Virtual picture of stereo view for the near-cognate G-U wobble base pair, which A site (magenta) base-paired with A1396 (brown), but U33 at the E is drawn by blue guanine and yellow uridine, at the first position of the A-site codon- anticodon helix. Mg2+ ion in green is kept between O2P atom of YG37 and O2 atom site (purple) is not base-paired with A1502 (pink). C1399-G1504 pair of 2’-O-methylated OMC32. In order to keep the wobble base pair, the base of U33 in (gold) is formed. On the other hand, Figure 54 shows that all three magenta must be extended outward and destabilized because of the loss of hydrogen- U33 at the A, P and E sites, (magenta, green, and purple, respectively), bonding energy. are base-paired with A1396 (brown), A1398 (red), and A1502 (pink), respectively, at the expense of separation of the bases of C1399 and

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 19-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

hydrolysis) are rejected, while only cognate one (after GTP hydrolysis) can enter the A site to expel the deacylated tRNA at the E site from the ribosome. This would be a reasonable explanation of the mechanism of translation concerning how high accuracy could be achieved at the decoding site. When the region of U2653-C2666 of 23S rRNA (magenta) including the SRL is bound together with the region of G1064-G1077 of 16S rRNA (skyblue), the conformation of which is shown in Figure 47, the region of 16S rRNA is extended from the main part of 30S subunit. In order to achieve such a long-range base-pairing interactions, it is essential that the long helix composed of h30, h32 and h34 of 16S rRNA should have bends of a considerable degree around the regions between h32 and Figure 49. Stereoview of mRNA kinks used in Nagano (2015). Three codons at the A, P, h34 as well as between h30 and h32. It necessarily causes a big deviation and E sites, that are composed of all uridines, and two clusters of Mg2+ hexahydrate binding of the barrier, G1338 and A1339, toward the joint region of h32 and at the P/A- and E/P-kinks, by representing green big ball for Mg2+ ions and yellow big balls for water molecules. The sharp kinks are formed by binding one Mg2+ hexahydrate at one h34 from its original position, and the location of h45 is changed very kink instead of two, which are observed in X-ray structures. much. As a result, the universally conserved dimethylated adenines,

Am21519 and Am21520, at the loop of h45 makes a contact with the

Figure 50. Stereoview of cognate A-site codon-anticodon helix and the ASL of Nagano (2015), in which U33 of the ASL and A1396 at the decoding site of 16S rRNA are drawn in magenta and brown, respectively. Significance of A1396 will be described in Figure 51. Figure 52. Stereoview of full-atom representation for the residues at the decoding site of This picture is shown to look similar to that in Figure 48 as far as possible. 16S rRNA. U33 of P-site tRNA (green) is base-paired with A1398 (red), that of E-site tRNA (purple) base-paired with A1502 (pink), but U33 (magenta) at the recognition mode of A site (T site) is not base-paired. A1396 (brown) is far from the U33 base (magenta). G1401-C1501 (skyblue) is always formed. C1399-G1504 (gold) is formed in this case. This structure leads to rejection of either noncognate or near-cognate aa-tRNA from the A site. C1400 is drawn in yellow, while U1506, which starts h45, is in blue.

Figure 51. Stereoview of near-cognate A-site codon-anticodon helix and the ASL of Figure 53. Stereoview of full-atom representation for the residues at the decoding site of Nagano (2015), in which G-U wobble base pair at the first base pairs of the A-site codon- 16S rRNA. This picture is very similar to Figure 52, but different from Figure 52 in the anticodon helix is drawn in the same colors as those in Figure 48 and both U33 and A1396 following two points. U33 at the A site (magenta) is base-paired with A1396 (brown), and are as those in Figure 50. U33 at the E site (purple) is not base-paired with A1502 (pink). G1504 (gold). Since both tRNAs at the P and E sites are cognate, in contrast to the case of X-ray structure, aa-tRNA at the T site would take a transition-state conformation, as shown in Figure 54, if it is also cognate. During fluctuation of the whole structure, both residues, C1399 and G1504, would become close enough to base-pair, as they are not so far separated. Then, A-U base pair of either A site or E site is broken. Which A-U pair is broken depends on the stability of the codon-anticodon helices at the A and E sites, since the distance between the U33 base and C1400 base, for example, would become larger in the Figure 54. Stereoview of full-atom representation for the residues at the decoding site case of less stable anticodon loop. From this reason, both noncognate of 16S rRNA. This picture is also very similar to Figures 52 and 53, except that all three U33 at the A, P, and E sites are base-paired with A1396 (brown), A1398 (red), and A1502 (before GTP hydrolysis) and near-cognate aa-tRNA (at the step of GTP (pink), respectively, but C1399-G1504 (gold) is broken.

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 20-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

downstream of mRNA. This result can explain why those residues are P/E and A/P hybrid states. Such a transition would not occur if either dimethylated, while the X-ray structure, 2HGR [26] cannot. A1396 the 3’-end of P-site tRNA is peptidylated or some of the above key approaches the U33 base of aa-tRNA, while h44 of 16S rRNA makes a nucleotides are chemically modified. This could be an explanation as to contact at the inner groove of A-site codon, with universally conserved why peptidyl-tRNA bound to the P site locks ribosomal functions such A1492 and A1493 [30]. When U33 of the A-site tRNA is base-paired as translocation, termination, and recycling [34]. Figure 59 shows that with A1396 and U33 of the E-site tRNA is not base-paired with A1502, the foot of H88 (cyan), where C2395 and C2421 are situated, reaches as shown in Figure 53, the deacylated tRNA at the E site (purple) in the adenine base of P-site tRNA (green), while the P-loop of H80 (red) the three tRNA binding structure changes its conformation from the catches the bases of C74 and C75 of A-site tRNA (magenta). This is the U33-extended and docking with P-site tRNA state to the U33-folded 3D structural representation of the 3’-end of tRNAs at the A/P and P/E and isolated form. Then, the deacylated tRNA leaves the ribosome in hybrid states. When EF-G·GTP complex comes to occupy the space the direction of L1 stalk. When the conformations of nucleotides at the of A-site tRNA, the A-P tRNA docking pair rotates around the axis decoding site are such as those in Figure 52, what would happen to the passing closer to the A-, P- and E-site tRNAs to reach the site of P-E T site? When the aa-tRNA is noncognate, the base-pair of the codon- tRNA docking pair at the classical P/P and E/E states. The 3D structural anticodon helix would not be formed without need of U33-A1396 representation of them can be shown as in Figure 60. base-pair formation. In the case of either near-cognate or cognate, the codon-anticodon helix could become an ideal A-form, and the help How elongation factors work on the present models to of U33-A1396 would be very important at the step of GTP hydrolysis. produce GTP hydrolysis? After fluctuation of the conformations at the decoding site between The recent result of the crystal structure of the ribosome bound to those as in Figures 54 and 52, the helix at the T site would be broken, EF-Tu and aa-tRNA [37] does not seem to fully explain the mechanism if the one at the E site is a little stronger. Then, the conformation of the of accommodation of aa-tRNA into A site. Starting from the tRNA aa-tRNA would become a U33-folded form with its isolated L-form structure in magenta in Figure 58, which is either leaving the ribosome elbow region, and leaves the ribosome in the direction of L7/L12 stalk. or entering the ribosome as a ternary complex with EF-Tu·GTP, its At the instant, the conformation of the A-site codon would loosen, 3’-end region was modified by transplanting 3’-end region of Trp- and become ready for accepting the next anticodon of aa-tRNA as a tRNATrp along with EF-Tu and GDP bound to a cognate codon on the ternary complex, that would have probably been waiting for opening 70S ribosome 2Y18 [38]. GDP structure was changed to GTP by adding the A-site codon by using the 3’-side of h35 and A1394. A1394 is also Pγ phosphate. The conformation of EF-Tu was changed so that the Pγ highly conserved, although it is less conserved than A1396 and A1398. atom comes closer to N6 atom of A2660 and Pβ atom closer to N4 atom Translocation as a rotational movement of A-P tRNA of C2658. Pγ and Pβ atoms are covered by the region of amino acid docking pair to P-E pair residues 20-26 (VDHGKTT) in EF-Tu. The G base of GTP moiety is hydrogen bonded to U2656, while the O3’ and O2’ atoms on its ribose The inhibition of spectinomycin on the EF-G-dependent ring are hydrogen-bonded to N6 atom of A2657 and O6 atom of G1074 translocation has been noticed long time ago [31,32]. In addition of 16S rRNA, respectively. Such a local structure of GTP binding to the to the above, Peske et al. [33] have concluded that EF-G produces present model at the interface of RSL of 23S rRNA and the 5’-half of conformational rearrangements on the ribosome in the vicinity of h35 of 16S rRNA on the ribosome structure is shown in Figure 61 [9]. the decoding region on the 30S subunit and that spectinomycin has Inspection of this local structure suggests that GTP hydrolysis a direct inhibitory effect on tRNA-mRNA movement, that is quite could occur when the region of EF-Tu with GTP moves in the consistent with the basic assumptions of the present work. Comparison direction from N4 atom of C2658 to N6 atom of A2660 in contrast to of Figure 56 with Figures 57 and 58 suggests that translocation is not a translational movement of tRNAs from the crystallographically the movement vertical to the above. When the distance between the obtained P site to the E site but a rotational one around an axis that is N4 and N6 atoms becomes larger accompanying the Pβ and Pγ close to both A- and E-site tRNAs. If the aminoacyl group remains at atoms, respectively, NH2 group of the N2 position of G2659 attacks the A site and the P-site tRNA is still peptidylated, translocation would the Pβ-Pγ bond. The total structure of EF-Tu with the related regions of not occur because of locking at the P site [34]. Peptidyl transferase T- and P-site tRNAs is shown in Figure 62. After GTP hydrolysis in the reaction results in peptidyl-tRNA at the A site and deacylated tRNA structure shown in Figure 62, the conformation of EF-Tu returns to the at the P site, the state of which automatically brings about P and E original one observed in the crystal structure of the ribosome bound Trp sites on the 50S subunit. This is due to a rotational freedom around the to EF-Tu and Trp-tRNA 2Y18 [38]. Such a structure is shown in hinge between H14 and H22 (nucleotides AA423) and a rigid binding Figure 63. The EF-Tu molecule would leave the ribosome soon under 2+ of two helices, H22 and H88 at a pseudoknot CCAU416/AUGG2410 the normal condition of low Mg concentration. The conformation of of 23S rRNA. C2394 and G2421 at the base of H88, which is empty the A-site tRNA is the same as those shown in Figures 56 and 57. Since as a result of the process shown in Figure 57, could turn to catch the the binding of A-site tRNA very close to P-site tRNA, the location of adenine base of the CCA end of deacylated tRNA at the P site [35]. which is almost the same as the other structures shown in Figures 55 According to chemical modification experiments, 2’-hydroxyl groups to 58, the cavity between the SRL region and P-site tRNA becomes a of the nucleotides 71 and 76 of the deacylated P-site tRNA can be little wider, where EF-G is supposed to bind. The structure of tRNA recognized by nucleotide C1892 in the minor groove of H68 and A2433 in gold, as shown in Figure 64, is a putative ancestral form of EF-G, in and A2434 at the base of H74 of 23S rRNA, respectively [36]. Those which the tRNA-like body in gold represents the domain IV of EF-G residues also constitute part of the E site on the 50S subunit. It would [39], while the other part represents the rest of the whole structure expel the P-loop at the loop of H80 as well as A2450 and its associated of EF-Tu. If the conformational change in the part of EF-Tu occurs nucleotides of the central loop of 50S subunit from the CCA end of after GTP hydrolysis, the structure very much like the one shown in P-site tRNA, that is the P site on the 50S subunit, and transfer them to Figure 62 would be obtained. Because the exposed form of the domain the CCA end of the neighboring A-site tRNA. These are what we call IV of EF-G, or the tRNA moiety in gold, would be very unstable, the

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 21-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

A-site tRNA accommodation catalyzed by EF-Tu does not seem to be affected by spectinomycin. The reason for this would be that the region of CGAGC1107 on the 3’-half of h35 is unfolded by interaction with the GTѰCG region of T-loop of aa-tRNA, or could be due to a slight difference in the distance between Pγ atom and 3’-end of P-site tRNA (27 ). GTP hydrolysis results in a big conformational change in EF-Tu with a right hand (amino acid sequence from Ala1 to Arg190) pushing theÅ GTPase-binding site and with a left hand (from Gly191 to Glu405) pulling the whole body of P-site tRNA. The final shape of

Figure 55. Stereoview of main-chain representations of tRNAs and mRNA. P-site tRNA (green) in the center. Nucleotides are G2645-C2652 (white) and G2668-A2675 (white), with a label A indicating the location of G2645. The region of U2653-C2667 (yellow) is the SRL of 23S rRNA. Label B denotes the location of A76 of tRNA at the P site, mRNA (red) with Label C for its first residue. The following regions of 16S rRNA are shown in different colors: G1064-G1077 (skyblue), C1103-C1107 (cyan), G527-A533 (yellow tint), U1189-A1191 (pink), G1338-A1339 with bases (gold), A1408-A1420 (orange), G1480-A1493 (orange), U1506-G1529 (blue), C1399-G1504 pair with bases (violet), and A1396 (brown). Labels from D to P denote the locations of C545, G922, U956, U991, A1041, G1117, A1180, A1236, G1334, A1350, C1384, C1452, and U1542, respectively.

Figure 58. Stereoview of main-chain representations of three tRNA binding model, in which the three tRNAs are arranged as corresponding to the local picture of the decoding site in Figure 52, and aa-tRNA (magenta) takes an isolated L-form with its U33-folded anticodon loop.

Figure 59. Stereoview of main-chain representation for the conformations of the 3’-end Figure 56. Stereoview of main-chain representations of transition-state three tRNAs, regions of tRNAs surrounded by the nucleotides at the H80 (P-loop) and H88. It is known A-site tRNA (magenta), P-site tRNA (green), and E-site tRNA (purple). This structure that C2422 can bind to the 3’-end adenine base (A76) of deacylated tRNA at either P or corresponds to the local picture of the decoding site represented in Figure 54. Red label Q E site. The loop of H88, A2407-G2410, is tightly bound with C413-U416 at the loop of denotes the location of C805 of 16S rRNA H22, which is drawn in yellow, and works as widely movable long rod, by fixing its one end at the position of G424 on the 50S subunit, between the 3’-end region of P-site tRNA (green) in the PRE state, and that of E-site tRNA (purple) in the POST state. This structure of 3’-end regions of two tRNAs on the 50S subunit in the PRE state. The regions of G1-U8 and A66-A76 of A-site tRNA (magenta), G1-U8 and A66-A76 of P-site tRNA (green), G2246-C2258 (H80) (red), C2395-G2421 (H88) (cyan), G406-C421 (H22) (yellow) of 23S rRNA and the other associated regions (greyish white) are drawn. This corresponds to the intermediate structure in the A/P and P/E hybrid states.

Figure 57. Stereoview of main-chain representations of three tRNA binding model, in which E-site tRNA (purple) takes an isolated L-form with its U33-folded anticodon loop. This structure corresponds to the local picture of the decoding site represented in Figure 53. structure would become immediately the same as that in Figure 63 after translocation. This is exactly what happened in the crystal structure of the ribosome shortly after binding of EF-G [40]. The reason why EF-G Figure 60. Stereoview of main-chain representation for the conformations of the 3’- dependent translocation is inhibited by binding of spectinomycin [31- end regions of tRNAs surrounded by the nucleotides at the H80 (P-loop) and H88. This 33] can be explained very clearly as the result of missing the contribution structure of 3’-end regions of two tRNAs on the 50S subunit in the POST state. The regions of G1-U8 and A66-A76 of P-site tRNA (green), G1-U8 and A66-A76 of E-site tRNA from the 5’-half of h35 in the mechanism of GTP hydrolysis shown (purple), G2246-C2258 (H80) (red), C2395-G2421 (H88) (cyan), G406-C421 (H22) in Figure 61. In contrast to the EF-G dependent translocation, the (yellow) of 23S rRNA and the other regions (greyish white) are drawn. This corresponds to the structure in the classical P/P and E/E states.

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 22-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

[43] will give us a lot of information about the protein-rRNA and protein- protein interactions. Such an analysis has not been done yet.

Figure 61. Stereoview of elongation factors during the course of A-site accommodation and translocation with GTP hydrolysis. Main-chan representation of GTP molecule (red) at the border of RSL from U2656 to G2661 (yellow) and 5’-half of h35 of 16S rRNA from G1069 to G1072 (skyblue)

Figure 62. Stereoview of elongation factors during the course of A-site accommodation A-site tRNA is just like a sitting-up of the T-site tRNA. On the other and translocation with GTP hydrolysis. The same model in Figure 61 is shown from the hand, GTP hydrolysis in translocation by means of EF-G produces a other direction that is from the 3’-end of P-site tRNA (green) toward the mRNA. When the codon-anticodon interaction is that of cognate, the conformations of the ASL and the elbow little different result from that of EF-Tu. This could be due to a slight region of T-site tRNA (magenta) induces the GTP hydrolysis, ending at the conformation difference in amino acid sequence between EF-Tu and EF-G (one of A-site tRNA as shown in Figures 56 and 57. Blue ribbon, and the other nucleotide amino acid insertion) and result in a different rotational movement chains represent the peptide chain of EF-Tu from A1 to E405 with VDHGKTT (20-26) missing, RSL region (C2652-C2667 in yellow), 5’-side of 16S rRNA (G1064-G1077 in with EF-G’s left hand pushing the whole body of A-site tRNA toward skyblue), T-site tRNA (A-U8, A14-U15, 5MC49, U68, U72 and A76 in magenta), and P-site tRNA and with its right hand pushing the GTPase-binding site P-site tRNA (G45-C56 and C61-G65 in green), respectively. The regions of T- and P-site to break the long helix made by the SRL region of 23S rRNA and 5’-side tRNAs are only shown for those in close contact with EF-Tu. The region of U72-A76 in Trp of h35 of 16S rRNA. The big distortion introduced by formation of the Trp-tRNA (brown) is taken to connect the main body of T-site tRNA and the EF-Tu polypeptide regions (ribbon) stretching toward GTP binding site by A1-R190 and P-site GTPase-binding site is cancelled and returns to the original state. This tRNA by G191-E405. movement of the whole ribosome represents exactly where the driving force of the ratchet-like movement of the ribosome [41] comes from. Evolution of ribosomal binding proteins and protein factors During the course of evolution of ribosome, various ribosomal binding proteins have also pursued their structural evolutions by changing their amino acid sequences. The present author has been carrying a question as to what type of amino acid sequence can selectively recognize their responsible locations on the 16S and 23S rRNAs. The number of ribosomal proteins is found to range between 50 – 60 in bacterial ribosomes and 70 – 80 in cytoplasmic ribosomes of eukaryotes [42]. In E. coli there are 21 small-subunit ribosomal proteins and 32 individual large-subunit ribosomal proteins [42]. Figure 63. Stereoview of elongation factors during the course of A-site accommodation and translocation with GTP hydrolysis. The location and conformation of EF-Tu is Using large subunit ribosomal proteins and ribosome (50S) from E. shown after the accommodation of A-site tRNA is finished, EF-Tu leaves the ribosome coli, Nierhaus performed the analysis on the stepwise assembly of 50S immediately. The rest of the model is the same as that of Figure 57. ribosome by reconstitution [43]. Nierhaus’s 50S assembly diagram shows that large subunit ribosomal proteins, L2, L3, and L4, recognizes somewhere on the 23S rRNA sequence around 1500, 2200 and 650, respectively. Figure 65 shows that L2 from E. coli has a considerable sequence homology with those from Paenibacillus pabuli (P. pab) and Thermus thermophillus (T. ther), L3 from E. coli with those from Alicyclobacillus sendaiensis (A. sen) and T. ther, and L4 from E. coli with those from Bacillus vallisimortis (B. vall) and T. ther. The three- dimensional atomic models of the present article are based on the Cartesian coordinates of Korosterev et al. [26] (PDB accession numbers are 2HGR for small subunit and 2HGU for large subunit). The locations of all ribosomal proteins for both 16S and 23S subunits are shown on the atomic models of the present work as from Figures 66 to 111. If we compare the sequence comparison of L2, L3 and L4, as shown in Figure Figure 64. Stereoview of elongation factors during the course of A-site accommodation 65 with the atomic coordinates of 23S rRNA and those of L2, L3 and L4, and translocation with GTP hydrolysis. The ancestral form of EF-G bound to the PRE state we might be able to obtain something about the idea about the strength ribosome is shown. The body of tRNA in gold corresponds to the domain IV of EF-G. The rest of the ancestral EF-G is the same as that of EF-Tu, which appeared in Figure 62. of the interactions between peptide structure and rRNA structure. In Since A-site is occupied by a big body of tRNA, the location of GTP binding to the border that case the result of the analysis of step-wise reconstitution of 50S subunit of RSL of 23S rRNA and 5’-half of h35 is shifted by as far as 27 from the P-site tRNA.

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 23-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

Ribosomal Protein L2

10 20 30 40 50 60 70 80 90 100 110 120 130 E. coli MAVVKCKPTSPGRRHVVKVV ------NPEL HKGKPFAPL LEKNSKSGGRNNNGR I TTRH I GGGHKQAYR I VDFKRNKD - - G I P AVVER LEYDPNRSAN I ALV LYKDGE RRY I LAPKGLKAGDQ I QSGVDAA I KPGNTLPMR P. pab P I K Y A NMSVS T ------FE I TD E KS APL QA Q K V H R K I T GR AT I TS I H A K I V F PE SD I A LE T. ther K F YT S FMTVADFSE I TKTE K S LV L ------K T Q V F R R L I W- KV K AA I R L H V K I D QV Q VVA P P QV A L

140 150 160 170 180 190 200 210 220 230 240 250 260 270 E. coli N I PVGSTVHNVEMKPGKGGQLARSAGTYVQ I VARDGAYVTLRLRSGEMRKVEADCRATLGEVGNAEHMLRVLGKAGAARWRGVRPTVRGTAMNPVDHPHGGGEGRNF - GK- HPVTPWGVQTKGKKTRSN - KRTDKF I VRRRSK P. pab TV I I L V A EA L L GKEE K - SV S V R I L KV I S QD E I K I RS L Q E VV N M AP I RK S MS KP L Y KKN AS Y I T T. ther F TV A LE K AK A SA -Q I QG E D I P L HGE Y V A D KN I RS L R H A APR RP - AS W L KRR PSSR I A K

Ribosomal Protein L3

10 20 30 40 50 60 70 80 90 100 110 120 130 E. coli M I GLVGKKVGMTR I FTEDGVS I PVTV I EVEANRVTQV - KDLA - NDGY RA I QVTTGAKKANRVTKPEAGHF AKAGVEAGR GLWEFR LAEGEEFTVGQS I SVE LFADVKK VDVTGTSKGKGFAGTVKRWNFRT - QDATHGNSL A. sen ------QV DVV GPCVVLQ RVL DK E L AD S A V A TAPK Y I R I GVNV YE QLKAD I PGD I V P I H QHRGPM - - K T. ther K I L V RD RA- V L AGPCP V R - RTPE - K T V LGFLPQNPK NR LK PV I R I D - - - FNPE DTVT I KPGER R VM AG - GPDS AHK

140 150 160 170 180 190 200 E. coli SHRVPGS I GQNQTPGKVFKGKKMAGQMGNERVTVQSLDVVRVDAERNL L LVKGAVPGATGSDL I VKP - AVKA A. sen Y GV L S - I A NRT QT R - - G N E K P K V I I S PRN FVT I RS - QPAKK T. ther I H NRK R Y HY A NM E D I P E PN GLV I VRET KKA K

Ribosomal Protein L4

10 20 30 40 50 60 70 80 90 100 110 120 E. coli MEL V I KDAQS ------A L TVS E TTFGRD FLEA L VHQ V V VA YAAGARQGTRAQ KTRAEVTGSGKKPWRQKGTGRARSGS I KSP I WRSGGVTFAARPQDHSQKVNKK MYRGA L K S I L S E L V RQDR L I V V E KF S V B. vall PK A L YNQNGSTA - GD I ELN ASV I E PN S V FD A I LMQR SL SHKV N S R G R Q R Q G V GPT R S Y Y L P VR L I A S K VDNS I V L D I T L

T. ther KE A V YQ I P VLSP S G R RELAADLP - AE I NPH LWE RWQL KR R AST G AY R I P H H D GA F VG V GPK R Y YT L P VR KKG AMA VA DR A EGK L L A A G

130 140 150 160 170 180 190 200 E. coli E AP KTK L LAQ K LK DMA LE DVL I I - T G E LDENLF LAARNLH K VDVR D ATG I D PVSL I AFD K V VMTADAV KQV EE MLA

B. vall D V EM A I G L S V K KAL V A DVN TVS S I P G T V Q D NVLDV V NHE L L I KA E K V

T. ther VNG E FL AWA E A G D GS ES VL L V TGN LVR R PW VT L A P E L NVYD I V R TE R L D L W E V FQNR I GGEA

Figure 65. Comparison of amino acid sequences of large subunit ribosomal proteins L2, L3 and L4. L2 sequences are taken for 1) E. coli, 2) Paenibacillus pabuli (P. pab) and 3) Thermus thermophillus (T. ther), L3 sequences are for 1) E. coli, 2) Alicyclobacillus sendaiensis (A. sen), and 3) T. ther and L4 sequences are for 1) E. coli, 2) Bacillus vallismortis (B. vall), and 3) T. ther . Common amino acids at the corresponding position with that of E. coli sequence are represented by blank spaces. Amino acid residues, which are common in three species and basic such as R or K, are drawn in blue, those, which are common in three species and acidic such as D or E, are drawn in red, and those, which are common in three sequences and hydrophobic such as V, or L, or I, or M, or F, are drawn in brown.

Figure 66. Binding site of ribosomal protein S2 on the present model. Blue ribbon Figure 68. Binding site of ribosomal protein S4 on the present model. Blue ribbon represents the location of the binding protein. represents the protein.

Figure 67. Binding site of ribosomal protein S3 on the present model. Blue ribbon Figure 69. Binding site of ribosomal protein S5 on the present model. Blue ribbon represents the protein. represents the protein.

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 24-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

Figure 70. Binding site of ribosomal protein S6 on the present model. Blue ribbon Figure 74. Binding site of ribosomal protein S10 on the present model. Blue ribbon represents the protein. represents the protein.

Figure 71. Binding site of ribosomal protein S7 on the present model. Blue ribbon Figure 75. Binding site of ribosomal protein S11 on the present model. Blue ribbon represents the protein. represents the protein.

Figure 72. Binding site of ribosomal protein S8 on the present model. Blue ribbon Figure 76. Binding site of ribosomal protein S12 on the present model. Blue ribbon represents the protein. represents the protein.

Figure 73. Binding site of ribosomal protein S9 on the present model. Blue ribbon Figure 77. Binding site of ribosomal protein S13 on the present model. Blue ribbon represents the protein. represents the protein.

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 25-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

Figure 78. Binding site of ribosomal protein S14 on the present model. Blue ribbon Figure 82. Binding site of ribosomal protein S18 on the present model. Blue ribbon represents the protein. represents the protein.

Figure 79. Binding site of ribosomal protein S15 on the present model. Blue ribbon Figure 83. Binding site of ribosomal protein S19 on the present model. Blue ribbon represents the protein. represents the protein.

Figure 80. Binding site of ribosomal protein S16 on the present model. Blue ribbon Figure 84. Binding site of ribosomal protein S20 on the present model. Blue ribbon represents the protein. represents the protein.

Figure 81. Binding site of ribosomal protein S17 on the present model. Blue ribbon Figure 85. Binding site of ribosomal protein THX on the present model. Blue ribbon represents the protein. represents the protein.

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 26-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

Figure 86. Binding site of ribosomal protein L1 on the present model. Blue ribbon Figure 90. Binding site of ribosomal protein L5 on the present model. Blue ribbon represents the protein. represents the protein.

Figure 87. Binding site of ribosomal protein L2 on the present model. Blue ribbon Figure 91. Binding site of ribosomal protein L6 on the present model. Blue ribbon represents the protein. represents the protein.

Figure 88. Binding site of ribosomal protein L3 on the present model. Blue ribbon Figure 92. Binding site of ribosomal protein L9 on the present model. Blue ribbon represents the protein. represents the protein.

Figure 89. Binding site of ribosomal protein L4 on the present model. Blue ribbon Figure 93. Binding site of ribosomal protein L11 on the present model. Blue ribbon represents the protein. represents the protein.

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 27-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

Figure 94. Binding site of ribosomal protein L13 on the present model. Blue ribbon Figure 98. Binding site of ribosomal protein L18 on the present model. Blue ribbon represents the protein. represents the protein.

Figure 95. Binding site of ribosomal protein L14 on the present model. Blue ribbon Figure 99. Binding site of ribosomal protein L19 on the present model. Blue ribbon represents the protein. represents the protein.

Figure 96. Binding site of ribosomal protein L15 on the present model. Blue ribbon Figure 100. Binding site of ribosomal protein L20 on the present model. Blue ribbon represents the protein. represents the protein.

Figure 97. Binding site of ribosomal protein L16 on the present model. Blue ribbon Figure 101. Binding site of ribosomal protein L21 on the present model. Blue ribbon represents the protein. represents the protein.

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 28-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

Figure 102 Binding site of ribosomal protein L22 on the present model. Blue ribbon Figure 107. Binding site of ribosomal protein L29 on the present model. Blue ribbon represents the protein. represents the protein.

Figure 103. Binding site of ribosomal protein L23 on the present model. Blue ribbon Figure 108. Binding site of ribosomal protein L30 on the present model. Blue ribbon represents the protein. represents the protein.

Figure 104. Binding site of ribosomal protein L24 on the present model. Blue ribbon Figure 109. Binding site of ribosomal protein L31 on the present model. Blue ribbon represents the protein. represents the protein.

Figure 105. Binding site of ribosomal protein L25 on the present model. Blue ribbon Figure 110. Binding site of ribosomal protein L35 on the present model. Blue ribbon represents the protein. represents the protein.

Figure 106. Binding site of ribosomal protein L27 on the present model. Blue ribbon Figure 111. Binding site of ribosomal protein L36 on the present model. Blue ribbon represents the protein. represents the protein.

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 29-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

Figure 112 shows the similarity of amino acid sequences of personal opinion of the present author is that the RNA structure of the effector region for four types of GTPases, viz. EF-Tu and its the ribosome would have started at the bottom of the mantle under related elongation factors (eEF-1), EF-G and its related ones (eEF- the strong influence of silicate rocks without much help of protein 2), termination factors (RF3), and initiation factors (IF-2). The factors and binding ribosomal proteins, when their environment, in proteins in the group of EF-Tu has a common sequence type such as particular, was controlled by high pressure and temperature, and that D****E**RGITI, while those in the group of EF-G has a sequence type the evolution of various types of protein structures helped the ancestral of D****E***G**I, and those in the group of RF3 has a sequence type form of ribosomes to be independent of the silicate rocks. There must of D****E**RGIS*. It seems important to note that all of them except have been an increasing trend of water concentration as well as alkali the group of IF-2 started their evolution from a common sequence metal concentration. In order to protect the hydrolytic effect of such of GTPase, but gradually diverged into three different sequences for trends, evolution of ribosomal proteins and cell membrane structure performing a slightly different functions with respect to 70S ribosome. must have been very important. This is the evolution of ribosome, It is, however, a quite different function with initiation factors that that is also highly associated with the evolution of aminoacyl-tRNA must start their function by splitting the 70S ribosome into 30S and 50S synthetase, very briefly described above. The story about the origin subunits. The amino acid sequence of the effector region of IF-2 seems of life begins from the completion of ribosome structure and how to tell us that it took a long time to reach the present state if it also started an assembly of messenger RNAs performs self-reproduction before its evolution from the common structure of the ancestral GTPase. The appearance of DNA in the world. This is the story of RNA world [6].

EF-Tu RF-3 Ecoli IGHVDHGKTTLTAAITT------VLAKTYGGAARAFDQIDNAPEEKARGITINTS Ecoli ISHPDAGKTTITEKVLLFGQAI------QTAGTVKGRGSNQHAKSDWMEMEKQRGISITTS Taqua IGHVDHGKTTLTAALTY------VAAAENPNVEVKDYGDIDKAPEEKARGITINTA Sente ISHPDAGKTTITEKVLLFGQAI------QTAGTVKGRGSSQHAKSDWMEMEKQRGISITTS Hinfl IGHVDHGKTTLTAAITT------VLAKHYGGAARAFDQIDNAPEEKARGITINTS Paeru ISHPDAGKTTITEKLLLMGKAI------AVAGTVKSRKSDRHATSDWMEMEKQRGISITTS Pfalc IGHVDHGKTTLTTAISY------LLNLQGLSKKYNYSDIDSAPEEKIRGITINTT B.APS ISHPDAGKTTITEKMLFMGKVI------RVPGTIKGRGSGKYAKSDWMNIEKERGISITTS clEgrat IGHVDHGKTTLTAAITT------MALAATGNSKAKRYEDIDSAPEEKARGITINTA Linno ISHPDAGKTTITEQLLLFGGVI------RSAGTVKGKKSGKFATSDWMEIEKQRGISVTSS clPpurp IGHVDHGKTTLTAAISA------TLSTLGSTAAKRFDEIDAAPEEKARGITINTA Llact ISHPDAGKTTITEQLLKFGGAI------REAGTVKARKTGNFAKSDWMDIEKERGISVTSS mtCeleg IGHIDHGKTTLTSAITR------VQAKKGFAKHIKFDEIDKGKEEKKRGITINVA Spneu ISHPDAGKTTITEQLLYFGGEI------REAGTVKGKKTGTFAKSDWMDIEKQRGISVTSS mtScere IGHVDHGKTTLTAAITK------TLAAKGGANFLDYAAIDKAPEERARGITISTA N7120 ISHPDAGKTTLTEKLLLYGGAI------HEAGAVKARRAQRKATSDWMAMEQQRGISVTSS eEF-1 Saure ISHPDAGKTTLTEKLLYFSGAI------REAGTVKGKKTGKFATSDWMKVEQERGISVTSS Scere IGHVDSGKSTTTGHLIYKCGGIDK------RTIEKFEKEAAELGKGSFKYAWVLDKLKAERERGITIDIA Atume IAHPDAGKTTLTEKLLLFGGAI------QLAGEVKAKKDRIQTRSDWMKIERERGISVVTS Tbruc VGHVDAGKSTATGHLIYKCGGIDK------RTIEKFEKEAADIGKASFKYAWVLDKLKAERERGITIDIA Bmeli IAHPDAGKTTLTEKLLLFGGAI------QLAGEVKAKKDRIQTRSDWMNIERDRGISVVTS Pfalc IGHVDSGKSTTTGHIIYKLGGIDR------RTIEKFEKESAEMGKGSFKYAWVLDKLKAERERGITIDIA Ccres ISHPDAGKTTLTENILLAGGAI------RAAGAVRARGENRRTRSDWMKIERERGISVSAS Zmays IGHVDSGKSTTTGHLIYKLGGIDK------RVIERFEKEAAEMNKRSFKYAWVLDKLKAERERGITIDIA Tpyri IGHVDSGKSTTTGHLIYKCGGIDK------RVIEKFEKESAEQGKGSFKYAWVLDKLKAERERGITIDIS Hsapi IGHVDSGKSTTTGHLIYKCGGIDK------RTIEKFEKEAAEMGKGSFKYAWVLDKLKAERERGITIDIS EF-G IF-2 Ecoli SAHIDAGKTTTTERILFY------TGVNHKIGEVHDGAATMDWMEQEQERGITITSA Ecoli MGHVDHGKTSLLDYIRSTKVASGEAGGITQHIGAYHV------ETENG-MITFLDTPGHAAFTSMRARGAQATDI S6803 AAHIDAGKTTTTERILFY------SGVVHKIGEVHEGTAVTDWMAQERERGITITAA Bsubt MGHVDHGKTTLLDSIRKTKVVEGEAGGITQHIGAYQI------EENGKKITFLDTPGHAAFTTMRARGAEVTDI Taqua AAHIDAGKTTTTERILYY------TGRIHKIGEVHEGAATMDFMEQERERGITITAA Saure MGHVDHGKTTLLDSIRHTKVTAGEAGGITQHIGAYQI------ENDGKKITFLDTPGHAAFTTMRARGAQVTDI Dradi AAHIDAGKTTTTERILFF------TGRIRNLGETHEGASQMDWVEQERERGITITAA clPpurp LGHVDHGKTSLLDYIHKSNNANKEVGGITQNIVAYEV------EFKHQQIVFLDTPGHEAFTSMRSRGANLTDI clGmax MAHIDAGKTTTTERILEY------TGRNYKIGEVHEGTATMDWMEQEQERGITITSA mtScere MGHVDHGKTTIIDYLRKSSVVAQEHGGITQHIGAFQI------TAPKSGKKITFLDTPGHAAFLKMRERGANITDI mtScere IAHIDAGKTTTTERMLYY------AGISKHIGDVDTGDTITDFLEQERSRGITIQSA mtSpombe MGHVDHGKTTLLDAFRKSTIASTEHGGITQKIGAFTV------PFDKGSKFITFLDTPGHMAFEAMRKRGANIADI mtRnorv SAHIDSGKTTLTERVLYYTGR------IATMHEVKGKDGVGAVMDSMELERQRGITIQSA eEF-2 aIF-2 Scere IAHVDHGKSTLTDSLVQR------AGIISAAKAGEARFTDTRKDEQERGITIKST Mther LGHVDHGKTTLLDHIRGSAVASREAGGITQHIGATEIPMDVIEGICGDFLKKFSIRETLPGLFFIDTPGHEAFTTLRKRGGALADL Bvulg iAHVDHGKSTLTDSLVAA------AGIIAQEVAGDVRMTDTRADEAERGITIKST Afulg LGHVDHGKTTLLDRIRKSKVVAKEAGGITQHIGATEVPLDVIKQICKDIWK---VEVKIPGLLFIDTPGHKAFTNLRRRGGALADL Athal IAHVDHGKSTLTDSLVAA------AGIIAQEVAGDVRMTDTRADEAERGITIKST Hsali LGHVDHGKTSLLDKIRGSAVIEGEAGAITQHIGATAVPAGHRLGGRRQLVD--PTEFDLPGLLFIDTPGHHSFFEHALGGGALADI Celeg IAHVDHGKSTLTDSLVSK------AGIIAGSKAGETRFTDTRKDEQERCITIKST Dmela IAHVDHGKSTLTDSLVSK------AGIIAGAKAGETRFTDTRKDEQERCITIKST Hsapi IAHVDHGKSTLTDSLVCK------AGIIASARAGETRFTDTRKDEQERCITIKST Ecoli SAHIDAGKTTTTERILFY------TGVNHKIGEVHDGAATMDWMEQEQERGITITSA

Figure 112.This picture shows a comparison of amino acid sequences of effector region for 4 GTPases, EF-Tu (and eEF-1), EF-G (and eEF-2), termination factor 3 (RF3), and initiation factor 2 (IF-2) (and aIF-2). The amino acid sequences near GTP binding site and highly conserved and acidic such as D and E are drawn in red, those of basic such as K or R are in blue, and those of the rest are in green. The abbreviated species name, the species name, the accession file name, and thir residue number range of the amino acid sequence are obtained from Swiss Protein Sequence Data Bank (PIR): (1) eubacterial elongation factor EF-Tu; E. coli, Escherichia coli EFECT 17-65, Taqua, Thermus aquaticus S00229 17-166. Hinfl, Haemophilus influenza E64078 17-65, Pfalc, Plasmodium falciparum plastid S72277 17-65, elEgrat, Euglena gracilis chloroplast EFEGT 17-66, clPpurp, Porphyra purpurea chloroplast S73208 17- 65, mtCeleg, Caemorhabditis elegans mitochondria T37273 50-98, mtScere, Saccharomyces cerevisiae mitochondria EFBYT 53-101. (2) eukaryotic elongation factor eEF-1α; Scere, S.cerevisiae EFBY1A 12-75. Tbruc, Trypanosoma brucei A54760 12-75, Pfalc, P. falciparum S21909 12-75, Zmays, Zea mays S66339 12-75. Tpyri, Tetrahymena pyriformis A49171 13-76, Hsapi, Homo sapiens EFHU1 12-75. (3) eubacterial elongation factor EF-G; E. coli, E. coli EFECG 15-65, S6803, Synechocytis sp. PCC 6803 S75863 15-65, T. aqua, T. aquaticus EFTWG 17-67, Dradi, Deinococcus radiodurans E75536 18-68, clGmax, Glycine max chloroplast S35701 105-155, mtScere, S. cerevisiae mitochondria S43748 46-96, mtRnorv, Rattus norvegius mitochondria S40780 52-105. (4) eukaryotic elongation factor eEF-2; Scere, S.cerevisiae A41778 24-72, Bvulg, Beta vulgaris T14579 24-72, Athal, Arabidopsis thaliana A96602 27-75, Celeg, C. elegans A40411 24-72. Dmela, Drosophila melanogaster S05988 24-72. Hsapi, H. sapiens EFHU2 24-72. (5) eubacterial peptide chain release factor RF-3; Ecoli, E. coli E91295 18-72. Sente, Salmonella enterica AD1072 18-72, Paeru, Pseudomonas aeruginosa B83159 16-70, B.APS, Buchnera sp. APS C84993 18-72, Linno, Listeria innocua AB1556 17-71, Llact, Lactococcus lactis E86668 15-69, Spneu, Streptococcus pneumoniae D97921 15-69, N7120, Nostoc sp. PCC 7120 AC2353 21-75, Saure, Staphylococcus aureus D89870 15-69, Atume, Agrobacterium tumefaciens AC2615 17-71, Bmeli, Brucella melitensis AB3539 15-69, Ceres, Caulobacter crescentus B87382 17-71. (6) eubacterial initiation factor IF-2; Ecoli, E. coli FIEC2 396-463, Bsubt, Bacillus subtilis A35269 224-291, Saure, S. aureus H89900 214-281, clPpurp, P. purpurea chloroplast S73178 268-335, mtScere, S. cerevisiae mitochondria S66706 150-219, mtSpombe, Schizosaccharomyces pombe mitochondria T39351 176-245. (7) archeal initiation factor aIF-2; Mther, Methanobacterium thermoautotropicum E69132 10-95, Aflug, Archaeoglobus fulgidus H69345 18-100, Hsali, Halobacterium salinarum T43849 20-103.

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 30-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

References 22. Endo Y, Wool IG (1982) The site of action of alpha-sarcin on eukaryotic ribosomes. The sequence at the alpha-sarcin cleavage site in 28S ribosomal ribonucleic acid. J Biol 1. Engel MH, Macko SA (1997) Isotopic evidence for extraterrestrial non-racemic amino Chem 257: 9054-9060. [Crossref] acids in the Murchison meteorite. Nature 389: 265-268. [Crossref] 23. Demeshkina N, Jenner L, Westhof E, Yusupov M, Yusupova G (2012) A new understanding 2. Chyba CF (1997) Origins of life. A left-handed solar systems? Nature 389: 234-235. of the decoding principle on the ribosome. Nature 484: 256-259. [Crossref] [Crossref] 24. Yusupov MM, Yusupova GZ, Baucom A, Lieberman K, Earnest TN, et al. (2001) 3. Chyba CF (1998) Origins of life. Buried beginnings. Nature 395: 329-330. [Crossref] Crystal structure of the ribosome at 5.5 A resolution. Science 292: 883-896. [Crossref]

4. Corliss JB, Dymond J, Gordon LI, Edmond JM, von Herzen RP, et al. (1979) Submarine 25. Selmer M, Dunham CM, Murphy FV 4th, Weixlbaumer A, Petry S, et al. (2006) thermal springs on the Galapagos rift. Science 203: 1073-1083. [Crossref] Structure of the 70S ribosome complexed with mRNA and tRNA. Science 313: 1935- 1942. [Crossref] 5. Martin W, Baross J, Kelley D, Russell MJ (2008) Hydrothermal vents and the origin of life. Nat Rev Microbiol 6: 805-814. [Crossref] 26. Korostelev A, Trakhanov S, Laurberg M, Noller HF (2006) Crystal structure of a 70S ribosome-tRNA complex reveals functional interactions and rearrangements. Cell 126: 6. Joyce GF (1989) RNA evolution and the origins of life. Nature 338: 217-224. [Crossref] 1065-1077. [Crossref] 7. Nagano K, Nagano N (2007) Mechanism of translation based on intersubunit 27. Prince JB, Taylor BH, Thurlow DL, Ofengand J, Zimmermann RA (1982) Covalent complementarities of ribosomal RNAs and tRNAs. J Theor Biol 245: 644-668. crosslinking of tRNA1Val to 16S RNA at the ribosomal P site: identification of [Crossref] crosslinked residues. Proc Natl Acad Sci U S A 79: 5450-5454. [Crossref] 8. Holbrook SR, Sussman JL, Warrant RW, Kim SH (1978) Crystal structure of yeast 28. Sprinzl M, Wagner T, Lorenz S, Erdmann VA (1976) Regions of tRNA important for phenylalanine transfer RNA II. Structural features and functional implications. J Mol binding to the ribosomal A and P sites. 15: 3031-3039. [Crossref] Biol 123: 631-660. [Crossref] 29. Nagano K, Nagano N (1997) Transfer RNA docking pair model in the ribosomal pre- 9. Nagano K (2015) Structural implications of universal complementarities in translation- and post-translocational states. Nucleic Acids Res 25: 1254-1264. [Crossref] High accuracy at the decoding site. Advances in Biol Chem 5: 151-167. 30. Ogle JM, Brodersen DE, Clemons WM Jr, Tarry MJ, Carter AP, et al. (2001) 10. Eriani G, Delarue M, Poch O, Gangloff J, Moras D (1990) Partition of tRNA synthetases Recognition of cognate transfer RNA by the 30S ribosomal subunit. Science 292: 897- into two classes based on mutually exclusive sets of sequence motifs. Nature 347: 203- 902. [Crossref] 206. [Crossref] 31. Bollen B, Davies J, Ozaki M, Mizushima S (1969) Ribosomal protein conferring 11. Cusack S, Berthet-Colominas C, Härtlein M, Nassar N, Leberman R (1990) A second sensitivity to the antibiotic spectinomycin in Escherichia coli. Science 165: 85-86. class of synthetase structure revealed by X-ray analysis of Escherichia coli seryl-tRNA [Crossref] synthetase at 2.5 A. Nature 347: 249-255. [Crossref] 32. Sigmund CD, Ettayebi M, Morgan EA (1984) Antibiotic resistance mutations in 16S 12. Ibba M, Sӧll D (2000) Aminoacyl-tRNA synthesis. Annu Rev Biochem 69: 617-650. and 23S ribosomal RNA genes of Escherichia coli. Nucleic Acids Res 12: 4653-4663. [Crossref] [Crossref] 13. Cech TR (1987) The chemistry of self-splicing RNA and RNA enzymes. Science 236: 33. Peske F, Savelsbergh A, Katunin VI, Rodnina MV, Wintermeyer W (2004) 1532-1539. [Crossref] Conformational changes of the small ribosomal subunit during elongation factor 14. Gutell RR (1993) The simplicity behind the elucidation of complex structure in G-dependent tRNA-mRNA translocation. J Molec Biol 343: 1183-1194. [Crossref] Ribosomal RNA. In: Nierhaus KH., Franceschi F., Subramanian AR., Erdmann VA. 34. Valle M, Zavialov A, Sengupta J, Rawat U, Ehrenberg M, et al. (2003) Locking and and Wittmann-Liebold B., Eds., The translational apparatus, structure, function, unlocking of ribosomal motions. Cell 114: 123-134. [Crossref] regulation, evolution, plenum press, New York, 477-488. 35. Sergiev PV, Lesnyak DV, Kiparisov SV, Burakovsky DE, Leonov AA, et al. (2005) 15. Brimacombe R (1995) The structure of ribosomal RNA: a three-dimensional jigsaw Function of the ribosomal E-site: a mutagenesis study. Nucleic Acids Res 33: 6048- puzzle. Eur J Biochem 230: 365-383. [Crossref] 6056. [Crossref] 16. Lotfield RB, Vanderjagt D (1972) The frequency of errors in protein synthesis.Biochem 36. Feinberg JS, Joseph S (2001) Identification of molecular interactions between P-site J 128: 1353-1356. [Crossref] tRNA and the ribosome essential for translocation. Proc Natl Acad Sci U S A 98: 11120- 17. Hopfield JJ (1974) Kinetic proofreading: A new mechanism for reducing errors in 11125. [Crossref] biosynthetic process requiring high specificity. Proc Natl Acad Sci U S A 71: 4135- 37. Schmeing TM, Voorhees RM, Kelley AC, Gao YG, Murphy FV 4th, et al. (2009) The 4139. [Crossref] crystal structure of the ribosome bound to EF-Tu and aminoacyl-tRNA. Science 326: 18. Szer W, Ochoa S (1964) Complexing ability and coding properties of synthetic 688-694. [Crossref] polynucleotides. J Mol Biol 8: 823-834. [Crossref] 38. Schmeing TM, Voorhees RM, Kelley AC, Ramakrishnan V (2011) How mutations in 19. Thompson RC, Dix DB, Gerson RB, Karim AM (1981) Effect of Mg2+ concentration, tRNA distant from the anticodon affect the fidelity of decoding. Nat Struct Mol Biol polyamines, streptomycin, and mutations in ribosomal proteins on the accuracy of 18: 432-436. [Crossref] the two-step selection of aminoacyl-tRNAs in protein biosynthesis. J Biol Chem 256: 39. AEvarsson A, Brazhnikov E, Garber M, Zheltonosova J, Chirgadze Y, et al (1994) 6676-6681. Three-dimensional structure of the ribosomal translocase: Elongation factor G from 20. Egebjerg J, Larsen N, Garrett RA (1990) Structural map of 23S rRNA. In: Hill EE., Thermus thermophiles. EMBO J 13: 3669-3677. [Crossref] Dahlberg A, Garrett RA., Moore PB., Schlesinger D., Warner JR., Eds., The ribosome 40. Gao YG, Selmer M, Dunham CM, Weixlbaumer A, Kelley AC, et al. (2009) The structure, function and evolution, American society for microbiology, Washington DC, structure of the ribosome with elongation factor G trapped in the posttranslocational USA, 168-179. state. Science 326: 694-699. [Crossref] 21. Moazed D, Noller HF (1987) Interaction of antibiotics with functional sites in 16S 41. Frank J, Agrawal RK (2000) A ratchet-like inter-subunit reorganization of the ribosome ribosomal RNA. Nature 327: 389-394. [Crossref] during translocation. Nature 406: 318-322. [Crossref]

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 31-32 Nagano K (2017) Abiogenic evolution of ribonucleic acid as a left-handed helix and its protecting substances on primordial earth – A story about evolution of ribosome and the origin of life

42. Wittmann HG, Littlechild JA, Wittmann-Liebold B (1979) Structure of ribosomal 43. Nierhaus KH (1979) Analysis of the assembly and function of the 50S subunit from proteins. In: Chambliss G., Craven GR., Davies J., Davis K., Kahan L., Nomura M. Escherichia coli ribosomes by reconstitution. In: Chambliss G., Craven GR., Davies J., (Eds.), Ribosomes: structure, function, and genetics. University Park Press, Baltimore, Davis K., Kahan L., Nomura, M. (Eds.), Ribosomes: structure, function, and genetics. USA, 51-88. University Park Press, Baltimore, USA, 267-294.

Copyright: ©2017 Nagano K. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Integr Mol Med, 2017 doi: 10.15761/IMM.1000284 Volume 4(3): 32-32