INFORMATION TO USERS
The most advanced technology has been used to photo graph and reproduce this manuscript from the microfilm master. UMI films the original text directly from the copy submitted. Thus, some dissertation copies are in typewriter face, while others may be from a computer printer.
In the unlikely event that the author did not send UMI a complete manuscript and there are missing pages, these will be noted. Also, if unauthorized copyrighted material had to be removed, a note will indicate the deletion.
Oversize materials (e.g., maps, drawings, charts) are re produced by sectioning the original, beginning at the upper left-hand corner and continuing from left to right in equal sections with small overlaps. Each oversize page is available as one exposure on a standard 35 mm slide or as a 17" x 23" black and white photographic print for an additional charge.
Photographs included in the original manuscript have been reproduced xerographically in this copy. 35 mm slides or 6" x 9" black and white photographic prints are available for any photographs or illustrations appearing in this copy for an additional charge. Contact UMI directly to order.
r
Accessing the World'sUMI Information since 1938
300 North Zeeb Road Ann Arbor Ml 48106 1346 USA Order N um ber 8726G95
DNA sequences and comparison of argininosuccinate synthetase genes from two methanogenic archaebacteria
Morris, Christina Jane, Ph.D.
The Ohio State University, 1987
UMI 300 N. Zeeb Rd. Ann Arbor, MI 48106 PLEASE NOTE:
In all cases this material has been filmed in the best possible way from the available copy. Problems encountered with this document have been identified here with a check mark V
1. Glossy photographs or pages ____
2. Colored illustrations, paper or print ______
3. Photographs with dark background _____
4. Illustrat'ons are poor copy ______
5. Pages with black marks, not original copy ____
6. Print shows through as there is text on both sides of page ______
7. Indistinct, broken or small print on several pages S
8. Print exceeds margin requirements ______
9. Tightly bound copy with print lost in sp in e ______
10. Computer printout pages with indistinct print ______
11. Page(s)______lacking when material received, and not available from school or author.
12. Page(s)______seem to be missing in numbering only as text follows.
13. Two pages numbered ______, Text follows.
14. Curling and wrinkled pages ______
15. Dissertation contains pages with print at a slant, filmed as received ______
16. Other ______
University Microfilms International DNA SEQUENCES AND COMPARISON OF AR6IHIN0SUCCINATE SYNTHETASE
GENES FROM TWO METHANOGEN1C ARCHAEBACTERIA
DISSERTATION
Presented in Partial Fulfillment of the Requirement for
the Degree Doctor of Philosophy in the Graduate
School of the Ohio State University
By
Christina Jane Morris, B Sc , M.S.
The Ohio State University
1987
Reading Committee: Approved By
John N. Reeve
Charles J Daniels
Darrell R. Galloway
Berl R. Oakley Adviser
Department of Microbiology PLEASE NOTE:
Copyrighted materials in this document have not been filmed at the request of the author. They are available for consultation, however, in the author's university library.
These consist of pages:
Cartoon in pre-pages
University Microfilms International 300 N ZE EB RD . ANN AH BOB. Ml 48106 (313) 7fj 1 4 700 T o Mum and Dad
ii ACKNOWLEDGEMENTS
I uiould like to thank my adviser, Dr John Reeve, for his advice, encouragement and support throughout this project
Thanks also go to the past and present members of Dr Reeve's and Dr Daniels’ labs for their assistance, ideas, understanding in times of need and, most of all, friendship Special thanks go to Dr Chuck Daniels for the use of his computer and office.
To quote Maureen Stapleton: "I mould like to thank everyone I’ve ever met in my mhole life", but perhaps I’ll keep it short and finally thanx my family for their love, support and phone calls VITA
October 4, 1959 ...... Born - Bristol. England.
1981 ...... B.Sc. (Honours), Queen Mary Col lege, Uni vers it y of London/ London. England.
1981-1982 ...... Teaching Associate. Department of Microbiology, The Ohio State University. Columbus. Ohio, USA
1982-1987 Research Associate, Department of Microbiology, The Ohio State University
1983 ...... M S The Ohio State Un i versity
1987 ...... UCLA International School on Mo1ecu 1ar Evo1ut ion Fellowship. Los Angeles, Ca1i forn i a
PUBLICATIONS
Morris, C.J. and JN. Reeve <1984). Functional expression of an archaebacterial gene from the methanogen Methanosarcina barken in Escherichia coli and Bacillus subtilis In, Microbial Growth on Cl Compounds. Morris, C J and J N Reeve <1985) Expression of a methanogen gene in Escherichia coli and in Bacillus subtilis In, Proceedings of the First International Symposium on Biotechnological Advances in Processing Municipal Wastes for Fuels and Chemicals Reeve, J.N., P.T Hamilton, G.S. Beckler, C.J. Morris and C H Clarke <1986). Structure of methanogen genes. Syst Appl. Microbial. 7-. 5-12 iv Reeve, J N., G.S. Beckler, J.W Brown, D.S. Cram, E.S. Haas, P.T. Hamilton, C.J. Morris, B.A, Sherf and CF. Weil <1986). Divergence of methanogens, conservation of the his 1 gene sequence in all three biological kingdoms and the status of Methanohacterium thermoautotrophicLim Proceedings of the Fifth Inter national Symposium on Growth on Cl Compounds. Haren, Netherlands, P255-260. FIELD OF STUDY Molecular Biology TABLE OF CONTENTS Page ACKNOWLEDGEMENT...... ni UITA ...... iv LIST OF TABLES viii LIST OF FIGURES ix LIST OF ABBREUIAT IONS x INTRODUCTION I LITERATURE REUIEW 4 Genome structure ...... 4 DNA-dependent RNA polymerases ...... 6 Ar chaebac terial ribosomes ...... 7 7S RNA 11 Protein-encoding genes from ar'chaebacteria 11 Computer analysis of amino acid and DNA sequences ...... 14 Argininosuccinate synthetase- genes and pseudogenes ...... 17 METHODS ...... 19 Grouith of cells ...... 19 Isolation of methanogen DNA ...... 19 Large scale isolation of plasmids from E, coli ...... 21 Large scale isolation of plasmids from £ subtilis 23 Small scale isolation of plasmids from E coli and £ subtilis ...... 24 Preparation and transformation of competent cells of £ coli...... 25 Preparation and transformation of competent cells of £ subtilis 26 Restriction and ligation of plasmid DNA...... 27 Agarose and polyacrylamide gel electrophoresis ...... 29 Minicell preparation and radioactive labeling of plasmid encoded polypeptides ...... 29 Nick translation ...... 31 vi Radioactive labeling of oligomer probes 32 DNADNA hybridization with dehydrated agarose gels ...... 33 DNADNA hybridization with Zeta-Probe membrane. 34 Maxam and Gilbert sequencing 35 6 reaction ...... 37 G + A reaction 38 C * T reaction ...... 38 C reaction ...... 38 Strand cleavage reaction ...... 39 M13 sequencing. . 40 RESULTS...... 43 Construction of plasmids pET821, pET822, pET8£3, PET824 and pET825 43 Expression of methanogen derived DNA in JL suhtilis 46 Polypeptides synthesized in DS410 minicells encoded by pET821> pET823, pET824 and pET825 47 Nucleotide sequence of pET821 53 Nucleotide sequence of pET371 ...... 68 Codon usage ...... 67 Computer analysis of carbamyl phosphate synthetase genes ...... 70 Computer analysis of argininosuccinate synthetase genes ...... 76 Construction of an evolutionary tree for argminosuccinate synthetase genes ...... 86 Occurrence of the 14 bp and 29 bp direct repeats in £3 barken MS and £3 barkeri 227 89 DISCUSSION...... 94 Expression of methanogen derived DNA in tuuo eubacterial systems ...... 94 Sequences needed for expression of genes in methanogens ...... 98 U barkeri direct repeats ...... 99 Codon usage and gene organization in £3 barkeri and £3 uannielii 101 Evolutionary relationship of ar gininosuccinate synthetase genes 103 Introns within the human argminosuccinate synthetase gene 105 Carbamyl phosphate synthetase in£3 barkeri 106 CONCLUSIONS 108 BIBLIOGRAPHY 110 vii LIST OF TABLES Table P age 1 List of microorganisms 20 2 Codon usage 68 3 X Similarity between carbamyl phosphate synthetases of different species 77 4 X Similarity between argminosuccinate synthetases of different species. 85 viii LIST OF FIGURES Figure Page 1 Plasmids derived from pET821 ...... 44 2. Autoradiogram of the electrophoretic separation of radioactively labeled polypeptides in plasmid containing £.. coli minicells ...... 49 3 Autoradiogram of the electrophoretic separation of radioactively labeled polypeptides in plasmid containing £.. coli mimcells ...... 51 4 Detailed restriction map of 2.1 Kbp of pET821 DNA ...... 54 5 Nucleotide sequence of 2 1 Kbp of pET821 DNA 57 6, Detailed restriction map of pET371 DNA ...... 61 7. Nucleotide sequence of pET371 DNA ...... 63 8 Diagon analysis of carbamyl phosphate synthetase ...... 71 9. Carbamyl phosphate synthetase amino acid comparison ...... 74 18. Diagon analysis of argininosuccinate synthetase ...... 78 11. Argininosuccinate synthetase amino acid comparison ...... 80 12 Amino acid residues conserved between M barken. £1 vanmelii and human argininosuccinate synthetase sequences 82 13. Evolutionary tree for argminosuccinate synthetase ...... 87 14. Autoradiogram of DNA^DNA hybridizations between fcl harkeri DNA and radioactively labeled PET371 probes . . 91 IX LIST OF ABBREVIATIONS Abbreviation. Full Name A adenosine ADP adenosine diphosphate aEF archaebac teria) elongation factor ala alanine AMP adenosine monophosphate AMP ampicillin asn asparagine asp aspartic acid ATP adenosine triphosphate Aggg absorbance at 260 nm A*5Q0 absorbance at 580 nm bp base pair C cytosine C- carboxyl CAM chloramphenicol CAT chloramphenicol acetyl transferase cDNA complementary DNA CoM coenzyme M (2-mercaptoethanesulfonic acid) ddNTP dideoxy NTP DNA deoxyribonucleic acid dNTP deoxy NTP DTT dithiothreitol EDT A ethylenediaminete trace tic acid EF elongation factor x EMBL European Molecular Biology Laboratory EtBr ethidium bromide G guanosine g grams g gravitational force h hour HumASPs human argininosuccinate synthetase pseudogene lie isoleucine 1PT6 isopropyl thiogalac toside KAN kanamycin Kbp kilobase pairs 1 liter LB Luria broth leu leucine M molar mg milligram's) min minute(s) ml millili ter MM M9 minimal medium mM millimolar mRNA messenger RNA N any base A, C, G or T N- amino nm nanometers 0^860 optical density at 860 nm ORF open reading frame PAGE polyacrylamide gel electrophoresis XI PAUP Phylogenetic Analysis Using Parsimony PFP paraf luor opheny Lalanine phe phenylalanine PIR Protein Information Resource PPi inorganic pyrophosphate PFO 2,b, diphenyl-l,3-oxazole pro proline R purine A or G RNA ribonucleic acid rRNA ribosomal RNA S Svedberg sedimentation unit Sfljj association coefficient SDS sodium dodecyl sulfate ser serine SSC standard sodium citrate <4 41g Na citrate^l, 877g NaCl^l) T thymidine TA Trts acetate (40mN Tris.HCl pH 8 0, 0.2mM CH3COONa, 2mM EDTA) TBE Tris borate (89mM Tns.borate, 89mM boric arid, 2mM EDTA, pH 8 0) TCA trichloroacetic acid TE 18mM Tris.HCl pH 8 5, ImM EDTA TEMED N,N,N' ,N' ~tetramethylethylenediamine TES 30mM Tris.HCl pH 80, 50mM NaCl, 5mM EDTA TET tetracycline Tris tris(hydroxymethyl>aminomethane tRNA transfer RNA trp tryptophan xii tyr tyrosine U uracil UU ultraviolet U volts uj/ u weight by volume X-gal 5-bromo-4-chloro-3 indoly 1 galacto- pyranoside in dimethy If or mamide Y pyrimidine C or T U9 microgram Ml microliter (s > ju M micromolar Mill Introduction The existence of three primary kingdoms, eukaryotes, eubacteria and archaebacteria, uias proposed by Noese and Fox in 1977 (143) Before 1977, the classification of living organisms had separated them into the eukaryotes, those organisms which had a nuclear membrane and cellular organelles, and the prokaryotes, those that did not Further classification within these kingdoms using morphological and physiological traits, which could not be distinguished as having arisen through divergent or convergent evolution, was in retrospect misleading. Zuckerkandl and Pauling (153) suggested the study of "semantides* (informational macromolecules) would provide a better indication of evolutionary history and the advent of macromolecular sequencing has allowed this hypothesis to be tested. UJoese and Fox chose the 1£S ribosomal RNA CrRNA) component of the 30S subunit of the ribosome and its eukaryotic equivalent, the 18S rRNA from the 40S ribosomal subunit as the base for their studies. The 16S^18S rRNA molecule is ideal in that it is present in all organisms, maintains the same function in each, is sufficiently large and has both highly conserved and not so highly conserved sequences which allows analysis of both distant and close phylogenetic relationships. Data obtained from the cataloguing of oligonucleotides produced by ribonuclease (RNase) T1 digestion of 1£S/'18S rRNA molecules from very diverse organisms, suggested that the prokaryotes comprise two phylogenetically 2 distinct groups, now designated the eubacteria and the archaebacteria. Thermoplasma sp , extreme thermophiles (or sulfur dependent archaebacteria), extreme halophiles and methanogens are grouped together to form the archaebacteria. Their classif ication as a kingdom, separate from the eubacteria and eukaryotes, is supported by their cell uiall composition <67, 73, 137>, SNA-dependent RNA polymerase subunit compositions <123, 152>, ether-linked lipids <57, 81, 129), transfer RNA base modifications (10, 48), 5S rRNA structure (41) and the presence of unique coenzymes <1, 3, 21, 37, 46, 87) Further inquiry into the molecular biology of the archaebacteria has shown that although they are prokaryotic and similar in size to the eubacteria, members of the archaebacteria also have features in common with the eukaryotes Intervening sequences (introns) have been found in rRNA and tRNA genes <29, 66, 69, 78), SNA-dependent RNA polymerase subunits are immunologically related to eukaryotic RNA polymerases <150, 151) and diphtheria toxin is capable of ADP-ribosylating archaebacterial and eukaryotic elongation factors but not eubacterial elongation factor EF-G (68) It has been proposed, therefore, that "simple" prokaryotes have not evolved into the more complex eukaryotes but that each of the three kingdoms has arisen independently from a common ancestor, the progenote (142, 143, 144) Methanogens, obligately anaerobic organisms which generate methane by growth on Cl compounds, are the most diverse group within the archaebacteria. Studies of genome organization and gene structure in methanogenic archaebacteria were initiated in 1982 (16, 49, 109). It uias demonstrated that cloned fragments of methanogen genomic DNA could complement 3 Escherichia coli auxotrophic mutations in the his A <26, 145), PurE <49, 50, 51>, proC (49, 50, 52) and *caG <49, 50, 145) genes 0 continuation of the study of the Methanosarcina barkeri derived DNA capable of complementing £. coli argG and Bacillus subtilis ar £L barkeri DNA, clone and determine the DNA sequence of the homologous gene from Methanococcus vannielii. a methanogen but very distant relative of £L barkeri. and make an evolutionary comparison to the human argininosuccinate synthetase cDNA (13) These experiments mere undertaken to provide information and insight into the evolutionary relationships betuieen tuio methanogens and between members of the three biological kingdoms. LITERATURE REU1EM The proposed evolution of the archaebacteria into a phylogenetic ally separate kingdom is supported by studies of their molecular biology Information that is currently available is presented here to indicate the similarities betuieen the archaebacteria and the differences between the archaebacteria and the eubacteria and eukaryotes Knowledge pertaining to methanogens will be emphasized. Genome structure. The prokaryotic nature of archaebacteria emphasizes the similarities with the eubacteria. The genome of Me th ano bacterium thermoautotrophicum at 1.1 x 109 daltons is 402 the size of the E. coli genome, whereas the Halobacterium halobium genome is equivalent in size to that of L coli at 2.5 x 109 daltons <1111. A small percentage (€%> of the M. thermnjiitnirnohirnm genome was determined, by renaturation kinetics, to contain repeated DNA sequences <100) 1L halobium has been shown to have a large number of repeated DNA sequences arranged in clusters and dispersed around its chromosome <111, 112). These repeated DNA sequences comprise greater than fifty families, are highly mobile and repeat sequence-associated genomic rearrangements occur at frequencies >4 x 10-3 events per family per cell generation <112). Nine different insertion elements have now been identified 4 in fcL hjlnhmm and one in the methanogen Methanohreuiharter smithii (33, 52V The majority of these insertion elements ere structurally similar to insertion elements in eubacteria and eukaryotes; at least one open reading frame uiithin the element, terminal inverted repeats and duplication of the target DNA sequences. The m ol/t G + C of the archaebacteria covers a very broad range Within the three orders of the methanogens. Me thanobac. ter tales . Methanococcales, and Methanomicrobiales. molG + C values range from 25 8X (Methanosphaera stadtmamae) to 61.2/f acidophilum has a molZ G+C of 46 and the other members of the sulfur-dependent, extremely thermophilic archaebacteria have mo 1/i G + C values that range betuieen 31 and 62. The major ON A fraction of the extreme halophiles has a narrower range of mo IX G + C between 60 and 68, however molX G + C of the minor DNA fractions of repeated DNA sequences are lower, between 49 7 and 59 Extrachromosomal DNA has been found in a number of archaebacteria, most often in halophiles (101). The first report of a cryptic plasmid in a methanogen was from JtL thermoautotrophicum (88) and this has been more recently followed by others in Methanococcus isolate PL-12/M <127) and Methanococcus iannaschii (147). Baresi and Bertani <4) reported the presence of a phage isolated from Methanohreuibacter sp , and a variety of other phages have been isolated from Halobacterium and Thermoproteus s p p <60, 116, 130) Covalently closed circular prophages have been reported in Sulfolnbus spp <84) and fcL iannaschii (146) To date, only DNA-containing phages have been identified. 6 DNA-dependent RNA Pfllumerases Eubacteria have a single DNA-dependent RNA polymerase with a minimal subunit composition of p.D'jtzg,? (150) whereas there are three eukaryotic RNA polymerases I, II and 111 which have much more complex subunit configurations (151, 152). Archaebacterial RNA polymerases are more similar to the eukaryotic RNA polymerases containing at least eight subunit polypeptides (151, 152). Antisera, raised against the large subunits of Saccharomuces carevisiae DNA-dependent RNA polymerases, cross react more strongly with archaebacterial RNA polymerases than with eubacteria RNA polymerases (45, 152) Based on immunological cross-reactivities of subunits, functional relationships have been proposed for the largest subunit B of the thermoacidophile enzyme with eubacterial 0 subunit, the second Largest eukaryotic subunit and subunits B' and B' ' of the methanogens and halophiles. The second largest subunit of the thermoacidophile enzyme, subunit A, is related to the p* subunit of E_ coli polymerase and the largest subunit of the eukaryotic polymerase. The effects of various antibiotics have also indicated similarities between the RNA polymerases of the three kingdoms. Both eukaryotic RNA polymerase 1 and 1. acidophilum RNA polymerase are stimulated by silybin whereas eubacterial RNA polymerases and eukaryotic RNA polymerases II and 111 are not (152). Archaebacterial RNA polymerases are similar to the eukaryotic RNA polymerases in that they are insensitive to rifampin and streptolydigin, inhibitors of eubacterial RNA polymerase (152) however like eubacterial RNA polymerases archaebacterial RNA polymerases are resistant to cc-amanitin, a strong inhibitor of eukaryotic RNA polymerases II and III (152) 7 Archaebacterial ribosomes The archaebacterial and eubacterial ribosomes resemble each other in size (70S), are comprised of two subunits <30S and 50S>, and contain only three ribosomal RNAs, 16S, 23S and 5S (134). Eukaryotic ribosomes are larger in size (80S), comprised of two larger subunits (40S and 60S), and have, in most cases (135), a 5.8S rRNA molecule in addition to 18S, 28S and 5S rRNAs. Cataloguing of 16S and 18S rRNA oligonucleotides, resulting from RNase T1 digestion, was the original basis for dividing the prokaryotes into the eubacteria and archaebacteria (2, 42, 143) As complete sequences of 16S/'18S rRNA become available (65, 77, 83, 132, 148) phylogenetic relationships continue to became clearer. The complete sequences for nine archaebacterial 16S rRNAs have supported the concept of extensive diversity within the archaebacterial kingdom and a clear distinction between the archaebacteria, the eubacteria and the eukaryotes (148). The sequence data divided the archaebacteria into two phyla, one composed of the methanogens, halophiles and Thermootasma spp and the second composed of the aerobic and anaerobic extreme thermophiles <2, 148). In addition, the sequences indicated a specific relationship between the halobacteria and the Methanomicrobiales. which was unclear when only partial sequencing data was available (2). The pyrimidine rich 3' end of the 16S rRNA molecule in eubacteria is involved in binding of mRNA to eubacterial ribosomes to initiate translation (119) A pyrimidine rich region is also present at the 3' end of the archaebacterial 16S rRNA molecule (2) and is assumed to function in a similar manner in methanogens (52). The data are ambiguous for a similar role in H. halobium (11, 36) The bacterio-opsin mRNA of H. halobium contains only three nucleotides at the 5' terminus preceding 8 the initiation codon, uihich is much shorter than the Leader sequences found in most eubacterial mRNAs. 8 pyrimidine rich sequence complementary to the 3' end of the halobacterial 16S rRNA does occur howeuer, downstream of the initiation codon, within the protein encoding sequence Archaebacterial SS rRNA secondary structures haue features in common with the eukaryotic or eubacterial SS rRNAs and also haue unique features <41> Halnbacterium tnorrhuae SS rRNA has an extra 188-nucleotide sequence present in its 231 nucleotide sequence that is otherwise analogous to Halobactenum rutirubrum. howeuer the additional bases do not seem to disrupt the secondary structure of the molecule. An intron has been identified in the 23S rRNA gene of Desulfurococrus mobilis <69). The intron contains the consensus sequence necessary for splicing of rRNA introns (class 1) in eukaryotes (19) Other proper ties howeuer, including putatiue base pairing between the ends of the RNA and the absence of a uridine at the 3' end of the 5' exon, are charac teristic of eukaryotic class 111 introns The gene order of structural rRNAs in archaebacteria is similar to that in eubacteria. S' 16S-23S-SS, except in L acidonhilum where the genes are not Linked <56, 59) In the methanogens and halobacteria, the rRNA genes form an operon with transfer RNA (tRNA) genes between the 16S and 23S rRNA genes and in some cases also distal to the SS rRNA gene <139, 148). fcJL uannielii contains four rRNA operons and an additional unlinked SS rRNA gene; unlinked SS rRNA genes are a feature of eukaryotes <139, 141). The unlinked SS rRNA gene is Located in a cluster of seuen tRNA genes which may all be transcribed as an operon. In contrast to the methanogens, most other archaebacteria studied so far, have the unusual feature of only one set of rRNA genes. Many tRNAs genes have been characterized, from a variety of archaebacteria <29, 47, 48, 66, 141), the most extensive studies describe the tRNA genes from Halobactenmm uoldanii <29, 47> and £1- uannielii <141). The archaebacteria! tRNA genes are mostly similar to eukaryotic tRNA genes in that they do not encode the 3' CCA terminal sequences found on mature tRNAs <138) Only one tRNA gene gene cluster which contains the 5S rRNA gene, encodes the 3' CCA (138). To date, seven tRNA genes from archaebacteria have been shouin to contain introns, tuuo from Sulfolohus solfataricus. four from Thermoproteus tenax and one from H volcanii <29, 66, 140). The £, solf ataricus tRNAser and tRNA**u introns are 25 and 15 nucleotides long respectively, and the lengths of the JL tenax intron sequences range from 16 to 20 nucleotides whereas the U. volcanii tRNA*,rp intron is 185 nucleotides in length. Most of the introns are located 3' to the anticodon, as are introns in eukaryotic tRNAs. These tRNA molecules are capable of forming a normal cloverleaf pattern if the introns are not included in the tRNA secondary structure. Three of the L tenax introns are unusual in that they are located either within or 5' to the anticodon sequence topics under investigation Similarities between archaebacteria and eukaryotes have been found in studies of elongation factors involved in 10 translation Diphtheria toxin catalyzes the covalent binding of A DP (adenosine diphosphate) to the eukaryotic polypeptide elongation factor EF-2. The toxin does not houiever ribosylate the eubacterial equivalent of EF-2, namely EF-G Like the eukaryotic translocation factor, the archaebacterial aEF-2 contains a post-translational ly modified histidinyl r esidue, diphthamide, uihich is ADP-ribosylated by diphtheria toxin (68, 100). Although the eukaryotic and archaebacterial elongation factors are similar they cannot substitute for each other or for eubacterial EF-G in in vitro polypeptide synthesis using ribosomes prepared from organisms from each kingdom <71) The aminoacyl-tRNA binding factor from archaebacteria aEF-lcc is inter changeable in vitro with the eukaryotic EF-lac and probably the eubacterial EF-Tu (71). However eukaryotic and eubacterial aminoacyl-tRNA binding factors are rarely interchangeable Archaebacterial and eubacterial growth is sensitive to neomycin, virginiamycin and chloramphenicol however studies in vitro showed streptomycin and chloramphenicol, inhibitors of eubacterial 70S ribosomes, do not bind to ribosomes from M. barkeri (115). This explains the resistance of all archaebacteria to streptomycin. Inhibition of methanogen growth by chloramphenicol is related to the presence of a nitro group on the chloramphenicol molecule. This compound probably acts as an oxidizing agent (6). Anisomysin, an inhibitor of eukaryotic translation <113) and virginiamycin, an inhibitor of eubacterial ribosomes, both inhibit ribosomes from methanogens Archaebacteria] and eubacterial ribosomes generally contain a similar number of proteins, although in Methanosarcina, Methanococcus and Sulfolobus. there are more proteins associated with the 30S subunit <85). The number of acidic ribosomal proteins tends to be higher in archaebacterial ribosomes and this seems to correlate with the cytoplasmic concentrations of ions in halobacteria and some methanogens (6!) The ribosomal ft proteins of £L thermaautotrophicum and U. show more sequence similarity to the eukaryotic A protein than to the equivalent eubacterial component LZ/LIS (63). The archaebacterial 5S rRNft — binding proteins also have more sequence similarity to the eukaryotic than the eubacterial equivalent proteins. In contrast, archaebac terial proteins uihich are similar to the eubacterial proteins S9 and S15 do not shouu any similarity to eukaryotic proteins (63> 7S RNA Archaebacteria contain large amounts of an additional stable RNA molecule, not associated with the ribosome whose function is currently unknown (79, 89). There are similarities between the sequences of the H. halohium 7S RNA molecule and the mammalian 7S-L RNA, which is a component of the mammalian signal recognition particle (89). The E, coli 6S non ribosomal RNA does not appear similar to the archaebacterial and eukaryotic 7S RNfts. Protein-encoding genes from archaetaacteria. The first protein—encoding archaebacterial gene to be studied in detail was bap from H. halohium which encodes bacterio-opsin (11, 36) The amino acid sequence of bacterio-opsin and the UNA sequences of the hup and bro (bacterio—opsin-related protein) genes have been determined (11, 36) Their comparison indicated that the halobacteria use the standard genetic code and that no intervening sequences are present in the bap gene. Subsequent studies of protein-encoding archaebacterial genes have confirmed these 12 observations. The lack of observed introns may be misleading as a generality due to cloning of genes in £_ call by demanding their functional expression in L coli. Genes cloned from M smithii <49, 50, 51, 52), CL barken <15, 49, 50), CL thermalithotr oohicns <136), CL uannielii <7, 25, 26), MethanocnccLis voltae <26, 72, 145), CL thermoautotrophicum <14, 51), and Methanobreuibacter arboriohilus <16) have been seen to direct polypeptide synthesis in E. coliA number of these genes complement L coli auxotrophic mu tat ions, his A <26, 145), his I <7>, proC <49, 50, 52), PiirF <49, 50, 51), and arg 6 <49, 50, 145), and uiith the exception of aroS cloned from CL voltae <145), the cloned DMA sequences encoding these genes also contain promoters recognized by L coli RNA polymerase. Promoter sequences recognized by £. coli RNA polymerase do not appear to be the same as the sequences recognized by archaebacterial RNA polymerase. Conserved sequences upstream of CL uannielii and CL voltae tRNA genes RNA polymerase binding sequences upstream of the CL uannielii hisA gene that are similar to the proposed promoter consensus sequence for CL uannielii tRNA genes which also binds RNA polymerase <18). Methyl coenzyme M Studies have shouun that in all cases the genes are arranged in a cluster in the order mcrB 25, ?2). DNA sequencing has located two additional open reading frames 15, 25), however the function of their products is unknown The mrr genes are thought to be transcribed as an operon into a single transcript since a probable promoter sequence exists upstream of mcrB and the translation initiation codons of the other four ORFs are located very close to the termination codon of the preceding QRF. Sequences complementary to the 3' end of 16S rRNA are found preceding all five ORFs in the mcr operons <14, 15, 25). Codon usage of the £L vannielii mcr ft ■ mcrB and mcrS genes indicates a strong preference for codons with a C in the wobble position in the codon pairs ftflC/U (asp), CACs'U The mrrC and mcrD genes within the cluster and other M. vannielii genes do not show this preference and usually have a U or A in the wobble position The codon pairs above are likely to be translated by the same tRNA with a 6 in the first anticodon position (138) It has been suggested therefore, that the presence of a C in the wobble position might ensure maximum efficiency of translation of transcripts of highly expressed genes such as those encoding methyl reductase Codon usage within the £L thermoautotrophicum and £1. barkeri mcr gene clusters is in accordance with this hypothesis <14, 15), although the preference for a U or A in the wobble position of codons in the genes of these organisms is not as strong as in M_ vannielii Comparison of protein and TNA sequences of the mcr genes from the different methanogen species should provide information on conserved structures and possible 14 functional domains Since methyl CoM reductase is common to all methanogens, comparison of sequencing data will also provide further insight into their divergence during evolution. Genes cloned from different methanogenic species that complement the same L coli mutation have been compared <26, 51). The purl genes from a thermophilic methanogen, M. thermoautotrophicum. and a mesophilic methanogen, fl. smithii are 532 identical and encode polypeptides that are 742 similar if identical and conservative amino acid substitutions are considered <51) Genes cloned from LL vannielii and tL voltae that complement an £. coli hisA mutation are 662 identical and encode polypeptides that are 762 similar if identical amino acids and conservative substitutions are considered <26). In both cases the genes were cloned from methanogenic species within the same order, tL thermautntrnohinim and tL Smithii are classified as Methanohacteriales. and tL vannielii and tL voltae are classified as Methanococcales <2) The comparative sequencing data was therefore in agreement with the proposed classification. A gene cloned from tL vannielii capable of complementing an £. coli hisl mutation has been compared to the homologous genes from JL coli. Salmonella tuphimurium. Neurosoora crassa and S. cerevisiae (7). It was concluded that all the genes were derived from a common ancestral gene and were indicative of the overall relatedness of the eubacteria, archaebacteria and eukaryotes Computer analysis of amino acid and DMA sequences. The use of computers in the analysis of amino acid and DNA sequences has increased in recent years <34). Uery simple computer programs are available that search DNA sequences for open reading frames 131) and can predict DNA and RNA secondary structures. A number of databanks are now commercially auailable that list published amino acid and DNA sequences eg. P1R (Protein Information Resource) and EMBL (European Molecular Biology Laboratory) databanks respectively. These databanks can be accessed through computer programs, such as the IBM Genepro program, that search through the sequences of the databank using any of a number of characteristics as a basis for the search A search of the databank titles can be made to identify sequences that encode proteins of similar function using key words such as hydrogenase or dehydrogenase. Searches can also be made of all available sequences from a particular organism eg. human, Uicia faba or L coli. If the identity of a sequence is known eg. in this study argininosuccinate synthetase, its name can be used to search the databank for analogous sequences isolated from different organisms. Should the identity of a sequence be unknown a computer search can compare its amino acid or DNA sequence to those of the databank and determine which sequences in the databank, if any, are similar If there is a sequence within the databank that appears similar to the unknown sequence their relationship, or lack thereof, can also be determined using available computer programs An alignment program eg DNAstar ALIGN, will compare and align two or more sequences and maximize their similarity by introducing gaps within the sequences. It is best to align amino acid sequences since DNA sequences are predicted to have at least 25% similarity by random chance as there are only four possible bases at each compared location (93). Diagon analyses of DNA and amino acid 16 sequences can also be used to determine the extent of relationships between two sequences ( 121) If a sequence is compared to itself in a Diagon analysis, the plot will indicate if internal homologies are pre' ent in the amino acid sequence or if a SNA sequence contains inverted or direct repeats. A number of computer programs are now used to predict the secondary strueture of an amino acid sequence eg the Beckman Microgenie program which uses the Chou and Fasman algorithm <22, 23, 24). There is controversy however as to the accuracy of these predictions and X-ray crystallography remains the preferred method (43, 64) Determination of the hydropathicity of an amino acid sequence is possible through programs using algorithms based on calculations made by Kyte and Doolittle <74) and Hopp and kloods (58). This can be useful in identifying likely antigenic determinants of proteins. Hydrophilic regions of an amino acid sequence are more likely to be located on the exterior of a folded protein and therefore presented as an antigen Antibodies raised against a synthetic peptide, synthesized with the same hydrophilic sequence of amino acids, would therefore be likely to recognize and bind to the complete protein. Computer programs are available which predict evolutionary relationships between DNA or amino acid sequences and hence between the organisms from which they originate A program developed by Lake (75) determines an unrooted tree for four DNA or RNA sequences by the method of evolutionary parsimony. The PAUP program, developed by Swofford, can use DNA, RNA or amino acid sequence alignments to determine the most parsimonious tree (rooted or unrooted) for four or more taxa (44, 126) In the latter program, different weightings can be chosen by the researcher to compensate for conservative 17 amino acid substitutions, transitions verses transuersions in SNA sequences, single event deletions etc , allouiing more flexibility in the analysis of the data Overall therefore, the extent to which a sequence can be analyzed has recently increased rapidly as more advanced computer programs have become available Argininosuccinate sunthetase genes and pseudogenes. Previously me showed that a gene cloned from H barken ivas capable of conferring arginine independence to £. coli X760. The data indicated that the DNA encoded argininosuccinate synthetase, an enzyme which catalyzes the penultimate step of arginine biosynthesis in £. coli: L-citrulline -*■ L-aspartate ♦ ATP ------> AMP + PPi + ar gininosuccinate 6enes encoding ar gininosuccinate synthetase (argG) have also been cloned from d. voltae <145), L coli (28>, Streptomuces cattleua <124), tL crassa <31), cerevisiae (31) and human (13), however only the human cDNA clone has been sequenced completely <13). The human genome contains one expressed arginmosuccinate synthetase gene and approximately 14 pseudogenes that are dispersed on at least 11 chromosomes (5) The pseudogenes are not expressed in human cells. There are at least 13 exons contained in the expressed gene which spans a total of 63 Kbp, 7 of the intron/exon boundaries have been determined <39, 48) The pseudogenes have presumably resulted from reverse transcription of the ar gininosuccinate synthe tase mRNA and insertion of the resultant cDNA back into the human chromosomes. Some of the ar gininosuccinate synthetase pseudogenes appear to have resulted from duplication of other argininosuccinate synthetase pseudogenes (94). A number of the 18 pseudogenes have been cloned, sequenced and compared to the human argininosuccinate synthetase cDNA sequence which provides an indication of their evolutionary relationships <62, 94> In this study, an additional argfi gene was cloned, from M. vannielii The sequences of the tL vannielii and tL frarkfeTJ argS genes were determined and compared. This represents a comparison of homologous sequences from species of different orders of methanogens, the Methanococcales and Methanomicrobiales respectively <2> Comparisons to the human cDNA sequence and two pseudogenes were made to construct an evolutionary tree for argininosuccinate synthetase METHODS Srouith of cells E, coli and D subtilis strains (Table I) were grown in LB broth as rich medium which contains 10g tryptone, 5g NaCl and 5g yeast extract per liter H^O, adjusted to pH 7 0 with 10M NaOH, 15g of agar was added per liter to solidify the medium for plates. Stock solutions were used to produce final antibiotic concentrations of 100 jug ampicillin/ml (AMP), 5 jug kanamycin/ml (KAN) and 30 jug chloramphenicol/ml (CAM) Determination of complementing ability used plates containing M9 minimal medium (32)- NagHPO^ 6 g/L KHgPO^, 3 g^l, NaCl, 05 g^l; NH4CI, 1 g-^1; CaClg, 01 mM; Mg SO 4, 10 mM; glucose, 0.4/i ui/vj amino acids, 40 jug-'ml; vitamins, 1 jig-'ml, agar, 15 g^l. Clones containing pHE3-based recombinant plasmids transformed into £. coli RR28 cells were selected on M9 minimal medium containing 200 jug para-fluorophenylalanine/ml (PFP) and 30 uig CAM/ml (55). Isolation of methanogen DNA (49). Frozen, pelleted cells of Methanosarcina barker! strains MS and 227 were crushed for 10 min, with a pestle, in a mortar pre-cooled with liquid nitrogen. Repeated additions of liquid nitrogen to the cells aided their disruption. Cell debris was transferred to a screw-capped test tube containing 10 ml TE buffer <10 mM Tris.HCl pH 85, 1 mM EDTA), TE-equilibrated phenol 19 SB Table 1 Microorganisms used in this studu. Mama Sauxt* Genotype Mathanosarcina barken MS M.P Bryant mild type Methanasarcina barken 227 M.P Bryant uiild type Methanococcus uannielii ft Stadtman mild type Q X 7 60 R Curtiss III iina-1, leu-1, an , tonftR. iAtY2, exoC119, tsx. purE 1. yaiK2, trp3. his ft 4 . ar_g636, CEil. xnl-1. mtl - 1. ili/06, thi-1, met -12 Fifhprichu coli DS418 J H Reeve jDlDfl, rnmB. Escherichia cnli RR28 H Hennecke F~, thi-1, RCUP2, leuB6. lacVl. g*J.IC2, ara-14. mil-1. a.yi-5, A**, suoE44. hsrlR ” pbeSl?. recfl!3 Latiieru-lna coli GM272 CBSCN 6478 dcm-6. dam-3. metBl, hidS21, lacVl or lac?4. thi-1. oalKg. mtl-g. tonP2 or tonft3l. sup E44 tsx -1 or tajt-78, X® Escherichia coli GM2L63 CGSCi 658) ara-14. leu06. tonOl3, lacVl. tSjt-78< sup E44 , aalKP. dcm-6. his64. ru tl3 E , dam-13: Tr>9. X®, isil-5. mtl-1. thi-1. h sdRg Escherichia coli JM105 J Messing thi. CJtsL, endft , shcBlS. hsdR4. Adac-croflB). CF', traD36. prodB. laclq7^.M153 Esthenciua cob JP221 J Carbon hsdR. hidM, tne^tES, lfuB. recft Bacillus subtilis MB97 S Baumberg ar0 0 2 . leufll Bacillus subtilis IE6 0 Dean tCJ*C2, the-5 Bacillus subtilis CU4B3 JN Reeve thvifl. thy B. metB. dm IU 01 21 <10 ml> added and mixed gently uiith the lysate on a rotary shaken for 30 mm. The mixture uuas transferred to a 30 ml Corex tube and the phases separated by centrifugation at 1000g for 5 min in the SS34 rotor. The upper aqueous phase was transferred to another screw-capped test tube, the phenol extraction repeated twice, the final aqueous layer transferred to a beaker, ls 9 volume of 3M C^COONa and 25 volumes of cold 95K ethanol added to precipitate nucleic acids which were harvested by centrifugation at 12,000g, 4°C for 30 min in the SB34 rotor The pellet was redissolved in a minimal volume <<15 ml) of TE and treated with bovine pancreatic RNase <50 m g/'ml) for 30 min at 37°C. 7 5 at a concentration of 1 mg/'ml and heated at 80°C for 15 min to inactivate DNases) The phenol extraction and ethanol precipitation steps were repeated. The DNA was finally dissolved in a minimal volume < Large scale isolation of Plasmids from E_ coli E. coli cells, containing the desired plasmid, were grown at 37°C in 500 ml LB containing AMP <100 jig/ml) to an A580=0 6 CAM was added to a final concentration of 200 U 9 / mt and incubation continued overnight. Cells were harvested by centrifugation in a GSA rotor at 5,800g, 4°C for 10 min, washed by resuspension in 10 ml 25Z ui/v sucrose, 50 mM Tris HC1 8 0), transferred to a SS34 polypropylene tube and centrifuged at 4300g, 4°C for 10 min Cells were then resuspended in 3 ml 22 25% ui/w sucrose, 50 mM Tris.HCl, the mixture placed on ice for subsequent additions, 0 6 ml of lysozyme <10 mg/ml) added and the mixture incubated at 4°C for 5 min EDTft (16 ml, 250 mM) uias added, incubation continued for 5 min at 0°C, 4 8 ml of lytic mix <255i w/v Triton X-100, 50 mM EDTA, 50 mM Tris.HCl pH 8.0) added slowly and the mixture swirled gently. Complete lysis was obtained by a 20 min incubation at 0°C. The crude lysate uias spun at 27,000g for 30 min at 4°C to pellet cell debris and chromosomal DNA The cleared lysate supernatant containing the plasmid DNA, was carefully decanted into a beaker and the volume measured. An addition of lg CsCl uias made for every 1 ml of cleared lysate and the solution mixed to dissolve the CsCl. The volume was remeasured and 0.8 ml of ethidium bromide solution <10 mg/'ml) added for every 10 ml of CsCl solution, bringing the final density to approximately 1.55 g/'ml. The solution was transferred to polyallomer UTi 65 centrifuge tubes, expelling all air. These tubes were heat sealed, capped and spun in a UTi 65 rotor, in a Beckman model L8-70 ultracentrifuge for 16-20 h at 20°C at 180,000g Plasmid and chromosomal DNA bands were visualized by illumination with ultraviolet light <300 nm) The top of the tube was pierced with a syringe needle and the lower plasmid band recovered by inserting a 21 gauge needle, attached to a 1 ml syringe. Just below the band and withdrawing the syringe plunger An equal volume of isopropanol, saturated with an aqueous solution of 5M NaCl, 10 mM Tris.HCl (pH 8.5) and 1 mM EDTA, was added to remove the ethidium bromide from the plasmid DNA. The mixture separated into two phases and the upper phase was discarded. Repeated extractions were made until all color had been 23 removed from the lower, DNA-containing phase Two volumes of sterile water and six volumes of cold 95/i ethanol were added and the mixture placed at -20°C overnight to precipitate DNA A white DNA pellet, obtained by centrifugation at 4°C, 12,000g for 30 min in the SS34 rotor, was washed once with cold 70/i ethanol, dried and dissolved in 100-200 jul TE Large scale isolation of Plasmids from B. sub tilts. B subtilis cells were grown in 4x 300 ml LB or M9 minimal medium plus KAN (5 ug/'ml) at 37°C, until an A580 of 0 5 was reached Cells were harvested by centrifugation in a GSA rotor at 8,000g, 4°C for 20 min, washed by resuspension in 100 ml cold TES <30 mM Tris HC1 pH 8 0, 50 mM NaCl, 5 mM EDTA) and centrifuged in the 6 SA at 8,000g, 4°C for 20 min The washed cells were resuspended in 20 ml TES containing 20/i w/v sucrose and 1 mg lysazyme^ml and incubated with shaking at 37°C for lh to produce protoplasts. TES containing 8/i ws'u SDS <20 ml) was added slowly to induce lysis and 20 ml aliquots then transferred to 12 x 50 ml polypropylene tubes NaCl <5M) in TES <5 ml) was added to each aliquot, the mixture incubated for 5 min at 6 8°C, 4°C for at least 8h, and then loaded into a pre-cooled SS34 rotor at 4°C and centrifuged at 39,000g for 30 m»n The resulting supernatants were decanted into four GSA centrifuge bottles. Two volumes of cold 95/i ethanol were added and the bottles placed at -20°C for lh DNA was precipitated by centrifugation in the GSA rotor at 16,000g, 0°C for 30 min, the ethanol decanted and the pellets dissolved in 24 25 ml TES Aliquots <12.5 ml) uuere transferred to 125 ml Erlenmeyer flasks and mixed with an equal volume of phenol equilibr a ted with TE The mixtures were t ransf erred to 30 ml Corex tubes and incubated on ice far 15 min with gentle shaking T wo phases were separ a ted by centnf ugation in the SS34 rotor at 0°C, 12,000g for 10 min, the upper aqueous phase removed and transferred to 125 ml Erlenmeyer flasks and an equal volume of chlorof ormisoamylalcohol (241) mix added The mixtures were transferred to 30 ml Corex tubes, incubated on ice for 15 min with gentle shaking and the two phases separated by centrifugation as described above The upper aqueous phase was removed to 30 ml polypropylene tubes, two volumes of cold 95X ethanol were added and DNA precipitated by incubation at -20°C for at least lh. The DHA was sedimented by centrifugation in the SS34 rotor, at 4°C and 27,000g for 20 min, dried and then dissolved in TE for CsCl-EtBr centrifugation in the UTi 65 as described above (see Large scale isolation of plasmids from £. coli.) Small scale isolation of plasmids from E £.ali-4tfld fl—subtilis C82). Cells were grown overnight at 37°C in 5 ml of LB plus the appropriate antibiotic. Aliquots (15 ml) were transferred to Eppendorf tubes and centrifuged for 1 mm in an Eppendorf model 5412 centrifuge (12,000g), the supernatants discarded, an additional 1.5 ml aliquot added to each tube and the centrifugation repeated A pellet obtained from a total of 3 ml of suspension, was resuspended in 100 jul of 50 mM glucose* 10 mM EDTA* 25 mM Tris.HCl (pH 8 0) containing 2 mg lysozyme^ml. 25 This mixture uias incubated on ice for 30 min,, 200 u 1 of 0 2N NaOH containing i/i uj/u SDS unarmed to 56°C added, the tube inverted to mix the solutions and placed on ice for a further 5 min of incubation. To this mixture 150 jul of 3M CH3COONa 4.8) uias added, the solution mixed and the incubation continued for lh on ice. The crude lysate uias centrifuged for 5 min in the Eppendorf centrifuge ( 12,0009 ) to pellet cell debris The supernatant uias transferred to a clean tube for DNA precipitation by addition of tuio volumes of cold 9 57i ethanol and 15 min incubation at -70°C Tubes mere centrifuged for 5 min in the Eppendorf centrifuge to pellet DNA, uihich uias mashed once uiith cold 70/i ethanol and dried. Each pellet mas dissolved in 50 jul TE containing 1 pg RNase A/ml A 10 jul aliguot of DNA solution mas normally sufficient for visualization of plasmid DNAs folloming electrophoresis through horizontal 0.7/i ujs'v agarose gels (see belom) Preparation and transformation of competent cells of E coli (9. 7fiX Cells mere grouin in 580 ml of LB at 37°C until an A580 of 0.2 mas reached, then chilled by incubation for 20 min on ice. All subsequent steps mere belom 4°C Cells mere harvested by centrifugation in a 6 SA rotor at 7,500g, for 15 min, resuspended in 200 ml 100 mM CaClg and incubated for 20 min Cells mere sedimented by centrifugation at 7,500g for 15 min. Resultant cell pellets mere resuspended in 5 ml 100 mM CaCl^ and incubated for 20-24h Cold glycerol mas added (final concentration I05i m/'v) and aliquots (500 jul) transferred to 1.5 ml Nunc (Uangard International, Neptune, Nem Jersey) storage 26 tubes and stoned frozen at - 70°C. For transformation, aliquots mere thauied on ice and 100 M 1 of competent cells added to transforming DNA <1 jug in 20 .uI) in 1.5 ml Eppendorf tubes The tubes uuere placed on ice for 10 min, incubated at 37°C for 2 mm and the cells then inoculated into 2 ml LB. Cultures were grown for l-2h at 37°C with gentle shaking to allow expression of plasmid-encoded genes Aliquots <100 Ml) were plated from the undiluted cultures and from serial dilutions onto LB plus antibiotic or MM-argimne selection of transformants Plates were incubated at 37°C ouernight Preparation and transformation of competent cells of B. suh tills Cells were plated for confluent growth on TBAB plates (S g NaClzl, 8 g agar/1, 17 5 g tryptose blood agar basest) and incubated overnight at 37°C. The resultant lawn of cells was suspended in 20 ml HS medium Cper 100 ml: 66.5 ml H 20, 10 0 ml 10X S-Base <20 g g Na citrate.2H2(Vl), 0 1 ml 100 mM MgSG4, 2.5 ml 20/i ui/y glucose, 0.5 ml 1 mg tryptophan/ml, 1.0 ml SZ casamino acids, 5 0 ml 2Ji yeast extract, 10 ml QZ w^v arginine and 4 Y. w/ v histidine!?, to give a final concentration of l-3xl0 7 cells/ml The suspension was incubated in a 250 ml Erhlenmeyer flask with vigorous agitation at 37°C until the culture approached the stationary growth phase 61gceral was added to 55i v tubes and stored frozen at -70°C. An aliquot (1 ml) of frozen cell suspension was allowed to thaw and added to 20 ml LS medium, 2 ml 10X S-Base, 0.02 ml 100 mM MgSO^, 0.5 ml 20/i w/'v glucose, 0.02 ml 5 mg tryptophan/ml, 0 1 ml 22 w^v casamino acids, 1.0 ml 22 w/u yeast extract, 0.1 ml 50 mM spermidine, 0.5 ml 100 mM MgClg, 15 66 ml HjjO The cell suspension was incubated at 37°C and 200 ul aliquots removed at 15 min intervals after an initial lh period of growth These aliquots were added to ice-cold solutions of transforming DNA dissolved in 100 Mi SSC <4 41 g Na citrate/1, 8.77 g NaCl''l>. After mixing, the cells plus DNA were incubated at 37°C for 30 min, LB (2 ml) added, growth allowed at 37°C for 2h, 100 m 1 aliquots spread on selective plates and the plates incubated overnight at 37°C Each batch of cells was so assayed to determine the time after inoculation into LS medium transformation. The minicell producing strain, Jl. subtilis CU403, required the addition of thymine <20 jug^ml) and methionine <20 jug^'ml) to all growth media. R e s tr ic tio n and liaa.ti.QnL o f plasm id JHdiL Digestion of DNA by restriction enzymes used the buffers recommended by the commercial suppliers In each case 1 unit of enzyme per jug of DNA was used and reactions incubated at the appropriate temperature for t-2h. Treatment at 6 8°C for 10 min terminated most enzyme digestions and sep ar a ted cleaved 28 sites, eliminating apparent partial digestion products due to non-disjunction of fragments Preparations of digested DNA were visualized following agarose or polyacrylamide gel electrophoresis (see below) Ligation of sticky-ended DNA in ligase buffer <10 mM Tris.HCl pH 7.5, 10 mM MgCle, 50 mM NaCl, 1 mM ATP, 10 mM p-mercaptoethanol) used T4 ligase at a concentration of 0.2 units per jug of DNA Incubations lasted 2-12h at 17°C Blunt end ligations used the same buffer plus 2 units T4 ligase per jug of DNA for 2-12h at 14°C The products of ligations were visualized following agarose gel electrophoresis (see below) Agarose and polyacrylamide gel electrophoresis. Electrophoresis of DNA in agarose used horizontal slab gels of 0.7X w^v agarose in TA buffer (40 mM Tris MCI pH 8 0, 0.2 mM CH3C00Na, 2 mM EDTA) An electrophoresis chamber was sealed with tape, the well forming comb placed in position and 100 ml of boiled agarose solution dispensed into the chamber Hhen the gel had solidified the tape was removed, the gel immersed in TA buffer to cover the gel surface and the comb carefully removed. One tenth volume of tracking dye <105i w-'v sucrose, 0.25X bromophenol blue) was added to each DNA sample before loading into the gel. Electrophoresis at room temperature was carried out at 100U for 3h or 20U overnight Following electrophoresis the gel was removed and DNA in the gel stained by immersion of the gel in ethidium bromide solution (1 ,ug''ml in TA) for 15 min. The DNA was visualized by UU transillumination (300 nm) and photographed using a Polaroid MP-4 Land camera, Kodak Wratten gelatin filter 23A and Polaroid 667 film. Exposures were at f 16 for 15 sec 29 Separation of DNA fragments less than 1000bp in length required electrophoretic separation through polyacrylamide gels Gels of 20?i, 85i or 5 V. w/v polyacrylamide uiere run in TBE buffer <89mM Tris borate, 89 mM boric acid, 2mM EDTA pH 8 0) Glass plates <24 x 16 cm) were arranged with spacers to form a pocket, clamped and the edges sealed with 2 .V, agar Acrylamide solution <50 ml) was de-gassed prior to the addition of 30 u 1 N,N,N ,N ,-tetramethylethylenediamine (TEMED) to initiate polymerization The solution was poured between the glass plates, a comb inserted, polymerization allowed TA) for 45 min and then the gels destained in water for 10 min. DNA bands were visualized as described aboue. Photography used Polaroid 665 film at f;5.6 for 15 sec Minicell preparation and radioactive labeling of Plasmid-encoded polypeptides C106X The minicell producing strains, £ coli DS410 and £. subtilis CU403, were transformed with the appropriate plasmids DS410 was grown overnight at 37°C in 150 ml LB plus AMP <100 jug^ml) or CAM <30 jug^ml) CU403 was grown overnight at 30°C in 150 ml M9 minimal medium supplemented with 0.4/i w^u glucose, 20 30 jug thymine/ml, 20 jug methionine/ml and 5 m 9 KflN/ml Cells and minicells mere harvested by centrifugation at 0,000g at 4°C for 10 min in a 6SA rotor Pellets mere thoroughly resuspended in 3-5 ml M9 medium and the suspension layered on tap of 30 ml 10—33% m/v sucrose gradients in M9 salts, made in SW2? cellulose nitrate ultracentrifuge tubes using a gradient maker. The gradients uiere centrifuged at 5,000g at 4°C for 20 min in a HB-4 rotor fitted with plastic adaptors to prevent the tubes from collapsing. Minicell bands were removed by insertion of a 21 gauge needle, fitted to a 10 ml syringe, just above the bottom of the mimcell band and withdrawal of the minicells into the syringe. (The lower and upper portions of the minicell band are not taken as they contain parental cells and lysed cells, respectively, mixed in with the minicells) Minicells were removed from suspension in the sucrose solution by centrifugation at 20,000g at 4°C for 10 min in a SS34 rotor The pellet was resuspended in 1 ml of M9 medium, layered on a second 30 ml sucrose gradient, centrifuged and the minicell band removed from the gradient as before. The minicells were pelleted and checked for parental cell contamination by phase contrast microscopy If contamination levels were acceptable, 1 parental cell per 10^~^ minicells, the absorption of the minicell suspension was measured at AS80 and the suspension used to label plasmid-encoded polypeptides radioactively during their synthesis in the minicells if) For labeling, minicells were resuspended (2x10 /ml) in M9 minimal medium supplemented with 0.4% w/v glucose, 20 ug D-cycloserine/ml and 1/10 volume of methionine assay medium. For each 100 jil of suspension, 05 jul L-(^^S)-methionine (970 31 Ci/mmol> uias added and the mixture incubated at 3?°C for lh. The labeled minicells mere pelleted by centrifugation in an Eppendorf centrifuge <12,000g) for 15 min, resuspended in 50 ml of SDS-PA6E sample buffer <0 125 ml 0 1M Tris HC1 pH 6 8. 0 3 ml 10JS uj/w SDS, 0 2 ml 502 v/w glycerol. 0.05 ml 0-mercaptoethanol per ml) per 100 jul of the original minicell suspension and incubated at 100°C for 3 mm. Similar trichloroacetic acid 3-oxazole 2 mm in developer, 30 sec in water and 5 min in fixer The film was washed in water for 30 min and allowed to dry Nick Translation (.32). Approximately 1 jig of the plasmid DNA to be labeled, dissolved in 15 jul TE, was mixed with 2.5 jul of 10X NT reaction buffer <0.5 M Tris HC1 p H 7.5, 0 1 M MgS04, 10 mM DTT and 500 jug BSA/ml) and 2.5 jul of a solution containing 0.2 mM dCTP, 0 2 mM dTTP and 0 2 mM d 6TP in a 1.5 ml Eppendorf tube. A solution of 1 mg DNase I/ml in 50 mM Tris HC1 pH7 5, 10 mM MgS04, 1 mM DTT and 502 u/v glycerol was diluted 1/40,000 in NT buffer and 0 5 jul of the diluted solution was added to the ■ » P nick translation reaction Additions of 5 jul of ac-< P>-dATP <50 juCi) and 1 jul of DNA polymerase I <4 units) were made and the reaction incubated at 14°C for 3h The reaction was 32 terminated by adding 23 j_il of stop solution (0.02 M EDTA, 2 mg salmon sperm DNA/'ml and 2% w^v SD5) and diluting to 0 5 ml with TE The reaction mixture uias loaded onto a Sephadex G-50 column (0 7 cm x 20 cm), pre-equilibrated with TE, and subjected to column chromatography using TE as the running buffer Fractions <0.25 ml) were collected and 2 m 1 aliquots of each fraction spotted onto 1 cm^ pieces of 3MM Whatman paper, the papers placed in scintillation vials and the r adioac tiuity in each vial counted in a Beckman model LS7500 scintillation counter Column fractions found to contain labeled DNA were pooled and used as the probe material in hybridization experiments. Radioactiue labeling of oligomer orohes Approximately 30 picomoles of oligomer DNA were added to 3 ill 10X kinase buffer (0 5 M Tris.HCl pH 9 5, 100 mM MgClg, 50 mM dithiothreitol, 50Z w^v glycerol), 1 j j . 1 u *^P-ATP <100 juCi), 1 jul T4 polynucleotide kinase <30 units) and water to bring the total volume to 30 jul The solution was incubated at 37°C for lh, 10 mM Tris.HCl pH 8.0 added to giue a final volume of 100 jil ready to load on a Sephadex-G50 spin column pre-equilibrated with 10 mM Tris HC1 pH 8 0 The spin column was prepared in a 1 ml plastic tuberculin syringe stoppered with glass wool The syringe was suspended inside a 15 ml tissue culture tube and filled with equilibrated Sephadex-650 The column was packed by repeated spins at 500g for 20 sec in a table top centrifuge until a total bed volume of 1 ml was achieved and then washed six times with 10 mM Tris.HCl pH 8.0 for 20 sec at 500g Some more 10mM Tris.HCl buffer was added, the column spun at 500g for 3 min, the 100 jil radioactiue sample loaded and the column 33 respun at 500g for 3 min The eluate uias collected into a lidless Eppendorf tube placed in the bottom of the tissue culture tube. One uiash uuith 100 jul 10 mM Tris.HCl pH 8.0 at 500g for 3 min eluteci the radioactively labeled oligonucleotide probe in the Eppendorf tube but left unincorporated nucleotides in the column A 2 jul aliquot uias used, as described above, to determine the amount of radioactivity incorporated into the probe DMA-DNA hybridization with dehudrated agarose gels <32) DNA fragments produced by restriction endonuclease digestions uiere separated through 10/i ui/v agarose gels in TA buffer. The DNA fragments uiere stained by immersion of the gel in a solution of 1 jug ethidium bromide/'ml in TA and photographed using UU transillumination. The gel was soaked for lh in 50 mM NaOH at room temperature to denature the DNA, rinsed with distilled water for 30 min, transferred to a sheet of 3MM Whatman paper which was placed on a Hoeffer model SE-1140 gel drier The gel was dehydrated by running the drier without heat for 30 min, with heat for 30 min, and without heat for a further 15 min. The dried gel and Whatman paper were separated from each other by flotation on distilled water The gel was then transferred to a heat-sealable plastic bag to which pre-hybridization solution CSX SSC, 58/i w/'u formamide, IX Denhardt’s reagent, 50 mM Na^HPC^ buffer pH 6 5) and 1 mg denatured salmon sperm DNA/'ml was added The bag was sealed and incubated at 42°C for 16h The pre-hybridization solution was replaced with hybridization solution <5X SSC, 50X ui^v formamide, IX Denhardt’s reagent, 20 mM Na^HPO^ buffer pH 6.5, 0.5 mg denatured salmon sperm 34 DNA/'ml and 18X wv'v SDS), heat-denatured, radioactiue probe uias added, the bag resealed and incubated at 42°C for a further 4-16h. The gel uias mashed twice uiith 2X SSC plus IX Oenhardt's reagent, twice with 2X SSC plus 0,1% uu^u SDS and twice with 8 IX SSC plus IX w^v SDS All washes were at 42°C for 15 min. Excess liquid was removed, the gel wrapped in Saran Wrap and then used to expose X-ray film inside an intensifying screen cassette at -70°C DMA-ORA hybridization with Zata-Probe membrane DMA fragments produced in restriction endonuclease digestions were separated through either 1.0X uj/w agarose gels in TA buffer or 0.5 X ui/w polyacrylamide gels in TBE buffer DNA fragments were stained by immersion of the gel in a solution of 1 jug ethidium bromide^ml in TA and photographed using UU transillumination The gel was soaked for 38 min in 0 2N NaOH, 0 5M HaCl for 30 min at room temperature to denature the DNA After neutralization by three 10 mm washes in TA buffer the gel was placed on a pre-soaked wick of 3MM Whatman paper drawing TA buffer from a reservoir Zeta-probe membrane, cut to the size of the gel and pre-soaked in TA buffer for 5 min, was placed on top of the gel and covered with two layers of pre-soaked 3MM Whatman paper Dry paper towels were placed on top to draw the TA buffer by capillary action and transfer the DNA to the zeta-probe membrane Transfer was allowed to proceed overnight, the zeta-probe membrane air dried and then baked for 2h at 80°C The baked membrane was transferred to a heat-sealable bag, pre-hybridization solution <5X SSC, 50X w^v formamide, 5X 35 Denhardt's reagent, 0.1% iu/w SDS, 258 iig denatured calf thymus DNA/'ml, 25 mM NagHP04 buffer pH 6 5) added and the bag sealed Approximately 0 1 ml of pre-hybridization mixture uias used per square cm of membrane After incubation <42°C for 12-14h) the pre-hybridization mixture uuas replaced by an equal volume of hybridization solution <5X SSC, 50% w-" v formamide, 5X Denhardt's reagent, 0 1% SDS, 250 Mg denatured calf thymus DNA-'ml, 25 mM NagHPO^ pH 6 5 containing denatured probe at final 2-3x10** counts/ml), the bag resealed and incubated at 42°C for a further 20-24h The membrane was washed with agitation four times, for 10 min in 2X SSC, 0 1% w^u SDS at room temperature, excess liquid removed, the membrane wrapped in Saran Wrap and used to expose X-ray film. A more stringent wash of 0.1X SSC, 0 1/1 uj/u SDS at 3?° — 42°C for 30 min was used if the signal to noise ratio was unacceptably low Maxam and Gilbert DMA sequencing <06) DNA <30 M9 > was digested with the appropriate restriction enzymes under the conditions specified by the enzyme supplier One tenth volume of 1 M Tris.HCl pH 7.9 and 8 units of calf intestinal alkaline phosphatase were added and the mixture incubated at 56°C for 30 min The reaction was halted by addition of 200 m 1 8 45M CH^COOHH^ deprateinized twice by extraction with 200 m 1 TE-equilibrated phenol and once by extraction with 200 m 1 chloroformisoamylalcohol <24 = 15 The dephosphorylated DNA was precipitated by adding 750 ul of cold 95/i ethanol, vortexing and placing the mixture at -70°C (dry-ice ethanol bath) for 5 min DNA was pelleted at 12,000g for 5 min 36 in the Eppendorf centrifuge, the supernatant removed with a Pasteur pipet, 1 ml cold 70/i ethanol added, the tube gently inverted, replaced at -70°C for 5 min and recentrifuged at 12,000g for 5 min The supernatant uias discarded, the pellet dried under vacuum, redissolved in 20 jul water plus 20 jul 2X denaturation buffer <1M Tris HCl pH 9 5, 10 mM spermidine, L mM EDTA) and the solution heated at 65°C for 15 min and quickly chilled on ice 6 mI of 10X kinase buffer (0.5M Tris HCl pH 9 5, 0 1M MgClp, 50mM DTT, 50X glycerol), 1 jul tf 32P-ATP <180 m CO, 1 jul T4 polynucleotide kinase <30 units) and 12 Ml of water uuere added and the mixture incubated at 37°C for lh The reaction was stopped by addition of 25 M1 4 5M CH 3COONH4 and 200 jul water The radioactively labeled DNA was deproteinized, precipitated and dried under vacuum as described above. The DNA pellet was redissolved in the appropriate buffer, 30 units of restriction enzyme added, the mixture incubated at the appropriate temperature for 4h, 1/10 volume of tracking dye <0.25X w/v bromophenol blue, 0.25X w/v xylene cyanol, 15Z ficoll) and 1/10 volume 882 w/v glycerol added and the labeled fragments separated by electrophoresis through a 55! w/v polyacrylamide gel (55! w/v acrylamide, 0.2% w/v N,N' -methylene-bis-acrylamide, 25/i w/v glycerol in TBE> Following electrophoresis, the gel was marked with radioactive ink, wrapped in Saran Wrap and used to expose X-ray film for 2 min Exposed regions on the X-ray film that represented the desired DNA fragments were cut from the film with a razor blade. The Saran Wrap was removed from the gel, the film repositioned using the radioactive ink as a guide, and a razor blade used to cut the gel through the guide holes in the X-ray 37 film. The desired gel slice uias removed using a spatula, chopped into small pieces and transferred to a 1000 jul pipetman tip sealed at its tip and plugged with glass uiool DNA was eluted from the polymerized acrylamide by addition of 500 jul elution buffer <500 mM CH 3COONH4, 10 mM (CT^COCD^Mg, i mM EDTA, 0.1 H w^v SDS) sealing the tip with parafilm and incubating overnight at 4£°C The DNA solution was removed from the tip by piercing the tip with a hot needle and centrifuging the tip in a test tube in a table top centrifuge at 500g for 1 min The contents of the tip were washed with 100 jul elution buffer and respun at 500g for 30 sec The total eluate was transferred to an Eppendorf tube containing 800 jul cold 95/i ethanol, the tube was vortexed, chilled at - ?0°C for 5 min and centrifuged at l£,000g for 5 min The supernatant was discarded, the pellet washed with 1 ml cold 70Ji ethanol, dried under vacuum, redissolved in 50 jul TE, extracted twice with 50 jul TE-equilibrated phenolchloroform (30:50), once with chloroform:isoamylalcohol (24:1), reprecipitated with cold 95!i ethanol, washed three times with cold 70/i ethanol and dried under vacuum. The labeled DNA was dissolved in £0 jul water and subjected to the four base-specific chemical cleavage reactions: 6 reaction, cleavage at guanines, G + A reaction, cleavage at purines, C + T reaction, cleavage at pyrimidines, and C reaction, cleavage at cytosines. The reactions were performed in 15 ml Eppendorf tubes. B reaction. £00 jul of DMS buffer (50 mM sodium cacodylate pH 8.0, 1 mM EDTA), 4 jul of end-labeled DNA, 3 jul of carrier DNA <1 mg sonicated salmon sperm DNA/'ml) and 1 jul dimethyl sulfate were 38 combined. The reaction uias incubated on ice for 8 min, stopped by addition of 50 ,ul DMS stop buffer <1 5M CH 3CO ON a pH 7 0, 1M p-mercaptoethanol) followed by 750 jul cold 95Ji ethanol The tube was wortexed, placed at -70°C for 5 min, centrifuged at 12,000g for 5 min and the supernatant removed 6+A reaction. 12 jul of water, 3 jul of carrier DNA, 6 jul of end-labeled DNA and 2 Ail of 1 M piperidine formate <4Ji v ' v formic acid adjusted to pH 2.0 with piperidine) were combined The reaction mixture was incubated at 37°C for 8 min, the reaction stopped by freezing at -70°C and the sample dried under vacuum. C + T reaction. 12 jul of water, 3 Ail of carrier DNA, 6 Ail of end-labeled DNA and 30 Ail hydrazine were combined. The reaction mixture was incubated on ice for 8 min and then stopped by the addition of 200 jul hydrazine stop buffer <0.3M CHjCOONa, 0 1 mM EDTA), followed by 750 jul cold 955; ethanol The reaction mixture was vortexed, placed at -70°C for 5 min, centrifuged at 12,000g for 5 min and the supernatant removed. L. r.eat:tion. 15 jul of 5M NaCl, 3 Ail of carrier DNA, 4 Ail of end-labeled DNA and 30 jul hydrazine were combined The C reaction was then treated in the same manner as the C + T reaction DNA pellets from the 6 reaction, C + T reaction and C reaction were dissolved in 250 jul 0.3M CH^COONa and vortexed vigorously, 750 m 1 of cold 95Z ethanol added, the tubes vortexed, chilled at -70°C, centrifuged at 12,000g for 5 min and 39 the supernatants discarded The pellets mere washed mith cold 705i ethanol, the ethanol removed and the DNA pellets dried under vacuum Strand cleavage reaction. The contents of all four reaction tubes mere treated identically for the remaining steps. DNA pellets mere mashed mith 20 jul mater, redried under vacuum, redissolved in 100 pi of 1M piperidine, incubated at 90°C for 30 min and then dried under vacuum mith heat to remove the piperidine and recover the DNA The pellets mere mashed tmice mith 10 jul mater and redried under vacuum. Tracking dye (80% m^v formamide, 10 mM NaOH, 1 mM EDTA, 0 1% m^u bromophenol blue, 0 1% m^v xylene cyanol) mas added to the pellets to obtain similar counts/pl. The mixtures mere incubated at 9B°C for 3 min, quick chilled on ice and 3 jul aliquots loaded on 20/i or 6% m/v polyacrylamide sequencing gels. Polyacrylamide gels (6% uj/v- 5 7Z acrylamide, 0 3Z N, N' ,-met hylene-bis-acrylamide or 20/i ui^v- 19% acrylamide, l/£ N, NJ,-methylene—bis — acrylamide) in TBE and urea (89mM Tris.borate pH 8 1, 2mM EDTA, 8.3M urea) mere cast betmeen a 35 5 x 44 cm glass plate and a 35.5 x 46 cm glass plate. The glass plates mere separatedby 0.8 mm spacers and clamped to form a pocket. After polymerization, the plates mere clamped into an electrophoresis chamber Electrophoresis mas carried out using TBE buffer and 80 matts until the dye front had travelled a sufficient distance After electrophoresis, one glass plate mas removed, the gel transferred to a used X-ray film, covered mith Saran ktrap and exposed to a X-ray film at -70°C using an intensifying screen 48 M13 sequencing 1110). Plasmid DNA and M13mpl8 RF (or M13mpl9 RF) DNA were digested with the appropriate restriction enzymes under conditions specified by the enzyme suppliers and then ligated as described above Competent JM105 cells (100 jjl) were added to the mixture, incubated on ice for 45 mm, transferred to 37°C for 15 min, added to 3 ml molten 0 5^ ui/o B agar (10 g tryptone^l, 8 g NaCl/1, 1 ml thiamine Bl/1), 10 u 1 100 mM isoprapylthiogalactaside (IPT 6 >, 50 Ml 2/i w/v 5-bromo-4-chloro- 3-indolyl- galac topyr anoside in dimethylformamide (X-gal), mixed and poured onto B agar plates <15?; ui/'v agar) Turbid blue or white plaques were visible after incubation overnight at 37°C White plaques (i e. those that contain inserts) were stabbed with a sterile needle and the adhering phage particles inoculated into 5 ml TE, a few drops of chloroform added and the phage suspension vortexed vigorously A sterile loop was used to spread a drop from these suspensions onto a B agar plate overlaid with 3 ml 0.5X w/v B agar, 300 ol JM105 cells <109 cell/ml), 10 jul 100mM IPTB and 50 jul 2 V. w/v X-gal. Infection of the JM105 cells produced isolated plaques after incubation overnight at 37°C. A plug of agar with a white plaque was removed using a Pasteur pipet, inoculated into 2 ml YT medium (8 g tryptone/1, 5 g yeast extract/1, 2 5 g NaCl/1) containing 10 ill JM105 cells <10 9 cells/ml) and grown at 37°C for 5-8h One ml of this culture was transferred to a 15 ml Eppendorf tube, centrifuged at 12,00 0g for 2 min and the supernatant transferred to a second tube 200 Ml of 2.5M NaCl, 20% w/v polyethylene glycol (PEG-6000) was added, the solution vortexed, incubated on ice 41 for 30 min and centrifuged at 12,000g for 5 min to pellet the phage. The supernatant uias discarded, the phage pellet dried under vacuum and then resuspended in 100 jj 1 TE, coat proteins removed by extraction uiith 50 jil TE-equilibrated phenol followed by extraction with 50 m 1 chloroform-isoamylalcohol <24 1). The upper aqueous layer was collected, a 0.5 volume of 7 5M CH3COONH4 pH 7 5 and 2.5 volumes cold 95/i ethanol added and the mixture incubated at -70°C for 5 min DNA was pelleted by centrifugation at I2,000g for 5 min, washed with 1 ml cold 70X ethanol, dried under vacuum and dissolved in 10 jul TE Template DNA (4 jul), 1 jul of 2.5 >jg M13 DNA sequencing primer/ml, 4 jil of a 32P-dATP <40 juCi) and 1 jul of 10X buffer <0 5M NaCl, 0.1M Tris.HCl pH 7 5, 0.1M MgCl2, 0 01M DTT) were mixed in a 0.5 ml Eppendorf tube and incubated at 56°C for 10 min. The tube was placed at room temperature and allowed to cool slowly. 2 jul DNA polymerase I 1 unit-'jul) were added, mixed and 2 jul of the solution distributed to each of four 0.5 ml Eppendorf tubes containing 3 jul of either A mix, G mix, C mix or T mix These dideoxynucleotide mixes in IX buffer contained the following concentrations of nucleotides; A mix: 60 juM ddATP, 17 juM dATP, 30 juM dGTP, 30 juM dCTP and 30 juM dTTP; G mix: 150 juM ddGTP, 17 juM dATP, 4 2 juM dGTP, 40 juM dCTP and 40 jiM dTTP; C mix: 100 juM ddCTP, 17 juM dATP, 40 juM dGTP, 4 2 jjM dCTP and 40 uM dTTP and T mix 120 mM ddTTP, 17 juM dATP, 40 juM dGTP, 40 uiM dCTP and 1.7 jiM dTTP The mixtures were incubated at room temperature for 15 min, 1 jul of chase solution <0 2 mM of each dNTP, 0 05 units DNA polymerase I^jul) added and the mixtures incubated at room temperature for a further 15 min The reactions were stopped by addition of 10 tracking dye (0 3^ w^u xylene cyanol, 0 3/! w^u bromophenol blue, S9Y. w/'u deionized formamtde, 10 mM EDTA pH 8 3) and incubation at 100°C for 3 min. An aliquot (3 jul) of each reaction sample was loaded onto a SY. uj/y polyacrylamide sequencing gel (as described aboue> RESULIS, Construction of plasmids pETSgl. pETSgg. pET8£3, pET8£4. and p £TS£5. Methanococcus uanmeln DNA uias digested uiith Hindlll and Ps11. mixed with Hindlll -Pst I digested pUC 8 DNA, ligated and transformed into the multi-auxotrophic strain Escherichia coli X760 Transformants of X760 capable of growth on M9 minimal medium plates without arginine, were screened for recombinant plasmids. One plasmid, designated pET821, contained a 7 0 Kbp fragment of methanogen DNA (Fig 1>. Digestion of pET82l DNA with Hindllf and Pstl. ligation with Hindlll-Pstl digested pUC9 DNA and transformation of X760 resulted in plasmid pET822 which contained the 7 0 Kbp methanogen derived DNA fragment in the opposite orientation with respect to the vector DNA X760 cells containing pET822 were also able to grow on M9 minimal medium plates without arginine. This indicated that the cloned methanogen DNA encoded either a suppressor tRNA molecule or a polypeptide capable of complementing the argininosuccinate synthetase deficiency of £. coli X760. It also suggested that the tL vannielii DNA contained a DNA sequence recognized as a promoter by E_ coli RNA polymerase To determine more precisely the region of tL uanmelii DNA in pET821 necessary for aroG complementation additional plasmids were constructed by sub-cloning DNA from pET821 (Fig. 1> Digestion with EtjiRl, ligation and transformation into X760, followed by selection for AMP resistance, resulted in plasmid 43 44 Figure 1 Plasmids derived from pET821 pET821 has a 7 0 Kbp fragment of fcL vanmelii DNA cloned into the Hindlll/'Pst I site of pUCS pET824 tuas obtained from pET821 by EcoRl digestion and religation. Ligation of EcoRl digested pET821 with EcoRl digested pBR32S produced pET823 pET823 and pET824 do not complement araG mutations in £. coli Ligation of Bglll^Pstl digested pET821 uiith BamHl/Pstl digested pHE3 produced pET825 which retains arginine complementing activity Restriction enzyme cleavage sites are B-BamHl. Bo -BqIII. C-Clal. E-FroRl. H—Hindlll and P-Pstl 45 PLASM10S DERIVED F RDM PET821 PET821 PHE3 Eco R1 LIGATION Ecd Rl \ LIGATION LIGATION UUIX PETQ25 PET824 8g F ig u re 1 46 pET824. Complementation studies determined that pET824 uias unable to confer arginine independent grouith to X?60 The DNA situated between the two EcoRl sites of pET821 must therefore be essential for acqG complementation Sub-cloning of the 13 Kbp EcoRl fragment from pET821 into the EcoRl site of pBR322, transformation into X760 and selection for AMP resistance produced plasmid pET823 Complementation studies showed pET823 was also unable to confer arginine independence to X760 indicating that the DNA situated between the two EcoRl sites of PET821 was essential but not sufficient for argG complementation Pstl-Bglll digested pET821 DNA and BamHl-Pstl digested pHE3 DNA were mixed, ligated and the mixture used to transform E. coli RR28 competent cells. Transformants containing recombinant molecules were selected directly by plating cells on M9 minimal medium supplemented with 200 jug PFP/ml and 30 jig CflM/ml Plasmids present in transformants were obtained by rapid plasmid isolation and used to transform L coli X760 Transformants were selected for CAM resistance and these clones screened for their sensitivity to AMP and ability to grow in the absence of arginine. One transformant capable of growth without arginine, contained a plasmid designated pET825 (Fig. 1) The arginine complementing entity, either polypeptide or suppressor tRNA, encoded by the methanogen derived DNA was therefore concluded to be encoded by a DNA sequence located between the terminal Ps11 site and the internal JUglll site of the H. uannielii DNA fragment in pET821. Expression of methanogen derived DNA in B. subtilis. A fragment of methanogen DNA cloned from Methanosacrina barken had been shown previous 1 y to complement a deficiency in 47 argininosuccinate synthetase in both L coli X760 and IL subtilis MB97 (90, 91> To determine if the £1. vannielii derived DNA encoded by pET821 uias also functionally expressed in IL subtilis. BamHl digested pET821 DNA and BamHl digested pUB110 DNA were mixed, ligated and the mixture used to transform JL coli X760 All transformants selected for their resistance to both AMP and KAN contained plasmids capable of complementing the arginine deficiency. Plasmid DNAs isolated from these transformants uiere used to transform competent cells of B. subtilis MB97 Selection for resistance to KAN provided transformants that were screened for their ability to grow on M9 minimal medium Plasmid DNAs were re-isolated from transformants capable of growth without arginine and used to re-transform L coli X760. All X760 transformants resistant to AMP and KAN were also able to grow in the absence of arginine. This indicated complementation of L subtilis MB97 argA2 was due to the methanogen derived DNA and not due to reversion of the arqftg mutation The same results were obtained with a chimeric plasmid constructed from EcoRl digested pURBt DNA, containing a cloned fragment of Methanococcus voltae DNA, capable of complementing L coli X760 angG36 <145), and EcoRl digested PUB110 DNA. Overall these results show that genes from three different methanogens, barkeri. M. vannielii, and £L voltae are transcribed and the resulting mRNAs translated in both £_ coli and IL subtilis to produce a polypeptide(s) capable of complementing argininosuccinate synthetase deficiency in two eubacterial species. 48 Polypeptides synthesized in DS410 mimcells Encoded bu pET8£1. PET823. pET824 and pET825. It had been shown previously that pET371 <90, 91) and pURBl (145) directed the synthesis in minicells of polypeptides with estimated molecular weights (Mr) of 51,000/49,000 and 55.000 daltons respectively. Minicells from strains transformed with plasmids derived from pET371 and pURBl incapable of arginine complemention, did not synthesize these polypeptides 011 plasmids directed the synthesis of p-lactamase or chloramphenicol acetyl transferase (CAT) Competent cells of E. coli DS410 were transformed with PET821, pET823, pET8£4 and pET825 DNAs. Mimcells were isolated from the transformed strains and incubated with L- 49.000 daltons whereas pET823 and pET824, plasmids incapable of arg636 complementation, did not (Fig. 2). All three plasmids directed the synthesis of 0 -lactamase. pET82S did not direct the synthesis of p-lactamase but directed the synthesis of CAT. In addition pET825 directed the synthesis in minicells of two polypeptides with mobilities identical to those of the polypeptides synthesized in minicells containing pET821 (Fig 3) It appears therefore that the metabolic deficiency caused by the arqfi mutation is alleviated by either the 51,000 dalton polypeptide, the 49,000 dalton polypeptide or both of these polypeptides encoded by the 13. vannielii DNA cloned in pET821 Loss of the ability to direct the synthesis of the 51,000 dalton and 49,000 dalton polypeptides always correlated with a loss of 49 Figure E Autoradiogram of the electrophoretic separation of radioactiuely labeled polgpeptides synthesized in plasmid containing coli minicells. SB /3 lactamase Figure 2. 51 Figure 3 Autoradiogram of the electrophoretic separation of radioactiuelg labeled polypeptides synthesized in plasmid containing L coH minicells 5£ in — OJ CvJ rr_ ro 00 GO 03 LlJ H 1- or X LlJ LlI 3 C l CL CL C l £ lactamase F i g u r e 3 53 complemention ability. In all plasmid constructions bath polypeptides were either encoded or not encoded, in no case uias synthesis of the 51,000 dalton polypeptide observed in the absence of the 49,000 dalton polypeptide, or vice-versa Nucleotide sequence of pET821. The nucleotide sequence of 2 0 Kbp of the 7 0 Kbp M_ uannielii DNA fragment in pET821 uias determined using the sequencing strategy shouin in Fig 4B The 2098 bp sequence extends from the PstI cloning site of the pUC 8 vector DNA into the methanogen derived DNA and contains two long open reading frames 6 ,non- 6 ,N rules <131) and so is presumed to represent the C terminus of a bona fide M vannielii gene A computer search of the PIR amino acid sequence bank, using the IBM Genepro program, failed to find related sequences, the function of the product of 0RF222 therefore remains unknown Following the TAA termination codon of 0RF222 are two 15 bp identical, directly repeated sequences, separated by 1 bp Although these repeats could form a hairpin structure in mRNA, the predicted instability of the hairpin stem suggests that a classical rho-independent terminator is unlikely. The second open reading frame Figure 4 Detailed restriction map of 2.1 Kbp of pET821 A Restriction map of 2 1 Kbp of pET821 Hatched box represents an open reading frame 397 amino acid residues of argininosuccinate sgnthetase S. Strategg used to determine the nucleotide sequence of 2.1 Kbp of PET821 DNA 1 II' II III III £. II £. ha m o R o jnc h a A A A h a III CFO I AHA I A L A c t i Ama I H 11 A H * III S c * I L i * I A > T ID T 00 00 1 5 0 0 ■ too ■ ■ 3 0 0 ' 1 9 0 0 -o -■ 400 ■1000 '1 2 0 0 '1300 - 1 6 0 0 -ieoo -2000 -2 -500 ' 6 0 0 ' 7 0 0 'BOO -qoo "1100 1 4■ 0 0 - 1 7 0 0 -2100 t I I 1 f i 1 Figure Sfi 56 molecular weights reported for other argininosuccinate synthetase subunits (13, 28, 31, 105) 0RF397 conforms uiith RNY and 6,non-G,N rule predictions. Comparison of the 0RF397 amino acid sequence with the previously reported cDNA encoded human argininosuccinate synthetase amino acid sequence (13) revealed regions of highly conserved sequences. It was therefore concluded that 0RF39? encodes the aroG complementing polypeptide i.e encodes the argininosuccinate synthetase of M. vannielii Three bp 5' to the AUG translation initiation codon of 0RF397 (Fig 5) is a potential ribosome binding site, the AGGTGgT sequence should be capable of hybridizing to the 3' end of the 16S rRNA of H. vannielii <3* -CCUCCACUA-5' ) (2), Expression of a gene in £. coli and A. subtilis demands hybridization of its transcript to the 3' end of their 16S rRNfls. The sequence AGGtGGTaAT should hybridize to the sequences found at the 3 ' ends of the 16S rRNA of both E_ coli (3'-AUUCCUCCACUA-5' ) and B. subtilis (3'-AUCUUUCCUCCACUA-5*> (2). Further upstream, between base pairs 728 and 772 (Fig. 5), are sequences similar to those indicated in DNase I and exonuclease III footprinting to be the binding sites for RNA polymerase used to transcribe the M. vannielii his A and methyl reductase 0 subunit genes (J.W Brown, M. Thomm, G.S. Beckter, B.A. Sherf, personal communication) (25) The sequence upstream of 0RF397 (ACTGATATATTGGTAAAA) matches the sequence found upstream of the hisA gene (ACCAATATATATGTTAAA) (26) at 13 of 18 positions It matches the methyl reductase 0 subunit gene upstream sequence (AACTTGAATATATCT) at 9 of 18 positions (25). A sequence, ACCGAAAANTTTATATANTA found 38-50 bp upstream of M vannielii and £L voltae tRNA genes has been suggested to be a consensus promoter sequence (141) The sequence 57 Figure 5 Nucleotide sequence of 2 1 Kbp of pET821 Termination codons of ORF222 and ORF397, and the initiation codon of 0RF397 are indicated uiith single dots. Direct repeats downstream of 0RF222 are overlined with arrows. The inverted repeat sequence downstream of 0RF397, indicated by converging arrows, could be a transcription termination signal Bases which are presumed to constitute a ribosome binding site are marked with double dots ( = >. Bases marked with asterisks (O are also found upstream of the hisfl gene of £ 1. uannielii Restriction sites used in cloning and subcloning are indicated. Amino acids are indicated by the single letter code under the second base of each encoding codon. SB * + t ■ 1 3 ' 3 - ■ - L -I ( * * . , n i ijm tee *»*■,*• T m ^ M T U T .^ rr . t i « a ; t . : . . t :::»: t t . t t t l a a a t a a a a t ■' i t t t t a m . a : * c m : u <. Ai*7(,TM(TTs a ,. a a t T T ’ 1 1 Jl 1 3 I ‘ I lAA*c*TT0TAiT**CT;.CiLT7-CTf. AJH,AIATATTAAiA;.lT-.^ATTi*ITCiiCT*»CT»i‘TCiiACvjA*I7TJIit*T?ATTtCAiTiJ>iiC**Aj*TA TAliHAACCCC.-AlTTrrtLT:. TrTcllTArrfCrTAIAAATCAiTTAAATCCTAAACrTAATCCTTCrTl AAffTTcirAA**CT-TTATT:fCTTAAA*rfTl TT!T?AAl(JTAAACtTA 7 so * * * * •*...... tt Ml * ^AA»AAtir7TJA7All<.CA7rTAA<;7c;*T*fAT1CC7AiA17TAT7TTTCA7AA**Ci*TTTAlACACCACCIiCTiittttCJlwt*Ai***TT&C*tTTTr*&CATACrCTtiJIGCSCTTj BAG ■ | | i A T At A AOC t C C t ^ C rt A AA AT T ACTTr. A A U 4T A A'", I A T A ATT AT A A AC T JCTTT CTGTTCC ACTCC ATC 1AGGGC AGCGT G AtC AtCAC T T A *A *C A A C CtQ *AQ AAA ACCC*A AG AAAC 9 * * 1 4 I 1 < t )N J L * a * V r r W V * l*i»dT4qp*dfl i k|pt«na«« TTCCTGTTTTAAACCATrArACTATTCATGCAAAyCAACAAtTTSCACTtOACTATtTTTTTACCCCAATAAACSClilTUCCCTTTATCAACGTTACCCCTTATCAACGCCTCTTCCAA 1QBO ii*lfcriytld#pfPi'*TdTirr*iKini]T*(Tf>]»t*l* ^ACCCCTCATTGCTATAAAlArTCCA&AACTTCClMACAAArrG&^CCGGATCCAATTTCACACCCCTCTACrCGAAiAGCAAAT-SACCllTTTACATTTCAtTCCCTAATAAGAAClA 1 7C0 * p i 1 * b y I • « ] t y » L | i d < l l h f C t | f e ^ n d a l ' l ' r * * * I ' I jail A AGC A CC T C A A A T T C A A AT A A TTCC !T C C G G T 4 ACCGA TT T i l ACC f T A C A AG A A C A G A AC A A A T C { A A T A T G C A lA lC AAAAA&G A A T tC C A A T IC C A G T T G A T 7 T A G J U A lA C C A T T T A M J C k*pAI»llAp'rJid]l,r <-\*flqtlkahAlOlpnaL*l(p( G 7 ATCG ATG AAA ACTTATGCGG rA GAAGTATTCAACG CGC A A T7TTIG AA AACCCA ATG AtTGA A AC TCCA A A AG AG TCTTTTCCATC£ ACTG TTCACCC&AA A AT AC CA A A AG ATG A AG 1 * AO i I d * r 1 y ( r *1*41 > l#*pal*Lp»*ef«wt « 4 p h ) a k d * 1 AG A G f A C C fA G A A 1 1 TC A A T T T A 1GCC A G C C C T T C C J> - T 7 ‘ C A At I t ACGCG CAAA 1 A T t TC ATGC ? ATT A ATGT AATA A A A C A A G C AA a TAA a CTTGCTCCC AC AA 1-C GO C C T T C C T A H t d GGGTAGATATTATAGAAGATAGGGT7TTAGGCCTTAAATGAAGAGAAAATTATGAATGCCGCGCTGCAATTTTATTAATAAC7GCACATIAGGCACTTG1GGAACTC:T7TTATCAAGAC l«*0 A ACAA TT ACTAT T T AIACAAATGCTCC A T tC AA A G 7 ATGC AC 1 C 7 T A 1 TT T A CA 1 A GOACTATCCC ACCAAC CTTTA AGACATG A TT TaG A T O C A TTT G TA C A C A A AACwCA AGA A AC A A I 1 0 0 iilTrki»iditri4lirt|i>'ritp|'hdldirrakt4tr tcaatcgaatacttcgggcaaaactttagaaaggttctttaacaattcttggaagggaaagcgaatg Tgcag 7TTATCAAgaaaacatggt 7TCATTCGagaataacgacatcgatca TG5T7lticTt7tliTACAAA&TTAt F i g u r e 5 5 9 ACCATTTAACTGATATATT6 found 45 bp 5J of the QRF397 initiation codon matches this putative consensus promoter sequence at 13 of 20 positions Downstream <18-19 bp) from the putative promoter for tRNA genes is a conserved triplet T 6C which is the site of transcription initiation, however there is no T 6C triplet at this position upstream of ORF397 Between base pairs 766 and 771 the sequence TTGATA matches the consensus -35 region of £, coli promoters CTTGACA) in 5 of 6 positions <54) Overall, the intergenic region upstream of 0RF397 contains sequences which are similar but not identical to those reported to be involved in transcription initiation in £L vanmeln. M. voltae and coli. and translation initiation in £L vannielii. £. coli and JL subtilis Upstream of genes encoding enzymes involved in arginine biosynthesis in £. coli is an 18 bp control region, the *arg box*, a partially conserved, palindromic sequence < 8, 17, 27, 28, 102, There are no runs of arginine codons in the upstream region of 0RF397, nor are there sequences that might form a stem and loop structure in mRNA which would be needed for attenuation DNA sequences involved in the regulation of expression of genes 6 0 encoding arginine biosynthetic enzymes in £L vanmelii do not therefore seem likely to be the same as in eubacteria Eighteen bp 3' to the ORF397 T A A termination codon is a sequence potentially capable of forming a stem-loop structure as mRNA with 4 6-C bp in the stem and 6 unpaired A’s in the loop, folloLued by 7 U's. This resembles a rho-independent terminator of L coli The intergenic DMA sequences flanking 0RF222 and 0RF397 are very A^T rich C78 4X A+T) uuhereas uuithin 0RF222 and 0RF397, the A + T content <67 87i A+T) is similar to the overall 67 A+T reported for the total genome of £1. vanmelii <63) Nucleotide sequence of pET371 The nucleotide sequence of pET371, containing the M barkeri DNA fragment cloned previously <49, 50), uuas determined using the sequencing strategy shouin in Fig. 6B The 3113 bp Hindlll fragment contains tuio long ORFs (Fig 6A> both of which conform to the RNY and 6,non-G,N rules <118, 131). Open reading frame 396 (0RF396) is 1191 bp long and encodes a polypeptide containing 396 ammo acid residues with a calculated molecular weight 44,445. This is lower than the apparent Mrs of 51,000 and 49,800 estimated from the electrophoretic mobilities of polypeptides synthesized in minicells containing pET371. The 0RF396 amino acid sequence is very similar to the amino acid sequences encoded by the £L vannielii DNA (0RF397) in pET821 and the human cDNA known to encode argininosuccinate synthetase Taken with evidence from sub-cloning of DNA from pET371, that resulted in derivative plasmids incapable of conferring arginine independent growth on L coli X760 (90, 91), it was concluded that 0RF396 encodes an arqG complementing polypeptide i e. the argininosuccinate synthetase of £L barkeri 61 Figure 6 Detailed restriction map of pET371 DNA A Hatched box represents an open reading frame B. Strategy used to determine the nucleotide sequence of P E T 3 7 1 DNA 6 8 OOU H UOOt J C u b i- uO«2- HU ____ HU 033 ----- 00£2 tm 033 —— 00«_ t ' J t 0Q£2_ 00*2 , 00*2 - oo;c_ i : t . 0012, , , t 0002 OOW I; i t; OOQl. 1 T * O Q il- 4 ’ ’ * * 009*- , » * oo «_ I ¥3$----- OOM- i-T F igure F 00*1 ( ’ t I O02i- OOii o o o i - 006_ . I ■ OOQ 1 t t ‘ ■ t 1 I < < O O i- o* «u 4 ' * „ < ►- 009- 005- 0(J* -i NN ■■■Oil -J 1 * I * o; _ 301 . 63 Figure 7 Nucleotide sequence of pET3?l DNA Termination codons of 0RF398 and 0RF396, and the initiation codon of 0RF396 are indicated uiith dots. Direct repeats downstream of 0RF398 <6x14 bp) and upstream of QRF396 <3 x 2 9 bp) are over lined with arrows. Converging arrows indicate the only palindromic sequence upstream of 0RF396 Bases which are presumed to constitute a ribosome binding site are marked with double dots <:). Bases marked with asterisks <*) are the same as those proposed as putative promoter sequences in halophiles and 1*1. smithii (53) Restriction sites used in cloning and subcloning are indicated. Amino acids are indicated by the single letter code under the second base of each encoding codon 64 [I * * " t * •; = i ' 7 : r ; a a . : t g * a ;• g g .j r * : -1 a * c t t l . r. a a * ■ , i. a a c a a t t a l: t t g : * a a a s, 7 * 7 : g c 7 r c c ; t ;tg-l77GT... , t : : m : .• 7 a 1 r . i : . . , , 0»* J4« u ' ■ . l u t c c m u : : : * : * ' : a r :; a a a t m , a c : t t g a AC c •' T A r »7-. a a A.. AGGC ag r r acGj ?r 7 . t< * U iU '.* r r c n u ic T u * n ; n :: A T T : i'; 7 t i. agc. : n :. l t t ; : , a a I T - , T .,', a 7 '.j r Ao T T . A .. a :f a a a j a.; , T t A T : i : : ' . / - -. a a t a * ! ...a:.-: a : * : * c j * A . , t i j j t •, 11 •' a < 7 1:: i,a ., at t 1 i : 77 7 ^ 7 a a r :: . a • ; a s a t r 7 •1 A * N •’ ¥ I * ,1 I - - I r * l» g •. 3 1 J 1 A .. •, A % r; t 7 : it ^ [ c j ^ / a a , , ; ; : * ; t a t a . ■: i : a a a t a l. c t ' T 7 g : 7 ■ T : a t . a a a ■, •: 7 a 7 7 a a ■' A T A C a g a t g g l' a *. a G a a a g r . a a g i, t ." r a t \ ; • : ] r ., * a .: ? * .r .i > ' -7 s 1 *!--■■'►* ; 1 r- 1 ■■* * - * - 1 » h ' p ; p a rrr-:i.TTiAAGi'A^CALA*7T r : t 7 7G 7 t 7. i i m : , : " • t 7 ” g a a t c s. •' t t j f. a a a g a t t g c a g c T a a g ; j 7 a a 7 t g t a ; i g a l a c a g c t t a a a g a g r 7 r 'j g g t a . a t a." ,a f r * * r ' I ; | 1 n ri > c L f . » * I » » h , l < g ^ S . h ^ , (l.Tlr - a a a i 1.1 aaa<.a7:;t*7: .7 a 7 7 a a s . g a a m c :• 7 : t Ti.' •” -T:::iTA»Arr ag l A c g tg ca c1 r : r rr;7 r :• t •: ;;gC c c g C AS A1 0 A*r; a g t A r T r : v ( a g g j a a t a a t 7 ii 1 = t a - , a T ! . . G v A A g g g C A | A 7 T A 7 A AO G C AG A G C 7 7-., L I :, G s G A7 AA T : 7 r T s ; T C 7 7 A :J A-,7, A AA A' ; 7 T I T C 7 T T C C A T A A G G A A TG C A G A C A A A A £ C g A * rTT.V.T.,:; A r:; T t G . A A *... A A A 7 7C A : a:, r A , T T . . 1 H M l T . C s , . ' 1'. J ‘ ! t H A T A T ■' 7 7 L A A •. : A . . . T : 7 T •' A T j -i A 7 •. T .. .. T A A A ft A •. . T ■ A 7 . A : . A A .. 1 . A A T , T A T * * 1 1 | ■> i ' A T - . l T ’/ o . A G A Ij A : > T T A A : . T 7 A 7 ; A T r A A T » i , ' ; A A A A As A 7 :. .j ’A-., rj, s. A T s!s, 7 7 • A A ; A 7 ! A . C' l . , I. 7 , ; 7 s, 7 1 A ' 7 7 C A A 7 s ., T A s" A T A • . ■ ■ m TP.-: I r ■ j » H ^ l - - i a i * f « ^ £ I T ^ ' A I j Cj :. CJCAA7A;;:1.. ..i-A.I.LtSCT-SCCATHjAliAl'TATsrAAGAAiiutfAipAOCAACTTALiATTiiAArLCATCAACCAGTACCACAAACjiOJil^CAA&A^AAAAv'AAAr, Tf'TAA j . ■ qt*iia#AiipT*i*h4ffj;ih!ilrfyrB»iif CC AAA A AC rAAfl’r d c ^ A A A AT rtrAArfGiTTAJiTATr. c: *TC T G g Ia A A I f z C A A C T CC,Taia AT cTaaC r & 5 T ii A aT CTCCfC.JlA A A C CG ACT A A AG *:AG A A A TT 7C A 'L::C j AA A T TC T'.C ITA AATTc.TfrC.A AC ^A A A A A A £ A 7 C Z i: 7 T C, Z T 7 A r ' T T T T T T ? A C A T A A T T T A T T T A I'. A C T A T 7 A A t C T A A 7 T C T-C C, C. IT A T 7 A TA TCCuTTA:AT::G 7T:;AAO TTA'7' ' 7 C S T s f T “ iTIj; AC T? A 7 -f A i f c f (T AAG r iT * ACfT AACTA !ffTA f?AA T CT CAACTT^AAC t T f l A i f 17 T c a tT aatct tTac tTTa AiTCACCT. ::ACA;77A:TTG;TT7AA: C T lT A A ST T AAACGGT7 7A C TIA 1A G C iTGGCAAATjiiAlLTTG C AC T TGCAT A T TC 10C A GG t. C T I t I t A C C T CGGTGTGCATCCCT a t CC Tt AAuiiAAAtC 7 1 C G17 A7 K C i L u l A C 7 AG 11- fl r «aMhv*lBv*at^4t»»clpt ivtkTlrC** «n« , 7TiC-trATTTCAi:jTCGATCTA|7UCi:. *GCCTCAAijA(j|jAAATCA*uCGGSCACACrJ CAAAAGCAG*CAAtiATCAC.CAAtAACCATTAT A v GATCCITC CA4iCGAACACTTCCT7AAACAT7 1ft OC. A”G7' TTCTCArT-ATTAAAGs"' AACCGAiGTTACCIAAGGCTAT-.TJAT.'. C. GIACTTCAATTCcr-r GCCCGTTjATCCCGAAAAAArjTGGTCuAGGCTGC.'AijGAAAjAAG.', |i,' A .7 * ‘ 1 i,1' •: c c r 7 n c r c t c G G c T : A •: A c si A A A A G G c A A r A r • A :;c ir G G r T T 'r'iA:JG c :j7 7 tT c :3 C i:A (j# c c .:JA C A T :iG A T0T T A rrnc:cc.7A T i:cG .:G A G A T ^A A < t T g a c f r;-.. r :. a ., 7 g g l , P ] *-'6' ij. » (''dq;rfa»»T * r** AIATCGAGTaCijCAAAAGA:.:.A:'GCAATCr [""-.G7GGAAGT(;ACAAAG77!'AA:.CG7TGCAGCGTTGiCGAAAACATCTGGAGC('GCAGTATCGAAGGCGGCAjGrT7GAA.:jA' ■ "TT' :,7 TGA&CGG'. TAGGCCTTT i7GAAGAAAATjAfcTGAfcA77GCAGGrr;AAAACGGCCTrGG7<:GAACTGATA7GATtGAAGACC&CC7GCT7GGCTTAAAAjCCCGCGAGAAGtAf'GAAr-. AC'' .'t'lf' • y, v i ‘ | Aft»njygi*-l.(jii!l c c Cc t g c a a :• t g r 7 r t : : g g c A j •: 1.1 h : i . r; r l. a g c T I r G A G a a a •: 7 f 0 T •: C T T Li a : c c G C a g CG a a ' t l a a g t t r a a g a a g a tc g 7 o z a c g a ■: <■ a t g c t c t g a a c t 7 g r ••; p, t t a • i. • r f p .J t r- * r t - r- 3 •, T . .4 , I b < k M I V l! <] b -I p a > , G G C T T G 7 G (, A C G A r., r: •: G 7 7 7 7 A T . A 7 ' 7 , » A : V ' 7 7 7 A T r'.i A ■: A A G T C f • A A A A A G 7 7 A : A C G t A T A l . 7 : A A A G 7 r. J G G r 7 r. r A C A A A .7 G r 7 r ■ 7 G A G :i A 1 7 7','"' ' G * “ " . > a . r J ' ' k ' * ^ i K ! ! > » p a . t . a - , r T I i •: » A T T ' A T T r ” • k » .. * T 1 ■ :.TT7'_ I • I • ' A T 7 * • . * A * ' 7 •! A A A G • 7 T T G 7 i A T A 7 A r * A :. •• C A . . A T A * , I I . ' s k t - * , , * p - , a ' ' ? TAATGGATAAGrAGTAA'j7TAAT'.AT77TG 7 7 : C 7 7 7 7 A i. I . ( G A i T G ■ G A : TT' 7AC7T7T7AA7T7AA7A77ClAGTrTTTT77AAA7AATCAA7T77- T:7AA77'7’ ;':: - * . «P V • rl H T CCrA7JA7AA7AATTtTr, A»::CAAT::TiJAA7AGT'L:rJGAAA7ATGArTACCAGrTTT7t::ATAtAArL-:7J7ArArTTTC7AGAAiTTTCt'A'.'-TTTTCCAGCGATArA7G:A7AGTA7J' ■ ‘ fc*odin AATTCTtAAAAATGCGGvAu'TGCAAG Z. G G AT G A C r G T A * AG 7 C G T T & A A T f t C A T C T A T 7 G C T A AC t z A C T G A A C T C C C A C A 7 T T r T 7 r A 7 A 7 : : 7 C T ^ C T G C C C A C : l A C : 7 ,' F ig u re 7 65 Six bp 3' to the translation initiation codon of 0RF396 (Fig. 7) is a potential ribosome binding site. AaGgGGTagT which should be capable of hybridizing to the 3' ends of the 16S rRNAs of M barkeri (3'-UCCUCCACUA-5' >, E_ coli <3'-AUUCCUCCACUA-5'> and £. sub tills <3'-AUCUUUCCUCCACUA-5' > Further upstream, between base pairs 1554 and 1565, the sequence TTTTAATCTATA is similar to a sequence 5J TTTTAATATAAA suggested as a possible consensus archaebactenal promoter sequence from Methanobrevibacter smithu and Halobactenum halobium data <52, 53> Comparison of the in ter genic region upstream of the M. bar ken 0RF396 with the putative promoter sequence of methanococcal tRNA genes (ACCGAAAANTTTATATANTA) located one region 0RF396 translation initiation codon yet it has been suggested that transcription of methanococcal genes starts at a TGC triplet 18-19 bp downstream from the putatiue promoter Three 29 bp directly repeated sequences, with a AAGTTYAAGTAYTCATCAATCTYAA6TTY consensus sequence 0RF396 translation initiation codon. There are no obvious similarities however between these sequences and sequences upstream of the £L uannielii 0RF397 initiation codon nor with the putative methanococcal promoter. None of the 0RF396 upstream sequences resemble the L coli *arg box" consensus sequence The only palindromic sequence, AAAAGATGG< 8N)CCATCTTTT, is 215 bp upstream of the 0RF396 initation codon, 3' to the three 29 bp direct repeats. There are no runs of arginine codons in the intergenic region preceding 0RF396 nor sequences potentially capable of forming the stem and loop of an attenuator The upstream region of 0RF396 contains sequences that are more 6 6 similar to those suggested as promoters in fcl* smithii and H* halobium than those suggested to be promoters in vanmelii and H volt ae. Transcription regulation of the genes encoding M. barken arginine biosynthetic enzymes does not therefore appear to be the same as eubacterial transcription regulation Sequences required for translation of mRNA in H, barkeri. £. r.Q li and JL subtilis are houiever present immediately preceding 0RF396. There are no inverted repeats downstream of the 0RF396 TAA termination codon that could identify a terminator, nor is there the start of another open reading frame The second open reading frame in pET371 <0RF398) is 1197 bp long and is located 5' to 0RF396. 0RF398 has the potential of encoding the 398 carboxyl terminal amino acid residues of a polypeptide interrupted at a Hindlll site in the methanogen DNA 0RF398 terminates at a TAA codon and is followed 4 bp downstream by six 14 bp direct repeats 5 0RF396. A computer search of the PIR amino acid sequence bank revealed that 0RF398 is closely related to the large subunit of yeast, rat and £, coli carbamyl phosphate synthetases <80, 95, 96, 97, 98, 104) The large subunit of £_ coli carbamyl phosphate synthetase is encoded by the carB gene which is 3819 bp long encoding 1878 amino acid residues resulting in a polypeptide with a calculated molecular weight of 117,710 <95) The sequences of the large subunits of carbamyl phosphate synthetase in rat, yeast and L call have an internal homology <80, 95, 98) indicating that the present forms of these genes 67 resulted from a tandem duplication of an ancestral gene, an event presumed to have occured before the divergence of the eubacteria and eukaryotes The available sequence of 0RF398 is too short to determine if the gene encoding carbamyl phosphate synthetase in £L barken MS also contains a duplication. The 3.1 Kbp methanogen derived Hindi 11 DNA fragment in pET371 had been sub-cloned from pET720 <49) which contains three Hindlll fragments cloned into pHC79 from a partial Hindi 11 digest of genomic fcL barken DNA It was possible therefore, that more of the methanogen carbamyl phosphate synthetase gene was on the other Hindlll fragments of pET720 DNA sequencing from the Hindlll ends of each fragment did not however reveal more amino acid sequences related to carbamyl phosphate synthetase. It appears therefore that, pET7E0 DNA unfortunately does not contain the Hindlll fragment encoding the N-terminal amino acid sequence of the £L barkeri carbamyl phosphate synthetase. The intergenic DNA sequences bordering 0RF396 and 0RF398 are A^T rich <67 3 V. A + T) whereas within 0RF396 and 0RF398, the percent A + T <49.5X) is much lower The overall percent A + T reported for the total genome of barkeri is 61Ji <63) The £L barkeri derived DNA cloned and sequenced in pET371 averages 50 4X A+T Codon usage. Codon usages in the £L barken and £L vanmelii ORFs show a preference for codons with A or U in the wobble position (Table 2) This is presumably necessary to accomodate the overall high A + T of their genomes. Codons containing a CG dinucleotide are rarely used in the H. vannielii ORF’s and the arginine codons C 6A and CGC are never used This is consistent with codon usage in previously sequenced methanococcal genes 68 T a h 1 (* r' Cudon usjou » r | C h h ij»in GAF222 r . i * i i j 1 1 I no . s > r to, ay" Alanine ll.t M. 3 6 . a 14.1 5 6 0,0 i fc . & 16 . I 1 . 5 t 4 ‘i 10,0 fc . A 0 •* ■ A 1 8 . 4 1 30.0 A'j A 25 ‘,8. j fi i . <: 66 . 7 AGO 2 1,7 '• * , 6 57,9 27.1 ; j . 6 I 1 - r., 16.7 7GA ■5.0 ' ? '3.6 9 . 5 : g c 1 0 1 t , 7 *4 , 9 2 1,1 0 36 , 4 2 S . t - jG 7 1 , 7 10.5 2 9 . 1 28.6 5 , 1 ; gu 6 '•0.0 5 3.6 •0,5 4 . 6 9 . 5 1:1 A)pirB(Lne AAC 4 1 5 4,7 2 b **9.1 60,0 ■1 7 5.0 81.3 6 40 . D 4 23.5 AAU 3“ a 6 . ] 29 50,9 j 40.0 25,0 18.8 i 60 , 0 ! 3 76 .5 A j p a r tite G AC 12 ^3.6 19 29.2 m2 65.6 40,0 1 8 72 . 0 62 , 5 a 31.3 5 . 9 GAkJ 4 1 56 ,2 *6 70.8 22 3*.* 52 , 0 28 . 0 37 .5 l 6 66 . 7 16 9* , 1 C y a t * 1 n e UGC - ? 33.3 2 10.2 3 57.1 50 . 0 100 00 . 0 ] 50.0 1 33.3 UGU 1 A 66 , 7 9 0 1,3 u 4 2 . 9 50.0 0 0 20.0 •j 50,0 2 66,7 G lu iis in e CAA 6 11.5 26 75.0 3 17,1 0 0 0 29-4 - 70 . 0 1 00 CAG *6 80,6 a 29 02.9 1 00 1 1 1 00 70 . 6 i 30,0 0 G l u t « p i u G A A A* “5,0 6 4 75, t 66 ^ . 5 45.6 26 68 , 4 28 . 6 38 86 . 4 14 93.3 GAG 52 S«.2 18 2'.9 26 27.5 S* . 5 l 2 31 .6 7 1,4 6 lj.6 6,7 G lycine GGA 33 29.2 5 9 . t 2 2 , m 28 . 6 25 , 0 9 . 7 1 0 38.5 5 55,6 GGC 35 3 1.0 10 15.2 • i 6 1.5 39 . 3 ' 4 50 , 0 58 . 1 4 15.4 0 3 GGG 22 19,5 17.9 4 14.3 19.4 6 23.1 2 22.2 GGU 2} 20 , 4 78 3 3 ■ i 14.3 3 )0.7 12.9 fc 21 . T 2 22.2 H i s t l d i n f CAC 50 , 0 * 3 ■ , 3 i 73.3 77.8 4 80.0 55 .6 3 50.0 C AU 50 . j i 66,0 « 36.4 22 .2 1 20 , 0 4 4 . 4 3 50.0 l i o l i u c i n * AU A 1 * H .3 1 2 i^-3 26 . 7 “ . 3 j . " 1 0 29.4 46 , 2 AUC 40 4 1.2 22 26 . 2 39 54 . 9 36 .7 1 6 69.6 66 , 7 3 8,8 7 . 7 AUU 46 4 7,4 -- 0 59,5 4 3.? 36 . 7 6 26 . 1 29 . 6 ; t 6 1,0 46 . 2 L • Li c l r. e •-j CUA T 2 9 . l " , 6 0 0 2 . 5 12,0 c u e 1 41 ’ C . 6 * , 9 M . 3 6 24.0 34 . 3 2 . 6 2 8 . 0 t, 41 OUG *-0.9 £ . 9 1 fc . 7 4 16.0 *5 , 7 2 . 8 0 C'JU 1 9 1 4 , 4 60.0 ’ 1 4 4.0 0 . 6 3 0.9 5 32.0 U U A 9 6 , 9 6 . 7 1 4 . 0 52 . a 9 36 . 0 UUG 24 1 8 . 2 3 . 3 3 12.0 5 . 7 0 3 12.0 L y a i n e 0 AAA 4 5 0 1 c, 28 . 3 fc: . 0 17 0 3 .9 45 . 5 33.3 2 6 7 2 . Z 9 0.0 A AG 7 1.7 1 0 J i 6 . 1 5 0,0 • 8 5 4.=, 66 . 7 i0 2 7.8 10.0 69 Table P continued cirB • r 4 G birk«ri bim«ri Hu*»n nl*l"T Q*F222 Rtildut ® ^ ® t I I 1 1 and c o d o n * 0 ‘ i d , « |n • <>- iy n *Q< i|rn No. ayn * Q * *yn Xo . i)rn • yn " f ! m J 0 . r i i . i f * J G kO ' 0 0 1 ■' ' 3 0 TOC 1 5 1 o ^ : lfir. , s n n PPienvidianine 2 9 : 5 p o ; I n f 0 A .• C 2 9 . * s 1 L : , ; r ' S . 2 ' : H * 6 ' c , < / 3 . s * n . o 2 0 . o ■':.(! i ,5 ■ '* ,6 ' r 3. ‘ i ■ s ' - ’ . 'I 1 4 . • 2 c . Cj . ■ 2n o 0, .o C- :. o S t r i n g ACC 1 3 1 2 . A 2 , * ‘ 10.9 N A , 19 . G 30. a - 3 7.5 13-3 a . 2 A u U 1 6 ’5,2 1 •? i 12,2 6 . 5 3 ? A . ] 2 7 . 7 i a . 2 1 2 0 . u j 12.5 UCA i ? ‘ 1 1 6 . 2 2 •'"• 5 2*2 I ‘A . 3 1 5 ,2 ■* . 2 -o .o 37.5 l! C C 2 7 . 6 ■ fc 1 8 29 18.1 39 * ’ 20,6 •j 15,4 1 0 4 1 . 7 fc . 7 j 12.5 O C 0 e> 5 . 7 * -, a 1 0 T 9 , fc i 14 ♦ 3 4 9 .2 A . 2 0 a. j UCU 2* 22 . 9 J 5 **2 , 2 1 0 2 1,7 9 - 5 - 7.7 8.3 3 20,0 fc 25 * 0 T h f io n in e AC A ' 9 30 . fc l . 7 >. 4 35 39 * a 42,9 25.0 fc . fc 36 , a *6,2 ACC 29 3 3 , 0 • o 1 fc . ‘ M 0 66.7 M 28 . 6 a 3 . 0 ‘ 3 72, z 0 7,7 ACC c fc . 7 fc , 5 11.7 * 0 3 i a . 0 5 . 6 18.2 7 . 7 ACU i 9 2 l . fc 9 4 t . a 2 20 . 3 A 2 20 . 6 12 .5 16.7 5 a 5 , 5 5 38.5 Tryptopnan l 5 OCG l 5 0 1 0 0 “ 100 0 - 100 100 3 100 Tirrcsjnf i J A C 2 3 ’ 9 22 £. * "i t 3 . 9 a 3 .2 7 * ♦ 0 3S-5 50.fi ’2 6 3.2 35 . 7 4 57 , l U A 0 - i f- b . & 29,0 □ 3* * L *- 5 6 1.5 A l . 2 36.8 i 6 A . 3 \ A* . 9 1 V a 1 l r, f , i. a -i * , 7 m . 9 J _ ;• ► j 20 . 0 1 fc . 2 35.7 fc fc . 3 0 24 . - • i 13.3 :uc iS.5 ' 2 1 0 . C i 8 . 2 - 36.7 3 . fc 7 9 4 9,7 ‘ A C , ' - 5 o . ■:■ * 5 2 7 22 . L 2^.2 '8 60.0 3 . fc d , 4 u J'JU JO 2 « . « 50 . fc - o 2 1.3 2 7.5 ’ a A3 . si 3 • 3 • 6 5 7.' i 70 <7, 26, 51, 52, 53> £L barkeri codon usage in 0RF398 and 0RF396 does not conform to this general rule In 0RF396, 63 7Ji of the arginine codons are those uihich contain a CG dinucleotide and 4745i of the proline codons are encoded by CCG The glutamine codon CAA and leucine codon CUA are not present in either of the H barkeri QRF’s, and the arginine codon AGA is employed very infrequently Codon usages determined in tuuo genes from tt. halobmm show that the arginine codon AGA, glutamine codon CAA and isoleucine codon AUA are newer used and the leucine codon CUA used very rarely <53) £L barkeri appears therefore to have some codon usage similarities to tt. halobium Computer analysis of carbamul phosphate synthetase genes Diagon analyses with carbamyl phosphate synthetase genes of E. coli, S, cereuisiae and El barkeri are shown in Fig 8. A Sequence Analysis program written by J.W. Brown for the Apple computer was the source of the Diagon analysis used. Amino acid or DNA sequences are compared by their Juxtaposition in all possible overlapping configurations, acquired through movement of one sequence relative to the other at one amino acid residue or base pair intervals. At each overlapping alignment the number of adjacent identical matches are determined A "window size" is chosen to determine the number of adjacent identical matches required before their presence is plotted on the graph. For amino acid sequences the "window size" chosen was either two or three, dependent upon the signal to noise ratio The graphs represent the amino acid sequences aligned with N termini at the intersection of the X and V axes Each pair or triplet of amino acid residues is therefore compared to every possible pair or triplet in the second sequence and a match is scored by a dot on the graph Figure 8 Diagon analysis of carbamyl phosphate synthetas Graphs show Diagon analyses for £L barkeri carB (SARCARP), yeast CPA2 tt 5 E C o fti_ ■ R P T Ey; T M5i Y EhCriRp1. 7EV:T W5 i' VE ^C riR P ; T I 1 ' l i l J < « ! h * , l t * 1 f i * *♦ 5 S Pi R C Pi R P T E '■w' T HSi vEriCriRP. TEXT #7♦ VEriCrrFF l t » T s •* tcocutf mt 'jwn •i brnf tut lmith 11DM»l cm * m * j iP _ - ri R. P r E « 5 • ECOCnrPP * TEX T « 5 i E C u C n P F , T£,-.T F i g u r e 8 73 In this way every part of one sequence is compared to every part of the second sequence. Diagon analysis of the previously reported L coli carB and S cerevisiae CPP2 ammo acid sequences (80) indicated three lines of homology (Fig 8) The central line represents the homology of the amino acid sequences throughout the full lengths of the £, coli and S. cerevisiae carbamyl phosphate synthetase proteins. The tuio shorter lines above and beloui the central line represent the homologies betuieen the N- and C-terminal halves of the sequences These shorter lines indicate the tandem duplication which has occured in these carbamyl phosphate synthetase genes Diagon analysis of the M barkeri 0RF398 amino acid sequence uiith those of the L coli carB and JL cerevisiae CPA2 gene products indicated two lines of homology (Fig 8) Stronger signals towards the 3' termini of the E. coli and S . cerevisiae sequences indicate 0RF398 has greater similarity with the C-termini of these primary sequences than the N-termini 8 stronger signal also indicate that the similarity between barkeri and E . coli primary sequences is greater than between £L barkeri and £. cerevisiae. Alignment of the £L barkeri 0RF398, £. coli carB and S. cereuisiae CPA2 amino acid sequences was obtained using the DNAstar Align program (a homology detecting and maximizing computer program) in conjunction with the alignments previously reported for the E . coli and E . cerevisiae sequences (80) (Fig. 9) Gaps were introduced into the sequences to maximize similarities Many amino acid residues are conserved in all three species but conservation is greater between the eubacterial and archaebacterial sequences Blocks of conserved amino acids are between residues 46 and 64, 106 and 127, 173 and 190, and 238 and 255 (Fig. 9>. 74 Figure 9 Carbamyl phosphate synthetase amino acid comparison Asterisks indicate identical amino acid residues in the sequences of carbamyl phosphate synthetase From different organisms Numbers indicate the positions of the amino acid residues within the alignment Only the carboxyl terminal ends of the sequences are shouin 75 CARBAMYL PHOSPHATE SYNTHETASE AHINO ACID COMPARISON x YEAST IDRAKNRHKESSILDSIDVDQPKVSILTSVKEAKLEASKVNYFVLIRPSYVLSGAAMSWNNK aaaaa a a «« « a a * aa aaaaaa a aa a a K COLI i nRAKnRgREQHAVgRI.KLIcqPANATVTATgHAVEIIAKKIGYPLVYRPSYVLQGRAHKIVYTm a aa a a a aa aa a aaaaaaaaaaaaaaaaaa M. BARKERI mXJYPQPKGGYAT&QKKAIEVAKKIGEPVLYRPSYVLGGRAMKIYYDE a aaa aa aa a aaa aaaaaa a aa a a YEAST - IDRAENRHKESSILDSIDVDQPEWSELTSYEKAKUASKVNYPVLIRPS YVLSGAAME WNHK a a EELKAKLTLASDVSPDHPVYMSKEISQAQKIDVDA VA YHGNYLVHAISE8YEHAGYHSGDAS LVLPFQHLSDDYK IALK a a aa a aa a a a aaaa aa a aa a aaaaaaa aa aa ADLRRYPQTAVSVSNDAPVLLOHELDDAVEYDVDAILDOEMYLICOIKEBIEQAGVHSGDSACSLPAYTLSQKIQDYHR aa aa aa aa a a a aa a a aaaa a a aa aaaaaa aaaaaaaaaa a aa a a a HDLEKYtCZAVRYSHKHPILIDDFLEAASEIDYDAYCDQEDYIIGAIHEHIEEAGVBSGDGACVIPPQSLSPSYLDQVR a a aa aa aaa aaaaaaa a aa aa a aaaaaaa a aaa aa a KELKAKLT LASDYSPDHPYVKSKE1KGAQXIDYDAVAYNGNYLVHA1SEHVENAG VHSQDAS L V LPPQHLSDDYKI ALK * s 1/1 5 DIADKYAKAWKITGPTNMQIIKDGEHTLKVIECMIRASRSEPEVSKVLGYNEIEIAVKAFLGGDIVPKPVDLMLNKKYD QQYQKLAEELQVRG LHNVQF AVKNNEVYL IKYHFRAARTVPEV81CATGVPLAKVAARYMAGKBLAKQGVTKEYIPP a a aaaaaaaa aa a aaa aa aaaaaa a aaaa aa a a aa a a a DYTRXIALALRVKQLINIQMAEEQGKYY VLEANFRSSBTIPEVSKAYGIPLAKIAAKVIVGHSLKSLGYMDEPKPK DIADKVAKAHKITGPFNMQ1IKDGEBTLKVIECNIRASRSEPEYSKYLG VNFIElAVKAFLGGDIYPKPVDLHLMKKYD a a YVATKYPQFSETRLAGADPELGYEHASTGEV AfiFCRDLIKSYWTAIQ STMNTHVPLPPSGILEGGDTEEEYLGQVA a a a a aa aa aa aaaaa aa a a aaa a a aa___a ___ a YTSVEEV VLPEMKFPUV UPLLGFEMRSTGEVHGVGRTEAEAP AKAQLCSMSTMKKBGRALLS VRE GDCERWDLA A a aaa aaa a aa aa aaaaa aaaaaaa a a aa a a aa a aa *a a HVS1ESVLLPEDKLFGADPYLGFEICSTGEVMQI DYDfGKAYYEAELAADNVLPLTGKVELfi IRM AOKTELYDYA K a a a__a aaaa aa *a aaaaa a a a a a YYATEYPOrsrTRLAOADPrLGYlMASTaKVASFaRDLIESYVTAIQ STtMTHYPLPPSG ILEGGDTSEEYLGOVA 9 SIV AT IGYSIYTTNETTETYLOEHIEEKNAKYSLIKEPKNDKRKLKELEQEYDIKAVENLASKRAESTDDYDYIMKRNA ELLKQ GEELDATHG TAIYLGEAGI NFRLYNKYHEaSPHIODRIKHGEYTYII NTTSGKRAIE DSRYI BBSA aa aaaaaa a a a aaa a a a a aa aa a a aa a KLQAA GLEUKITEG TYMYLAQBGY EMDYYKKYHDGSFNYIIimRRDEYNLII NTFTSKQSRR DGSBI RRAA aaaaaaa a a a a a a a a SI VAT IGYRTVT TW II'H TY TJBI T KgKHAgVRri KTpywnyBKI.Rgr.rQgYTI IK AYTNLASKRAESTDDYDYIMRRNA S IDEAIPLENKPgTALLEAKCLXAEIAEK IKILEEHDVIYPPKVRSWDEFIGFKAY LQYEYHYDTTLNGGEATAHALNADATKKYISYQEiaAQIK. aa a aa aaa a VDEEVPYITTMQAAI AAAAA IETHKEGEEI-T IK S I tnrYmnPOnaCM K V aa aaaa a a IDEAIPLFNEPQTALLEAKCLKAKIAEK IKILESHDYIVPFKVRSWDKEIGEKAY Figure 9 76 The V- similarity between carbamyl phosphate synthetase of coil, IL cerevisiae and £L barkeri uias calculated for the DNA and amino acid sequences 6E9X for identical and conserved ammo acids) than between either of the bacterlal species and jL cerevisiae The eubactenal and archaebacterial ammo acid sequences are equally similar to those of the eukaryote, 3?.QX and 40 for DNA and 29.8 X and 30?% for identical amino acids which is increased to 45 5 X and 46 AX. when conservative substitutions are included Amino acids that are represented in a homologous sequence by residues of similar size and charge are considered to be replaced by conservative substitutions Computer analysis of argininosuccinate sunthetase genes Diagon analyses of the £L barkeri. M. vanmelii and human argininosuccinate synthetase genes are shown The DNAstar ALI 6N program was used to produce the best possible alignment of the amino acid sequences of the M barkeri. M vanmelii and human argininosuccinate synthetases (Fig 11) 6aps were introduced into the sequences to maximize similarities Three main regions of conserved residues are apparent in all three sequences (Fig 12) There are 10 amino Table 3 7. SIMILARITY BETWEEN CARBAMYL PHOSPHATE SYNTHETASE OF DIFFERENT SPECIES '/. SIMILARITY BETWEEN- DMA AMINO ACIDS AMINO ACIDS t CONSERUED SUBSTITUTIONS YEAST & £. COLI 37.8/i 29 S/i 45.5/i £ COLI & ft BARKERI 50.3Z 44.5Z 62.37, H BARKERI I YEAST 40.9/i 36 ?Ji 46.4X N 78 Figure 10 Diagon analysis of argininosuccinate synthetase Graphs shoui Diagon analyses for tL barkeri artaG (SARARGP), M vanmelii ar qG (UANARGP) and human (HUMftRGP) ' i i i i O C I'^ - -ixn-e TJGWMCI"'- fc | fc v L 1 fc * fc 6 fc L*: G * I [ » 5 i v r t N A R G F ' , T E X T ' j - {' '(■' knO'- O n .k ■ • ■ I WQH ■ fc A1 . * -IE fc 0 G fc r L ' O ' V W M « . ■ 4 < i ) i. - ?♦* - ’ « [ ' ftif=ftiRGF':i TEXT . l T W I l f 1 i 1 fc v iue 10 Figure h * * G G f ft t f fc G G * i : -lW. t(>4'r - !■ + - r ' 4 O > O ( l ■ I t ’ l ( T y .M llW t- « * 5 : SftfeftRGF . : SftfeftRGF 5 * lt ► . Slit n t ' i j l j i ' u rt T E> : T : E> T 79 8 0 Figure 11 Argininosuccinate synthetase amino acid comparison Asterisks indicate identical amino acid residues in the sequences of argininosuccinate synthetase from different organisms Numbers indicate the positions of the amino acid residues within the alignment. 8t ARGINIHOSUCCINATE SYNTHETASE AMIHO ACID COMPARISON s HUMAN MS&KGSYYLAYSGGLDTS CILVWLKEQ GYD V IAYLANIGQKKDFEEARKXALKLGAKKVF1 M. VAMM1KLII MQEKIAV LAYSGGLDTSCCLKL LEDKYHYK WSVAVDVGQPKD DLKKPKEK AKKLGV a * * «**«•»•»« a t «a a aa aaaaaaa a a a a H. BARKERI MAKK V ALAYSGGLDTEVCIPI LKKKYGYDEWTISVDVGQPE KKIKRADAK AEKISN a a a aaaaaaaaaa aa aaa aaa a aa a aa aaa HUMAN M8SKQSWLAYSGGLDTS C1LVWLKEQ GYD VIAYLANIGQKXDFKRAKXKALKLGAKXVFI 3s KDVSRKFYEKF IWFAIQSEALYEDRYLLGTSLARPCIARKQYEIAQREGAKYVSE1GATGKGNDQ YRFKLGCVSL aaa a aa aaaa aaa aaaa aa a a a aa aaa aaaaaaa aaa LKHYTIDAKEKFAYDYIFRAIKANALYEG YPLETALARPLIAIKIA1LAKXIQADAISBQCTGKGHDQFHFESVIRTK aaaaaaaaaaa aa a aaaa aaa a a aaaaaa a a a aa a aaaaaaaaaa aaa a a KHYTIDAKKKFYKDYVTPLIKANGSYKC YVMGTSIARPL1AKKYVEAARKXGAVALABQCTGKGNDQLRFKAVFRQT aaa a a aa a aaa aaa aa a aa a aaa aa aaaaaaa aaa KDYSKKFVKKF IWFAIQSSALYEDRYLLGTSLARPCIARKQVEIAQHKGAKYVSBGATGKGMDQVRFKLGCYSL *, n ? s AFQIKYI APWRMPEFYHRFKGRNDLMEYAKQBG I P I PVTPKHFWSMDKNU® 1SYEAGILKHPKHQAPFGL YTKTODPA aa a aaa a aaa aaaaaa a a aaaa a a aaaaaa a a aa AFEISIIAPYRDLHLTRTEElq YAKKKGIFIFYDLEKPFStDEHLWIRSIBGGlLKMPHIKTPKECFAWTYDPK aaa a aaaa aa aaaa aaa a aa a aaa a aaaaaa aa a a a a aa a D HDVIAPHRBWLTRKWEID YAEKBQIPVKYTKSKFWSYnEllIlfSRSIKGGRLEDP&FYPPKEIFEWTTSPE aaaa a aaa aaaa aa aaa aaa a a a aa a a a a AFQ IKYIAfYRHFKFYNRFKGRHDLMKYAKQBG IPIPYTPKNPWSMDENU1BISYEAGILKNPKNQAPPGLYTKTODPA s 3 s KAPNTPDILEIEFKRGVPVKVTNYKDGTTBQTSLELFHYLHKVAGKBUYGRIDIVEHRFIGMKRRGIYETPAGTILYH a aaaaa aaaa a a aa aaaa aa a a a aaa aa a a IAKDEEEYVKIKFKAGVPVAINGQKFDAIN VIKEAHKLAGRHGYGRYDIIEDRVLOLKRRENYKCPGAILLIT a a a aaaaa a a a a aa aaaaa a aaaaaaaaa aaaaa a a a KAPEQPRWDIGFEAGYPVSLDGKXM&GYA FYnHHXIAGEHOYGRTDHIKDRYIjGLKAREKYEHPAATVLRQP aaa a a a aaaa a aa aa aaaa a a a a a a aaaaaa KAPMTPDlLKIEFKXGVPYKYTNYKDGTTBQTSLELfTfYLNEVAGKBGVGRlD IYEHRTIGMKSRGI YETPAGTILYH inm AHLDI EAFTMDRKVRKI RQGLGLKFAELVYTGLRFSPECKFYRHC IAKSQKRYEGKYQ YS Y LKQQ YYI LGRESTLBLYN aa a aa a a a a a aa a a aaa a a aa a aaaa aa AIKALEQL YLSRKEL YFKEHYD6KYAD LIYKGLWHEPLRHDLDAFYDCTQKRHNGTYRAKLYKGSLR1 VGRESKCAL Y Q a a a aa aa aa a a aa aaa aa aa aa aaa a a aaaa a a a a aaa H B SPLR N SSLTR SK LK m I VDDQWSKLAYYGLVDKPLYADLHAF IDKSQKR VTGIVKYRLYKGALTILARSSPKALYS a a a aa a aa a a aaaaaa a a a aa aa a aa aa AHLDIEAFTHDRKVRKI KQQUJLKr AEL YYTGLRFSPECSFYRHCIAKSQERYKGKYQYG VLKQQYYILGRESPLSLYN 3 KELV5MNVQGDYKPTDATGFIN1NSLELKEYHRLQSKVTAK KNKVSFEHKDMDQREIV GMVKFHG LOAA1FES [JINK EDLV5FDSQTIDQKGS1 GFAKYBGFQARMYRKVHDKQ a aaa a aa a a EELVSHNVQGDYKPTDATGFININSLRLKEYHRLQSKVTAK Figure 11 32 Figure 12 Amino acid residues conserved between fcL barkeri. M vanmelii and human argininosuccinate synthetase sequences Numbers indicate residue positions within the amino acid alignment. “All sequences" indicates amino acid residues conserved in all three sequences 83 b 10 15 2U 25 50 35 4U 45 bu huMAN & tl, VANN 1 LL11 M„K__V_LAYSGGLDTS_C L Y__V GQ_ED ___ B. VANN1 ELI 1 8 fcl. BARKERI M__K__V_UYSGGLDTS_C L__KY_Y__VV„_VDVGQPE ____ fcl. barkeri 4 Human H__K V_LAYSGGLDTS_CI LKE__GYD_V ______GQ_E__EE_ All Sequences : fc! K V LAYSGGLDTS C L Y V ^60_E____ 10b 111 lib 121 12b 131 13b 141 14& 151 Human 4 B. vannielii ARP_1A.K_.E_A___GA___SHG_TGKGNDQ.RFE A P . l . J A P . R M. VANN I ELI 1 a M. BARKERI ARPLIA_K__E_A___6A_A__HGCTGKGNDQ_KFE_V_R ______IAP_R B. BARKERI i hUMAN ARP _ I A_K _ Vt _ A _ .EGA HG.TGKGNDQ.RFE ______V1A°_R a l l S e q u e n c e s : ARP._IA_K__E_A___GA____HG_TGKGNDQ_RFt ______IAP _ R 17b lei 18b 191 19b 201 20c 211 huMAN Q M. VANN 1 ELI 1 YAK &IPIPV P.S.DEHL S_E_6ILENP____ P fcl. VANN 1 ELI 1 a B. BARKERI YAKE _GIP _ _V KP.S.DEN.W.RSIEGGILE.P ____ P H. BARKER i 4 HUMAN YAK_HGIP__YT.__PWS.DEH S .E .G .L E .P ____ p All Sequences ; YAK_.GiP..Y.___P_S_DEH____S_E_G_LL _P____P F i g u r e 1B 84 acid residues conserved at the N-termini of all three sequences and 18 of 14 residues, from position 124 to 137, are identical in the polypeptides of all three species. A third region of conservation, between positions 172 and 212, does not contain a long block of identical residues but has a high level of conservation in a short stretch of residues Although there is conservation of residues in all three amino acid sequences there is more similarity between the tuio methanogen sequences than between either methanogen sequence and the human sequence There are blocks of amino acid residues conserved in the two methanogen sequences between positions 39 and 45, 66 and 76, and 267 and 292, which are not conserved in the human sequence (Fig 11). There are similarities between the individual methanogen sequences from amino acid residues 267 through 292 and the human sequence but these are not the same sequences in all three species. The similarity between the nucleotide and amino acid sequences of argininosuccinate synthetase of human, £ 1. barkeri and H vannielii was calculated (Table 4) The two methanogen polypeptides have 49.9X identical amino acids and if conservative substitutions are included they are 66.7X similar. The methanogen genes are 52.1^ identical. Although the methanogen polypeptides are equally similar to the human sequence for identical amino acids (38.2X and 38 5Ji> and identical and conserved amino acids (53 7/i and 54 2Ji>, the barkeri nucleotide sequence is more similar to the human DNA sequence than is the DNA sequence of £1. vannielii This is due to the more frequent use of codons that have an A or U in the third base position in £L vannielii. Table 4 Z SIMILARITY BETWEEN ARGININOSUCCINATE SYNTHETASE OF DIFFERENT SPECIES V. SIMILARITY BETWEEH- DNA AMINO ACIDS AMINO ACIDS t CONSERUED SUBSTITUTIONS HUMAN & U. UftNHIELlI +2 BX 38.23! 53.73! tt. UANNIELII t tl BARKERI 5P \Y. 49 93! 66.7% £1. BARKERI 4, HUMAN 47 .27. 38.5 7. 5427. a t in 8 6 Construction of an evolutionary tree for ar^imnosuccinate synthetase genes. An alignment of the closely related DNA sequences of M_ vannielii 0RF39?, tL barkeri 0RF396 and the human argininosuccinate synthetase cDNA was made using the DNAstar ALIGN computer program. DNA sequences of tuio human argininosuccinate synthetase pseudogenes, HumASPsl and HumASPs3. were included in the computer generated alignment The alignment of five argininosuccinate synthetase sequences was used to construet the most parsimonious evolutionary tree using the IBM computer program PAUP, version 2 91 <44, 126) In the construction of a tree, the PAUP program determines the numbers of differences between the aligned DNA sequences and quantitates these values as a measure of evolutionary distance. The computer program then generates all possible unrooted trees that could represent the evolution of the sequences The alignment of five taxa DNA sequences) will generate 15 possible unrooted trees The total length of each possible tree is then determined and the most parsimonious tree, i.e. the shortest passible tree accounting for all the differences between the aligned sequences, is chosen as the most probable evolutionary tree. Should two trees be of the same length both are presented as equally probable To obtain a rooted tree, one of the aligned sequences must be defined as the ancestral sequence The evolutionary tree generated from the arginmosuccinate synthetase genes of the human cDNA HumASPsl , £L barkeri Figure 13 Evolutionary tree for argininosuccinate synthetase DNA sequences encoding argininosuccinate synthetase from human cDNA (hum), human pseudogene 1 (psl), human pseudogene 3 hum (1] psl (3) ps3 (2) r msb (4) -6 L mcv (5) F iq u re m a 89 between two sequences tL barkeri and tL vannielii DNA sequences were found to be more closely related to each other than to the human argininosuccinate synthetase sequences and represent an outgroup relative to the human sequences Divergence of the methanogemc bacteria from each other occurred before the divergence of either pseudogene f rom the expressed gene in humans. More dif f ere nee s have occured between the tL barkeri and tL vannielii sequences than between any of the human sequences. The tree indicates that the HumASPs3 pseudogene was produced by reverse transcription of human argininosuccinate synthetase mRNA before the production of the HumASPsl pseudogene. The HumASPsl sequence is most closely related to the cDNA sequence of the expressed gene. This agrees with a previously reported evolutionary tree constructed for argininosuccinate synthetase pseudogenes (94) The evolutionary tree produced by the PAUP computer program for the argininosuccinate synthetase genes adds further support for an ancient divergence of the archaebacteria from the eukaryotes It also indicates that the methanogen and human ar gininosuccinate synthetase genes are homologous, if homology is defined as having shared a common ancestor. Occurrence of the 14 bu and 29 bo direct repeats in M. barkeri MS and M. barkeri 227. The function(s) of the 6 x 14 bp direct repeats and 3 x 29 bp direct repeats in the intergenic region separating 0RF398 and 0RF396 in the tL barkeri derived DNA cloned in pET371, is unknown Hybridization of r adioac tively labeled probes to Hindi 11 digested chromosomal DNAs of tL barkeri MS and tL barkeri 2 2 7 was used in attempts to determine the number each repetitive 90 sequence per genome. Oligomer DNAs (kindly synthesized and provided by Lisa Alex, M I T ) unth either the sequence AAAATCCAACT66T (14-mer), present in five of the 6 x 14 bp repeats or the sequence AA 6TTCAASTATTCATCAATCTC (29-mer), the exact sequence of one of the 29 bp repeats and differing at only one or three positions from the other two repeats, were labeled at their 5' ends using « ^P-ATP. Selected fragments from Rsal digested pET371, radioactiuely labeled at the 5' ends *jp with a P-ATP, gave probes, one 290 bp long encoding the 6 x 14 bp direct repeats and a second 216 bp long encoding the 3 x 29 bp direct repeats The complete plasmid pET37t, nick translated using oc ^P-dATP as the label, was also used as a radioactive probe. Each probe was hybridized to Hindlll digested M. barkeri MS DNA and Hindi 11 digested tL barkeri 227 DNA immobilized either in dehydrated agarose gels or bound to a Zeta Probe membrane The radioactively labeled 14-mer and the 290 bp Rsal fragment of pET371 hybridized to the same, single DNA fragment in Hindlll digested tL barkeri MS DNA or in Hindlll digested M. barkeri 227 DNA The Hindlll fragments from the two strains of tL barkeri were not however the same size The 14-mer and 290 bp Rsal probes hybridized to a Hindlll fragment in tL barkeri MS DNA that was the same size as the Hindlll fragment of tL barkeri MS DNA cloned in pET371. The DNA fragment in HindITI digested tL barkeri 227 genomic DNA, which hybridized to these probes, was larger The block of 14 bp direct repeats must therefore occur once in the genomes of both tL barkeri MS and M. barkeri 227 but due to changes in the location of sites for the restriction enzyme Hindlll in the genomes of the two strains of tL barkeri the 14 bp repeats are within a larger 91 Figure 14 Autoradiogram of DNA DNA hybridizations between M barkeri DNA and radioactioely labeled pET37l probes. A tt barkeri MS DNA (track 2>, tL barken 227 DMA (track 3> and pET371 DNA (track 4), digested with H indl l l . hybridize to the radioactiwely labeled 29 base oligomer probe. pBR322 DNA (track 1) does not hybridize with the 29 base oligomer probe The 14 base oligomer, 290 bp Rsal DNA fragment and 210 bp Rsal DNA fragment probes also hybridize with Hindlll digestions of these DNAs to gioe the same pattern (not shown) B Hindlll digestions of pBR322 DNA (track 1), tL barkeri MS DNA (track 2), tL barken 227 DNA (track 3> and pET371 DNA (track 4) hybridize to nick translated pET3?l DNA A. B. Hind HI Hind JR 1 2 3 4 pBR322 UJ F i g u r e 14 ru 93 Hindlll fragment in the genome of CL barkeri 227 These data do not address the question of the number of 14 bp repeats in the genomic DNAs of £1. barkeri MS and CL barkeri 227 The r adioactively labeled 29-mer DNA and the 210 bp Rsal fragment of pET371 hybridized to the same, single DNA fragment in HindUl digested CL barkeri MS DNA or in Hindlll digested M. barkeri 227 DNA DHA fragment probes The 29 bp repeats must therefore occur only once in the genomes of CL barkeri MS and CL barkeri 227 and are closely linked to the 14 bp repeats in both genomes Nick translated pET371 DNA, also hybridized to the Hindlll DNA fragments of both CL barkeri MS and CL barkeri 227 which hybridized to the oligonucleotide probes. The two DNA fragments produced by Hindlll digestion of pET371 DNA (homologous to the probe), and the single fragment produced by Hindlll digestion of the vector pBR322 DNA, hybridized as expected to the nick t ranslated pET37l DNA PBR322 DNA DISCUSSION. The purpose of this study uuas to determine and analyze the nucleotide sequence of tuuo DNA fragments, one cloned from M barkeri and one from tL vannielii which, when expressed in E. coli or i . snhtilis. complement a deficiency in argininosuccinate synthetase. Determination of these sequences was expected to answer the following questions. 1) Which DNA sequences encode the polypeptide Expression of methanogen derived DMA in two eubacteriai sy stem s. The ability of the methanogen derived DNAs to complement auxotrophic mutations in two eubacteriai species indicated that the methanogen DNA sequences must contain three essential elements. First, a sequence that encodes either a suppressor tRNA, or a polypeptide capable of functioning in E. coli and ]L snhtilis to suppress a deficiency of argininosuccinate 94 95 synthetase This did not necessarily indicate that the methanogen gene encoded an argininosuccinate synthetase enzyme. The methanogen polypeptide might suppress the argininosuccinate synthetase deficiency in £» coli and JL subtilis by providing an alternative means of arginine independent grouith. The second essential element must be the presence of DNA sequences recognized in the two eubacteriai species as transcription initiation sites. The resulting mRNAs must carry the third essential regulatory element, a sequence capable of hybridizing with the 3' end of the 16S rRHA molecules in the eubacteriai ribosomes needed to direct translation initiation. The promoters used in L coli and I. subtilis are probably fortuitous sequences since H. vannielii and tL barkeri appear to have completely different regulatory sequences for transcription initiation from eubacteria and each other. Previously reported H vannielii genes his A (26, 145>, hisl (7), mcrBPCGA (25), 5S rRNA <139), and various tRNA genes (139, 141) have been c o m p a r e d and conserved upstream sequences proposed as promoters <53, 138, 141). Footprinting experiments have begun to elucidate the binding sites for tL vannielii DNA-dependent RNA polymerase (18). In the case of tL barkeri. the only genes so far sequenced are the argS reported here and the mcr operon <15, 49, 50). Sequences necessary for transcription initiation in the Methanomicrohiales order of the methanogens remain uninvestigated. The gene products of £1. barkeri 0RF396 and EL vannielii 0RF397 uiere determined to be responsible for complementation of the argininosuccinate synthetase deficiency in both L coli and i, snhtilis The amino acid sequences of these gene products are closely related, they differ in length by only one residue, contain 49. 9¥. identical amino acids, and are 66,7Ji 9 6 identical if conservative substitutions are included. Their obvious relationship to the human argininosuccinate synthetase, determined from a cDNA clone <13), leads to the conclusion that these ORFs represent the argininosuccinate synthetase genes of M barkeri and tt vannielii. and all three genes are derived from a common ancestral gene. The methanogen polypeptides must have remained sufficiently similar to the eubacteriai polypeptides to be functionally active in L coli and i, subtilis. Comparison to the complete £. coli ar gininosuccinate synthetase, uihich as yet has been only partially sequenced <28), mould be valuable in determining the extent of conservation between the archaebacteria and eubacteria as opposed to that between the archaebacteria and eukaryotes. Sequences upstream of 0RF396 and 0RF397 resemble the -35 consensus sequences for £. coli promoters but there is little resemblance in either intergenic region to the -10 consensus sequence 0RF397 sequence, TTGflTA matches the £. coli -35 consensus sequence (TTGACA) at five of six positions, uihereas GTCACA 41 bp upstream of d. barkeri 0RF396 matches the EL coli sequence at four of six positions. Sequences capable of hybridizing mRNA to the 16S rRNA of EL coli and JL subtilis ribosomes and to the appropriate methanogen rRNAs are present upsteam of the translation initiation codons of both ORFs. Their presence presumably accounts for the translation of the methanogen mRNAs in the eubacterial cells L*Jhat remains unexplained however is the synthesis of both a 51,000 and 49,000 dalton polypeptide in minicells containing pET371 or pET821, apparently encoded by one and the same gene. The mRNAs synthesized in eubacteriai cells may have initiation sites different from the mRNAs synthesized in methanogens In 9 ? the eubacteriai cells there may be tuio sites of initiation of transcription and translation. Both 0RF396 and 0RF397 have in-frame TAA termination codons 15 bp upstream from their initiation codons which would seem to preclude a premature start to the translation of the mRNA at an alternative AUG (methionine) or 6UG (valine). Downstream of the 0RF396 initiation codon, within a sequence of amino acid residues conserved in both methanogens and in the human sequence, there is however a sequence TT6 ACA which is a perfect match to the —35 £, coli consensus promoter sequence The tL vannielii DRF397 sequence at the equivalent position is TT 6 ATA and it is followed 3£ bp downstream by a sequence TATAAT that matches the E» coli -10 consensus sequence at all positions. These or other such sequences internal to 0RF396 and 0RF397 could be recognized by the E. coli RNA polymerase and cause the transcription and translation of a shorter mRNA resulting in the smaller polypeptide however there are no AUG (methionine) or 6UG (valine) initiation codons within the immediate downstream region of these possible E coli promoter sequences. Sequencing deletion derivatives of plasmid pET371 <90, 91) to determine the extent of the deletions might aid in elucidating why two polypeptides are encoded by one ORF. An alternative method, to confirm or negate the misstart proposal, would be to isolate the two polypeptides and determine their N-terminal polypeptide sequences The appearance of two polypeptides could be due to the experimental conditions of the SDS-PA 6E gel If the disulfide bonds are not fully reduced by p — mercaptoethanal in the loading buffer, polypeptides may not completely unfold. The purpose of the SDS in the gels is to bind to polypeptides and minimize their native charge differences, all polypeptides 9 8 migrate as anions as a result of the complex unth SDS Nhen polypeptides are completely denatured, the SDS binding holds them in a rigid structure and their separation through the SDS-polyacry 1 amide gel is dependent upon size. Incomplete denaturation of polypeptides restricts binding of SDS ions and the charge of the polypeptides becomes a factor in their migration. Polypeptides that are complexed uhth fewer SDS ions are less positively charged and will consequently migrate at a slower rate than polypeptides of equal molecular weight that are fully denatured Two bands could therefore result if they bind different amounts of SDS even though the bands represent the same polypeptide. Sequences needed for expression of genes in methanogens. Expression of 0RF396 and 0RF397 in cells of the methanogens U. barkeri and EL vannielii respectively, presumably does not result in two polypeptides. Correct transcription and translation start signals should result in the synthesis of only one transcript and one polypeptide. The tL vannielii 0RF397 upstream DNA sequences are similar to conserved sequences proposed as promoters far other methanococcal genes (53). Footprinting and SI mapping experiments will be necessary to determine the sequences which bind tL vannielii DNA-dependent RNA polymerase and the sites of transcription initiation preceding 0RF396 and 0RF397 Sequences found upstream of tL harkeri 0RF396 bear more resemblance to the promoters proposed for tL halobium and tL smithii (53) than to the putative promoters for methanococcal genes. This is in line with established evolutionary relationships within the archaebacteria. The genus hhthanosarcina is within the order Methanomicrohiales which is more closely related to the 99 halophiles such as tL hatohium than are the otner two orders of methanogens, the Methanobac.teriales and the Methanococcales (21. Within the methanogens however the Methanomicrobiales and Methanohacteriales. which includes tL smithii. are thought to be the more closely related <21 Until more DNA sequences From Methannsarcina and other members of the Methanomicrobiales. such as Methanospirilium hunqatei, are available, proposals for promoters in this order would be premature M. barkeri direct repeats. The function of the three 29 bp direct repeats upstream of M harkeri 0RF396 is unknown. They are not artifacts of cloning as they exist in two strains of tL barkeri. MS and 227. A general promoter function seems unlikely as they occur only once in the genomes of tL barkeri If the consensus sequence AAGTTYAAGTAYTCATCAATCTYAA6TTY were similar to a tL barkeri promoter consensus sequence, multiple fragments would have hybridized to the probes. The 29 bp direct repeats might be involved in the regulation of expression of argininosuccinate synthetase in tL barkeri Neither tL barkeri 0RF396 nor tL vannielii 0RF397 were preceded by obvious attenuator structures and there was no resemblance in the upstream DNA sequences to the L coli "arg box". The 29 bp direct repeats might therefore function as a binding site for a regulatory molecule to regulate the expression of the argininosuccinate synthetase gene. If, however, the synthesis of all enzymes involved in the biosynthesis of arginine were coordinately controlled in H barkeri. then similar sequences would be expected upstream of other genes involved in arginine biosynthesis. Hybridization of the radioactive probes to a single lee Hindlll fragment of £1. barkeri DNA also therefore argues against this possibility unless all the arq genes are linked in this organism. The £9 bp direct repeats shoui sequence similarity to seven 22 bp direct repeats located at the origin of replication for plasmid R 6K <122> The sequence AAC6 TACTAA6CTCTCAT6 TTT matches the 29 bp direct repeats at 17 of 22 positions if two gaps are included to maximize identity. One could speculate therefore that the 29 bp repeats might represent an origin of replication in tL barkeri Advances in the molecular biology of tL barkeri will have to be made before such a possibility could be tested by cloning the direct repeats, with a selectable trait incapable of autonomous replication, and transformation of the resulting plasmid into M. barkeri. The function of the 14 bp direct repeats also remains unknown Their location directly downstream of 0RF398 suggests a rale in termination of transcription in TL barkeri however, there are no similar DNA sequences downstream of 0RF396. If these repeats had a general role in transcription termination, throughout the genome, more than one Hindlll digested DNA fragment would have hybridized with the radioactive probes containing the 14 bp direct repeats. A role in the coordination of regulation of expression of arginine biosynthetic genes is possible, but again more than one Hindlll digested DNA fragment would have been expected to have hybridized to the radioactive probes if the repeat sequences played a coordinating role in the control of expression of arq 6 genes The lack of hybridization of multiple bands to the radioactive probes representing the 14 bp and 29 bp direct repeats may also be due to experimental conditions. Conditions 101 were chosen to give hybridization of the probes to complimentary sequences without mismatches. It is possible that the repeats resemble control elements scattered thoughout the genome but, due to mismatches in their sequences, these have not hybridized to the probes under the conditions used. Codon usage and gene organization in M barkeri and M vannielii Codon usage in the two tL uannielii ORFs of pET821 is very similar to that previously reported for other methanococcal genes (53); namely preferential usage of codons which have an A or li in the third position and infrequent use of codons containing a CG dinucleotide Although the tL barkeri ORFs of pET371 haue some similarities in codon usage with the methanococcal genes there is also similarity to codon usage in H. halohium < 11, 36. 53). Neither tL barkeri nor H. halobium ORFs contain the glutamine codon CAA or leucine codon CUA, and both use the arginine codon AGA very infrequently This is not true for tL uannielii. The codon usage of the tL barkeri mtc. operon is similar to that reported for tL uannielii and tL thermoautotrophicum am operons <14) with a preference for a C in the wobble position. The arginine AGA codon is used more frequently in the £L barkeri mcr genes than in the araG and carB genes reported here. Use of the glutamine CAA codon is infrequent in the tL barkeri men operon and of a total 108 leucine codons only 2 are CUA codons Therefore some of the observations made for the codon usage of tL barken genes sequenced in this study are also true for the genes of the M. barkeri mcr operon. The similarity of codon usage may be indicative of the closer relationship between the extreme halaphiles and the Methanomicrobiales than between the 102 halophiles and the Mcthanobacteriales on the Mehhanococcales More ORFs from HalQbactermm s p , Methanosarcina s r . and Me thanospir ilium sp. need to be analyzed to determine if there are consistent observations for codon usage The ORFs encoded by tL vannielii DNA in pET821 and M barkeri DNA in pET371 do not appear to be organized in operons. Distances of 390 bp and 130 bp separate ORFs in the tL tiarkeri and tL vannielii DNAs respectively. These intergenic regions contain sequences that are similar to proposed promoter sequences Previously sequenced genes from tL vannielii <7, 85, 26), tL voltae <26, 72, 145), tL thermolithotroohicus <136) and tL smithii <51, 52) appear to be organized unthm operons. This feature therefore appears to be a feature common to methanogen and eubacteriai genomes. The L col* argG gene is, in fact, transcribed separately from arq genes which encode other enzymes involved in arginine biosynthesis in L coli which are arranged in operons. Independent control of the expression of argininosuccinate synthetase may therefore be a requirement in both eubacteria and archaebacteria for, as yet, unknown reasons. In £, coli the ar gfi gene is physically separated from the carAB operon whereas these genes are adjacent in the M barkeri genome. This clustering of genes involved in the arq biosynthetic pathway may be significant in the methanogen, although the clustering is apparently not as an operon. Intergenic regions in both tL barkeri and tL vannielii have higher A+T contents than sequences encoding polypeptides This has been found in the intergenic regions of all methanogens studied. There is not however such a dramatic increase in the A + T content in the intergenic regions in organisms that have a high overall molX 6+C such as tL thermoautotrophicum <49 7 X 6*0 <63). The increased frequency of A+T bases in the 1 0 3 intergenic regions of methanogen genes probably mas fortuitous for the expression of these genes in L coli Sequences necessary for transcription initiation in E. coli are A^T rich and therefore occur more frequently by chance in the intergenic regions between methanogen genes. Evolutionary relationship of argininosuccinate synthetase genes. Comparison of homologous sequences can be used to determine the evolutionary relationships Until more 16S rRNA sequences from thf? archaebacteria are completely determined, the best available measure of the evolutionary relattonships are comparisons of oligonucleotides produced by RNase T1 digestion of the 16S rRNAs from the various organisms <2>. Based on oligonucleotide catalogues, £1. vannielii and EL barkeri were determined to have an association coefficient of 0.20 and were assigned to different orders, the Methanococcales and Methanomicrobiales respectively The DMA sequences encoding argininosuccinate synthetase from these two methanogens are 52.IX identical. Previous comparisons of homologous genes have indicated a correlation between the X similarity of DNA sequences and the S^g values determined from 16S rRNA oligonucleotide comparisons <2) The hisA genes from tL vannielii and tL voltae. two organisms within the same genus with a value of 0 60, have DNA sequences which are 66 0X identical (26> Two rucE genes from tL smithii and tL thermoautotrophicum. species in different genera within the same family, with a S^jj value of 0.49, have DNA sequences that are 53.0X identical <51). The results show there is also a correlation between the S^g values and value for X similarity of DNA sequences for the aroS genes of tL vannielii and tL barkeri. 104 Construction of an evolutionary tree using the DNA sequences of tL, uannielii. tl* barkeri and human cDNA encoding argininosuccinate synthetases provides the following observations The two methanogen argS sequences are more closely related to each other than either is to the human cDNA sequence or the human pseudogenes. Divergence of the two methanogen argfi genes however occurred prior to the divergence of the human sequences, and all the present day sequences are derived from a common ancestral gene A better indication of the evolutionary relationships between organisms is obtained when amino acid sequences are used to construct trees. Amino acid comparisons are however only more revealing if functional polypeptides are being studied. The inclusion of human argininosuccinate synthetase pseudogenes in this comparison demanded that DNA sequences be compar ?d. Pseudogenes are not expressed and are therefore not subject to the same selective pressures An evolutionary tree, constructed previously (94), using the DNA sequences of seven pseudogenes and the functional human argininosuccinate synthetase gene, is in good agreement with the tree presented here (Fig. 131. HumASPs3 was calculated to have diverged from the expressed gene approximately 55 million years ago and HumASPsl to have diverged approximately 48 million years ago (94). These calculations cannot reasonably be extrapolated however to determine the probable time of divergence of the two methanogens because of the uncertainty of different mutational rates for DNA sequences with and without the selective pressure of coding for a functional gene product. A more complete evolutionary tree for argininosuccinate synthetase would include the two methanogen AC 90 sequences described in this study, tL voltae argG (145), tL 1 0 5 thermalithotr ophicus acgG Introns within the human argininosuccinate synthetase gene. It has been determined that the human argininosuccinate synthetase gene contains at least 13 exons <39, 40). Ten intron-exon boundaries of seven introns have been sequenced <39). Two of the introns, intron 1 and intron 2, are 67 bp and 5 bp respectively, upstream of the ftTG initiation codon of the argininosuccinate synthetase gene. Of the five introns determined within the gene, four lie between two codons and are classified as phase 0 introns Cintrons 3, 4, 7 and 8) <117) Intron 9 lies between the first and second bases of a codon and so is classified as a phase 1 intron. All the intron-exon boundary sequences resemble the consensus splice sites for nuclear mRNA precursors 5,-A/'CA6GtGTft>'6AGT. . intron . The methanogen DNA sequences that align with the intron-exon boundary sequences of the human argininosuccinate synthetase gene show similarity to the consensus splice site sequence A similarity is seen when the methanogen sequences are compared to the sequences of either the 5' or the 3' splice site sequences. This may however be a result of conservation of amino acid sequences essential to the function or folding of the polypeptide. Introns 3, 4, and 8 are located in regions of 1 0 6 the polypeptide inhere the amino acid residues of the three argininosuccinate synthetase sequences are not conserved. Intron 7 however, lies within a highly conserved region of amino acid residues and intron 9 is located in the first codon of the amino acid sequence 6 UPU conserved in the argininosuccinate synthetase of all three species Carbamvl phosphate synthetase in M. barkeri. M barkeri 0RT398 was determined to be the carboxyl terminal region of a polypeptide that is homologous to the large subunits of £. coli, £L cerevisiae and rat carbamyl phosphate synthetase. In E» coli and S. ceremsiae, the genes encoding the carbamyl phosphate synthetase large subunit (carB and CPAS respectively) are located downstream of the genes encoding the small carbamyl phosphate synthetase subunit (carA and CPA1) <80. 95). These genes are cotranscribed in £. coli as an operon In the rat the two genes have become fused resulting in one gene (CPS1) encoding both subunits (98) Internal homology within the carbamyl phosphate synthetase large subunit indicates that a tandem duplication of an ancestral gene occurred prior to the divergence of the eubacteria and eukaryotes <80, 95). The length of the carB ORF in tL barkeri was insufficient to determine if there is an internal duplication within the methanogen gene If the carbamyl phosphate synthetase gene in tL barkeri does not have interna) homology it would suggest that the gene duplication event occurred after the divergence of the archaebacteria but before the divergence of the eubacteria and eukaryotes. Comparison of the E_ coli. cerevisiae and rat carbamyl phosphate synthetase amino acid sequences indicates that the amino-termini of the polypeptides are more conserved than the 107 carboxyl-termini <80, 95) When the tl. barkeri carB carboxyl-terminus is compared to the amino- and carboxyl-termini of £. coli carB and S» cerevisiae CPAS there is more similarity between the carboxyl-termini of the polypeptides The lack of conservation between the carboxyl-terminus of 0RF398 and the more conserved ammo-termini suggests that there may be a more conserved amino—terminal region of the tL barkeri carB gene which would be more similar to the amino-termini of the £, coli carB and S. cerevisiae CPA2 genes If this were true, then the duplication of the ancestral gene occurred before the divergence of the archaebacteria from the eubacteria and eukaryotes Use of the pET371 DNA as a radioactive probe should identify the adjacent DNA sequences in a DNA library and the rest of the tL barkeri carB gene could be cloned and sequenced to determine the presence or absence of an internal duplication Comparison of the carB genes of £_ coli and tL barkeri indicated the DNA sequences available were 5B.3X identical whereas the 16S rRNA oligonucleotide catalogue comparisons determined a value of 0.10 for £, coli and tL barkeri These calculations show there is a trend between the values and V. similarity as was observed with the results for arqfi. hisA and purE genes as stated above. 1. n. vannielii 0RF397 and tL barken 0RF396 encode argininosuccinate synthetase polypeptides capable of complementing the £* coli ac.gG36 and JL subtilis angA 2 auxotrophic mutations 2. Argininosuccinate synthetase genes in tL barkeri. M uannielii and human have diverged from a common ancestral gene and show extensive conservation of amino acid sequences. The methanogen sequences are more closely related to each other (DNA: 52 IY. identical; amino acids: 66 .7/i identical) than either is to the human 54.2Ji identical amino acids). 3. The comparison of argininosuccinate synthetase and carbamyl phosphate synthetase genes supports the established phylogenetic relationship for tL barkeri and tL vannielii in separate methanogenic orders. Similarity between upstream sequences of tL barkeri and tL halobium genes and codon usage within the genes supports the established close phylogenetic relationship of the Mathanomicmbiales and the extreme halophiles. 4 The region upstream of 0RF397 in tL vannielii contains sequences similar to those suggested as methanococcal promoters, whereas sequences upstream of tL barkeri 0RF396 188 109 show more similarity to sequences suggested to be promoters in tL haUttnum and £L smithu 5 t ± barken 0RF398 encodes the carboxyl terminus of the large subunit of carbamyl phosphate synthetase, additional sequencing is needed to determine if this gene has an internal duplication. 6 The 29 bp and 14 bp direct repeats encoded by the hi harken derived DNft in pET371 are not artifacts of cloning The blocks of repeats occur once in the genomes of both H barkeri MS and tJL harkeri 227. The number of the repeats and their function(s) remain to be determined. bibliography 1. Ankel-Fuchs, D , R Hi* s ter, E. Hdrschel, S.P.J. Albracht and RK Thauer 1986. Structure and function of methyl coenzyme M reductase and of F43B in methanogemc bacteria. Syst. Appl. Microbiol ?■ 383-387 2 Belch, WE , G.E. Fox, L J Magrum, CR Woese and R.S Wolfe 1979 Methancgens- Reevaluation of a unique biological group Microbiol Rev 43> 260-296. 3. Balch, W.E. and R.S. Wolfe. 1979 Specificity and biological distribution of coenzyme M (2-mercaptoethanesulfonic acid) J. Bacteriol. 137: 256-263. 4. Barest, L. and 6. Bertani. 1984 Isolation of a bacteriophage for a methanogenic bacterium. pl33 in Abstracts of 84th Annual Meeting of the American Society for Microbiology. 5. Beaudet, A.L., T-S. Su and W E. O’Brien. 1982. Dispersion of argininosuccinate synthetase-like human genes to multiple autosomes and the X chromosome. Cell 30= 287-293. 6. Beckler, 6.S., L.A. Hook and J.N. Reeve. 1984. Chloramphenicol acetyltransferase should not provide methanogans uiith resistance to chloramphenicol. Appl. Env. Microbiol. 47= 868-869. 7 Beckler, 6 S. and J.N. Reeve. 1986. Conservation of primary structure in the hisl gene of the archaebacterium, Methanococcus vannielii. the eubacterium Escherichia coli, and the eucaryote Saccharnmvcts cerevisiae. Mol. Gen. Genet. 204: 133 140. 8. Bencini, D.A., J.E. Houghton, T A. Hoover, K.F Foltermann, J.R, Wild and G.A O’Donovan 1983. The DNA sequence of arol from Escherichia coli K12. Nucleic Acids Res 11: 8509-B518. 9. Berman, M.L., L.W. Enquist and T J Silhavy. 1982. In 'Advanced Bacterial Genetics 1982”. Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724. 10. Best, A.N. 1978. Composition and characterization of tRNA from Methanococcus vannielii J. Bacteriol. 133: 240-258 110 Ill 11. Betlach, M., J. Friedman, H.W. Boyar and F Pfeifer. 1984. Characterization of a halobacterial gene affecting bacterio-opsin gene expression. Nucleic Acids Res IS 7949-7959. 12 Blaut, M. and 6. Gottschalk. 1985. Evidence for a chemiosmotic mechanism of ATP synthesis in methanogenic bacteria. Trends Biochem. Sci. 18: 486-489. 13 Bock, H-G.O., T-S. Su, W E O'Brien and AL Beaudet 1983 Sequence for argininosuccinate synthetase cDNA Nucleic Acids Res 11: 6585-6512 14. Bokranz, M., 8. taumner, R. Allmansberger, D. Ankel-Fuchs and A. Klein. 1987. Cloning and characterization of the methyl coenzyme M reductase genes from Methanobacterium thermoautotrophicum. In Press 15. Bokranz, M. and A. Klein 1987 Nucleotide sequence of the methyl coenzyme M reductase gene cluster from Methanosarcina barken. Nucleic Acids Res. 15: 4358-4351. 16. Bollschuieiler, C. and A. Klein. 1982. Polypeptide synthesis in Escherichia coli directed by cloned Methanobreuibacter arboriohilus DMA Zbl. Bakt. Hyg., I. Abt. Orig C3: 181-189. 17. Bouvier, J., J - C. Patte and P Stragier 1984 Multiple regulatory signals in the control region of the Escherichia coli carAB operon Proc. Natl. Acad. Sci. USA. 81: 4139—4143. 18 Brouin, J W, M. Thomm, G.S. Beckler, 6 . Frey, K.O. Stetter and J.N. Reeve. 1987 Archaebacterial RNA polymerase binding sites and transcription of the his A gene of Methanococcus vannielii. In Press. 19. Cech, T.R. 1983. RNA splicing: three themes with variations. Cell 34. 713-716. 28. Cech, T.R. 1986. The generality of self-splicing RNA. relationship to nuclear mRNA splicing. Cell 44- 207-210. 21. Cheeseman, P., A Toms-Wood and R.S. Wolfe. 1972. Isolation and properties of a fluorescent compound, factor F420 from Methanobacterium strain M.o.H J. Bacteriol. 112. 527-531. 22. Chou, P.Y and 6 .D Fasman 1974 Conformational parameters for amino acids in helical, p-sheet, and random coil regions calculated from proteins Biochemistry 13: 211-222. 23. Chou, P.Y. and 6.D. Fasman. 1974, Prediction of protein conformation Biochemistry 13= 222-245. 112 24. Chou, P.Y. and 6.D, Fasman. 1978. Empirical predictions of protein conformation. Ann Rev. Biochem. 47= 251-276 25 Cram, D.S., BA Sherf, R.T. Libby, R.J. Mattaliano, K.L. Ramachandran and J.N. Reeve. 1987. Structure and expression of the genes, mcrBDCBA. uihich encode the subunits of component C of methyl coenzyme M reductase in Methanococcus vannielii Proc. Natl. Acad. Sci. USA 84= 3992-3996 26. Cue, D., G.S. Beckler, J.N. Reeve and J. Konisky. 1985. Structure and divergence of two archaebacterial genes. Proc. Natl. Acad. Sci USA. 82= 4287-4211. 27. Cunin, R., T, Eckhardt, J. Piette, A. Boyen, A. Pidrard and N. Glansdorff. 1983. Molecular basis for modulated regulation of gene expression in the regulon of Escherichia coli K12. Nucleic Acids Res 11= 5087-5019. 28. Cunin, R., N. Glansdorff, A. Pidrard and U. Stalon. 1986. Biosynthesis and metabolism of arginine in bacteria. Microbiol. Rev. 58= 314-352 29 Daniels, C.J., R. Gupta and W.F. Doolittle. 1985. Transcription and excision of a large intron in the tRNA^r^ gene of an archaebacterium Halobacterium volcanii J. Biol. Chem. 268= 3132-3134 30. Daniels, C.J., J.D. Hofman, J.G. MacMilliam, W.F. Doolittle, C.R. Moese, K.R Luehrsen and G.E. Fox. 1985. Sequence of 5S ribosomal RNA gene regions and their products in the archaebacterium Halobacterium volcanii. Mol. Gen. Genet. 198= 278-274 31. Davis, R.H. 1986. Compartmental and regulatory mechanisms in the arginine pathways of Neurosoora crassa and SBCCJiaroinyCBS cereuislae. Microbiol. Rev. 50= 288-313. 32. Davis, R.W , 0. Botstein and J.R. Roth. 1980. In 'Advanced Bacterial Genetics". Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 33 Dennis, P.P. 1986 Molecular biology of archaebacteria J Bacteriol. 168 471-478 34. Doolittle, R.F. 1986. Of URFs and 0RFs= a pimer on how to analyze derived amino acid sequences. University Science Books. Mill Ualley, CA 94941 35. Douglas, C., F Achatz and A. Back. 1980. Electrophoretic characterization of ribosomal proteins from methanogenic archaebacteria. Zbl. Bakt. Hyg., 1. Abt. Orig. Cl= 1-11. 113 36. Dunn, R., J. McCoy, M. Simsek, A. Majumdar, S.H. Chang, U L. RajBhandarg and H.G. Khorana. 1981. The bacteriorhodopsin gene. Proc Natl Acad. Sci USA. 80: 6744-6748 37. Eirich, L.D., G.D. Uogels and R.S. Wolfe. 1979. Distribution of coenzyme F420 and properties of its hydrolytic fragments. J. Bacteriol 140: 20-27 38. Feng, D F., M.S. Johnson and R F. Doolittle. 198S. Aligning amino acid sequences: comparison of commonly used methods. J. Mol Evol. 21: 112-125. 39 Freytag, SO, A.L Beaudet, H-6.0 Bock and W.E O’Brien 1984 Molecular structure of the human argininosuccinate synthetase gene: occurence of alternative mRNA splicing Mol Cell Biol. 4: 1978-1984 40. Freytag, S.O., H-6.0. Bock, A L. Beaudet and M.E. O’Brien. 1984 Molecular structures of human argininosuccinate synthetase pseudogenes; evolutionary and mechanistic implications J. Biol. Chem. 259: 3160-3166. 41. Fox, G.E., K.R Luehrsen and C R. Woese 1982 Archaebacterial 5S ribosomal RNA. Zbl. Bakt. Hyg., 1. Abt. Orig C3 330-345. 42. Fox, G.E., L.J. Magrum, W.E. Balch, R.S. Wolfe and C.R. Woese. 1977. Classification of methanagenic bacteria by 16S ribosomal RHA characterization. Proc. Natl. Acad. Sci USA. 74: 4537-4541. 43. fiarnier, J., D.J Osguthorpe and B. Robson 1978. Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins. J. Mol. Biol. 120: 97-120. 44. Goldman, 0., P R. Giri and S.J. O’Brien. 1987. A molecular phylogeny of the hominoid primates as indicated by two-dimensional protein electrophoresis. Proc. Natl. Acad. Sci. USA. 84: 3307-3311 45. 6ropp, F , W D Reiter, A Sentenac, W. Zillig, R. Schnabel, M. Thomm and K.O Stetter 1986 Homology of components of DNA-dependent RNA polymerase of archaebacteria, eukaryotes and eubacteria. Syst Appl. Microbiol. ?■ 95-101. 46. Gunsalus, R.P and R S. Wolfe. 1978. Chromophoric factors F342 and F430 of Methanobacterium thermoautotroohicum FEMS Lett 3= 191-193. 114 47. Gupta, R. 1984. Halobacterium uolcanii tRNAs: identification of 4l tRNAs covering all amino acids and the sequences of 33 class I tRNAs. J Biol Chem 859 9461-9471 48. Gupta, R. and C.R. Uoese. 1980. Unusual modification patterns in the transfer ribonucleic acids of archaebacteria. Curr Microbiol. 4: 845-249. 49. Hamilton, P.T. 1982. The cloning and expression of archaebacterial DNA from methanogens in E sc her ichia coli M.S. Thesis, The Ohio State University, Columbus, Ohio 5B. Hamilton, P.T and J.N Reeve. 1964. Cloning and expression of archaebacterial DNA from methanogens in Escherichia coli In, Microbial Chemoautotrophy (H R. Strohl, 0 H Tuouinen, eds ) P291-307 The Ohio State University Press, Columbus, Ohio 51. Hamilton, P.T and J.N. Reeve 1985 Sequence divergence of an archaebacterial gene cloned from a mesophilic and a thermophilic methanagen J. Mol. Evol. 22: 351-360. 52. Hamilton, P.T. and J.N. Reeve. 1985. Structure of genes and an insertion element in the methane producing archaebacterium Methanobrevibacter smithii. Mol Gen Genet. 200: 47-59. 53. Hamilton, P.T. and J.N. Reeve. 1986. Molecular biology of archaebacteria. In Microbiology 1986. American Society for Microbiology. p338-348 54. Hauiley, D.K. and UI.R. McClure. 1983. Compilation and analysis of Escherichia coli promoter DNA sequences. Nucleic Acids Res. 11: 2237-2255 55. Hennecke, H., 1. Gunther and F. Binder. 1982. A novel cloning vector for the direct selection of recombinant DNA in £. coli. Gene 19: 231-234 56. Hofman, J.D., R.H. Lau and 14.F. Doolittle. 1979. The number, physical organization and transcription of ribosomal RNA cistrons in an archaebacterium Halobacterium halnhinm Nucleic Acids Res 7= 1321-1333 57 Holzer, 6 , J Oro and T. 6 . Tornabene. 1979 Gas chromatographic-mass spectrometric analysis of neutral lipids from methanogenic and thermoacidophilic bacteria J. Chromatog. 186: 795-809 58. Hopp, T.P. and K.R. Uloods. 19B1. Prediction of protein antigenic determinants from amino acid sequences. Proc. Natl. Acad. Sci USA 78: 3824-3828. US 59. Hui, I. And P.P. Dennis. 1985. Characterization of the RHA gene clusters m Halobacterium nitiriibrum J Biol Chem 260: 899-986 60. Janekovic, D., S. Munderl, I. Holz, W. Zillig, ft Gierl and H. Neumann. 1983. TTU1, TTU2 and TTU3, a family of viruses of the extremely thermophilic, anaerobic, sulfur reducing archaebacterium Thermoproteus tenax Mol. Ben Genet 192: 39-45 61. Jarrell, K F , G.D. Sprott and ft.T. Matheson. 1984. Intracellular potassium concentration and relative acidity of the ribosomal proteins of methanogenic bacteria. Can J. Microbiol 30: 663-668 62 Jinno, Y , H. Nomiyama, S Wakasugi, K. Shimada and I Matsuda. 1984 Isolation and characterization of phage clones carrying the human argininosuccinate synthetase-like genes J Inher. Metab. Dis. ?■■ 133-134 63 Jones, 14. J., D P. Nagle Jr and W B. Whitman 1987 Methanogens and the diversity of archaebacteria. Microbiol. Rev. 51 135-177 64. Kabsch, W. and C. Sander. 1983. Houi good are predictions of protein secondary structure? FEBS Lett. 155= 179-182. 65. Kagramanova, U.K., ft S. Mankin, L A. Baratova and 0.8. Bogdanov. 1982. The 3' - terminal nucleotide sequence of the Halobacterium halohium 16S rRNfl FEBS Lett 144= 177-188 66. Kaine, B.P., R. Gupta and C.R Woese 1983. Putative introns in tRNH genes of prokaryotes. Proc. Natl. Read. Sci. USA. 88: 3309-3312. 67. Kandler, 0. and H. Konig. 1978. Chemical composition of the peptidoglycan-free cell uialls of methanogenic bacteria. Arch. Microbiol 118: 141152. 68 . Kessel, M and F. Klink. 1980. Archaebacterial elongation factor is ADP-ribosylated by diphtheria toxin. Nature 287 = 250-251 69. KJems, J and R A. Garrett 1985 An intron in the 23S ribosomal RNA gene of the archaebacterium Desulfurococcus mohilis Nature 318: 675-677. 70. Klein, A. and M Schnorr 1984. Genome complexity of methanogenic bacteria. J. Bacteriol. 158: 628-631. 116 71. Klink, F. 1985. Elongation factors. In, The Bacteria uol 8 Archaebacteria (C.R. Woese and R.S Molfe eds) p379-410 Academic Press, NY. 72. Konheiser, U., G. Pasti, C Bollschuieiler and A Klein. 1984 Physical mapping of genes coding for tuio subunits of methyl CoM reductase component C of Hethanococcus voltae Mol. Gen. Genet 198: 146-152 73. Kushner, D.J and H Onishi 1968. Absence of normal cell wall constituents from the outer layers of Halobacterium cutiruhrum Can. J. Biochem. 46= 997-998. 74. Kyte, J and R.F. Doolittle. 1982 A simple method for displaying the hydropathic character of a protein J Mol Biol. 157 105-132 75. Lake, J.A. 1987. A rate-independent technique for analysis of nucleic acid sequences: evolutionary parsimony In Press 76. Lederberg, E.M. and S.N. Cohen. 1974. Transformation of Salmonella tuphi.nurium by plasmid deoxyribonucleic acid. J. Bacteriol. 119: 1072-1074. 77. Leinfelder, W, M. Jarsch and A. Bock. 1985. The phylogenetic position of the sulfur-dependent archaebacterium Thermooroteus tenax: sequence of the 16S rRNA gene Syst. Appl Microbial 6: 164-170. 78. Luehrsen, KR, D.E. Nicholson, D.C. Eubanks and 6.E. Fox. 1981. An archaebacterial 5S rRNA contains a long insertion sequence. Nature 293= 755-756 79. Luehrsen, K.R., D.E. Nicholson and G.E. Fox. 1985. Midespread distribution of a 7S RNA in archaebacteria. Curr. Microbiol. 12: 69-72. 80. Lusty, C.J., E.E. Nidgren, K.E. Broglie and H. Nyunoya. 1983. Yeast carbamyl phosphate synthetase; structure of the yeast gene and homology to Escherichia coli carbamyl phosphate synthetase. J. Biol. Chem 250: 14466-14472. 81. Makula, R A and M.E. Singer. 1978 Ether-containing lipids of methanogenic bacteria. Biochem Biophys, Res Comm. 82 716-722. 82. Maniatis, T., E.F. Fritsch and J, Sambrook. In "Molecular cloning: A Laboratory Manual* 1982. Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724. 117 83 Mankin, 8.S.* U.K. Kagramanova, N.L. Teterina, P.M. Rubtsov, E M Belova, A M Kopylov, L A Baratova and A A. Bogdanov. 1985. The nucleotide sequence of the gene coding for the 16S rRNA from the archaebacterium Halobacterium halohium. Gene 37: 181-189 84. Martin, A, S Yeats, D. Janekovic, W.D. Reiter, W Aicher and W. Zillig. 1984. SAU-1, a temperate u.v.-inducible UNA virus-like particle from the archaebacterium Sulfolobus acidocaldarius isolate B12. EMBO J 3= 7165-2168 85. Matheson, AT. and M Taguchi. 1982. The evolution of the archaebacterial ribosome. Zbl. Bakt. Hyg., I. Abt. Orig. C3: 192-199. 86 Maxam. AM and 14 Gilbert 198B Sequencing end-labeled DNA with base-specific chemical cleavage. Methods Enzymol 65: 499-580 87. McBride, B.C. and R S. Wolfe 1971. A neui coenzyme of methyl transfer, coenzyme M. Biochemistry 10: 2317-2324. 88 . Meile, L., A. Kiener and T. Leisinger 1983. A plasmid in the archaebacterium Methanobacterium thermoautotrophicum. Mol. Gen. Genet. 191 480-484 89. Moritz, A., B. Lankat-Buttgereit, H.J. Gross and W Goebel. 1985. Common structural features of the genes for two stable RNAs from Halobacterium halohium Nucleic Acids Res 13: 31-43 98. Morris, C.J. and J.N. Reeve. 1984. Functional expression of an archaebacterial gene from the methanogen Methanosarcina barkeri in Escherichia coli and Bacillus SUbtilis In, Microbial Growth on CL Compounds. CR.L. Crawford and R.S. Hanson, eds.) American Society for Microbiology, Washington, B.C. 20006. 9 1 . Morris, C.J. and J.N. Reeve. 1 9 8 5 . Expression of a methanogen gene in Escherichia coli and in Bacillus SUbtilis In, Proceedings of the First International Symposium on Biotechnological Advances in Processing Municipal Wastes for Fuels and Chemicals. (A.A. Antonopoulos, ed.) Academic Press, P 1 7 5 - 1 8 6 92 Mitchell, R M., L A. Loeblich, L C Klotz and A R. Loeblich 1979 DNA organization of Methanobacterium thermoautotrophicum. Science 204: 1882-1084. 93. Needleman, S B. and C D. Wunsch 1970. A general method applicable to the search for similarities in the amino acid sequences of two proteins. J. Mol. Biol. 48: 443-453. 118 94. Nomiyama, H., K Obaru, Y. Jinno, I liatsuda, K. Shimada and T Miyata. 1986. Amplification of human argininosuccinate synthetase pseudogenes J. Mol. Biol 192- 221-233. 95. Nyunoya, H. and C.J. Lusty. 1983 The carB gene of Fscherichia coli: A duplicated gene coding for the large subunit of carbamyl-phosphate synthetase. Proc Natl Acad. Sci. USA. 88 : 4629-4633 96. Nyunoya, H. and C.J. Lusty. 1984. Sequence of the small subunit of yeast carbamyl phosphate synthetase and identification of its catalytic domain. J. Biol. Chem. 259: 9790-9798. 9 7 Nyunoya, H , K.E. Broglie and C.J Lusty 1985 The gene coding for carbamy 1-phosphate synthetase I was formed by fusion of an ancestral glutaminase gene and a synthetase gene Proc Natl Acad. Sci. USA. 82: 2244-2246. 98 Nyunoya, H., K.E. Broglie, E.E. Widgren and C.J. Lusty. 1985. Characterization and derivation of the gene coding for mitochondrial carbamyl phosphate synthetase I of rat. J. Biol. Chem. 260: 9346-9356 99. O’Brien, WE, R. Mclnnes, K. Kalumuck and M Adcock 1986 Cloning and sequence analysis of cDNA for human argininosuccinate lyase. Proc. Natl. Acad. Sci. USA 83= 7211-7215. 100 Pappenheimer, A M , Jr , P C. Dunlop, K.W Adolph and J.W. Bodley. 1983 Occurrence of diphthamide in archaebacteria. J. Bacteriol. 153= 1342-1347 101 Pfeifer, F , K Ebert, 6 Weidinger and W. Goebel 1982. Structure and function of chromosomal and extrachromosomal DNA in halobacteria. In 'Archaebacteria" <0. Kandler, ed ), pll0-119. Fischer, Stuttgart. 102. Piette, J , R. Cunin, A. Boyen, D. Charlier, M. Crabeel, P Uan Uliet, N. Glansdorff, C Squires and CL. Squires. 1982 The regulatory region of the divergent aroECBH operon in Escherichia coli K12. Nucleic Acids Res 10: 8031-8048. 103. Piette, J., R Cunin, F Uan Uliet, D. Charlier, M. Crabeel, Y Ota and N. Glansdorff. 1982. Homologous control sites and DNA transcription starts in the related argF and argl genes of Escherichia coli K12 The EMBO Journal 1: 853-857 119 104 Piette, J, H. Nyunoya, C.J. Lusty, R. Cunin, 6 LWeyens, M. Crabeel, D. Charlier, N Glansdorff and A. PiArard. 1984. DNA sequence of the car A gene and the control region of car Alt tandom promoters, respectively controlled by arginine and the pyrimidines, regulate the synthesis of carbamyl-phosphate synthetase in Escherichia coli K12. Pr oc. Natl. Acad. Sci. USA. 81- 4134-4138 105. Ratner, S. 1982. Argininosuccinate synthetase of bovine liver: chemical and physical properties. Proc. Natl. Acad. Sci. USA 79= 5197-5199 106. Reeve, J.N. 1979. Use of minicells for bacteriophage directed polypeptide biosynthesis. Methods Enzymol. 68: 493-583 107 Reeve, J N , G S Beckler, J W Brouin, D.S Cram, E S. Haas, P.T Hamilton, C J Morris, B.A Sherf and C F Neil (1986) Divergence of methanogens, conservation of the hist gene sequence in all three biological kingdoms and the status of Methanobacterium thermoautotrnnhicum Proceedings of the Fifth International Symposium on Growth on Cl Compounds. Haren, Netherlands p255-260. 108. Reeve, J N., P.T. Hamilton, G.S. Beckler, C.J. Morris and C.H. Clarke <1986). Structure of methanogen genes. Syst. Appl Microbiol 7 5-12 109. Reeve, J.N., N.J. Trun and P T Hamilton. 1982. Beginning genetics with methanogens. In, Genetic Engineering of Microorganisms for Chemicals. 118. Sanger, F., S. Nicklen and A.R. Coulson. 1977. DNA sequencing with chain-terminating inhibitors. Proc. Natl. Acad. Sci. USA. 74 = 5463-5467 111. Sapienza, C. and W.F. Doolittle 1982. Unusual physical organization of the Halobacterium genome. Nature 295: 384-389. 112 Sapienza, C , M R Rose and W.F Doolittle. 1982 High-frequency genomic reaarangements involving archaebacterial repeat sequence elements Nature 299- 182-185. 113 Schmid, G and A. Bock 1981. Immunological comparison of the ribosomal proteins from archaebacteria J. Bacteriol. 147= 282-288 114. Schmid, G. and A Bock. 1982 The ribosomal protein component of five methanogenic archaebacteria. Zbl. Bakt. Hyg., I Abt. Orig. 03: 347-353 120 115. Schmid, 6 , T. Pecher and A. Bock. 1982. Properties of the translation apparatus of archaebacteria. Zbl. Bakt Hyg., I Abt. Orig C3: 209-217 116. Schnabel, H., W. Zillig, M. Pfaiffle, R. Schnabel, H. Michel and H, Delius 1982. Halobacterium halobium 0 H EMBO J. 1 87-92 117. Sharp, P.A 1981. Speculations on RNA splicing Cell 23: 643-646 118. Shepherd, J.C.W. 1981. Method to determine the reading frame of a protein from the purine^pyrimidine genome sequence and its possible evolutionary Justification Proc. Natl. Acad. Sci. USA. 78= 1596-1600 119 Shine, J. and L Dalgarno. 1974 The 3 '-terminal sequence of Fsrherichia coli 16S rRNA: complementarity to nonsense triplets and ribosome binding sites. Proc. Natl. Acad Sci USA 71: 1342-1346. 120. Smith, M.C.M., A. Mountain and S Baumberg 1986. Sequence analysis of the Bacillus subtilis aroC promoter region. Gene 49: 53-60 121. Staden, R. 1982. An interactive graphics program for comparing and aligning nucleic acid and amino acid sequences Nucleic Acids Res. 10: 2951-2961. 122. Stalker, DM, R Kolter and DR. Helinski. 1979. Nucleotide sequence of the region of an origin of replication of the antibiotic resistance plasmid R6K Proc. Natl. Acad. Sci. USA. 76: 1150-1154 123. Stetter, K.O., J. Winter and R. Hartlieb. 1980. DNA-dependent RNA polymerase of the archaebacterium Methanobacterium thermoautotrophicum Zbl. Bakt. Hyg., I. Abt Orig Cl: 201-214 124. Streicher, SL. 1983. Cloning and expression of the arqG gene from Streptomuces cattleua in Escherichia coli and Streptomuces cattleua. p. 134 in Abstracts of 83rd Annual Meeting of the American Society for Microbiology. 125 Strom, AR and L.P Uisentin 1973. Acidic ribosomal proteins from the extreme halophile, Halobacterium rntiruhrum the simultaneous separation, identification and molecular weight determination. FEBS Lett. 37: 274-280. 126. Swofford, D.L. and W.P Maddison. Reconstructing ancestral states under Wagner parsimony. Math. Biosci. In Press 121 127 Thomm, M., J Altenbuchner and K.O.Stetter 1983 Evidence for a plasmid in a methanogenic bacterium J. Bacteriol 153: 1060-1062 128 Thomm, M and G. Mich. 1987. An archaebacterial consensus promoter sequence for stable RNA genes with homology to the eukaryotic TATA box. In Press. 129. T ornabene, T.6. and T. A. Lang worthy. 1978. Diphy tanyl and dibiphytanyl glycerol ether lipids of methanogenic archaebacteria. Science 203= 51-53 130 Torsvik, T. and I D. Dundas. 1974. Bacteriophage of Halobacterium salinarium Nature 248: 680-681. 131 Trifonov, E.N. 1987. Translation framing code and frame-monitoring mechanism as suggested by the analysis of mRNA and 16S rRNA nucleotide sequences. J Mol Biol. 194: 643-652 132. Uan Knippenberg, P H., J.M.A. Uan Kimmenade and H.A Heus. 1984. Phytogeny of the conserved 3' terminal structure of the RNA of small ribosomal subunits. Nucleic Acids Res. 12= 2595-2604. 133 Uan Uliet, F ., R Cunin, A Jacobs. J Piette. D Gigot, M Lauwereys, A. Pi4rard and N Glansdorff. 1984. Evolutionary divergence of genes for ornithine and aspartate carbamyltransferases-complete sequence and mode of regulation of the Escherichia coli argF gene; comparison of ar 134. Uisentin, L.P., C. Chaw, A.T. Matheson, M. Yaguchi and F. Rollin. 1972. Halobacterium cutirubrum ribosomes. Properties of the ribosomal proteins and ribonucleic acid. Biochem. J 130: 183-110 135. Uossbrinck, C.R. and C.R. Noese. 1986. Eukaryotic ribosomes that lack a 5.8S rRNA. Nature 320: 287-208. 136 Meil, C.F. and J.N. Reave. 1987. Structure and organization of the hisA gene of the thermophilic archaebacterium Methanococcus thermolithotrophicus In Press 137 Meiss, R.L. 1974. Subunit cell wall of Siilf n To bus acidocaldarius J. Bacteriol. 118: 275-284. 138. Mich, 6., H. Hummel, M. Jarsch, U. Bar and A. Bock. 1906. Transcription signals for stable RNA genes in Me thano coccus Nucleic Acids Res. 14= 2459-2479. 122 139. Mich, 6., M. Jarsch and A. Bock. 1984. Apparent operon for a SS ribosomal RNA gene for tRNA genes in the archaebacterium Methanocnccus vannielii. Mol. Gen. Genet. 196: 146-151. 148 Mich, G, M. Leinfelder and A. Bock. 1987. Genes for stable RNA in the extreme thermophile Thermaproteus tenax introns and transcription signals EMBO J. 6: 523-528. 141. Mich, G., L. Sibold and A Bock. 1986. Genes for tRNA and their putative expression signals in Methanocnccus System. Appl Microbiol 7 18-25 142. Moese, C.R. 1987 Bacterial evolution. Microbiol. Rev 51: 221-271. 143 Moese, C.R. and BE Fox 1977 Phylogenetic structure of the prokaryotic domain: the primary kingdoms Proc. Natl. Acad Sci USA 74 5888-5898 144 Moese, C R, L.J. Magrum and G.E Fox 1978 Archaebacteria J Mol Euol 11= 245-252 145. Mood, A.6 , H Redborg, D R Cue, M B Mhitman and J. Konisky. 1983. Complementation of arofi and hisA mutations of Escherichia coli by DNA cloned from the archaebacterium . „ Methanocnccus voltae J Bacteriol. 156= 19-29. 146. Mood, A.6., A M Meege, P Delannoy and T Konisky. Uirus-like particles from a Methanococcus strain closely related to Methanococcus voltae. p. 178 in Abstracts of 87th Annual Meeting of the American Society for Microbiology. 147 Mood, A.G., M.B. Mhitman and J Konisky. 1985. A newly isolated marine methanogen harbors a small cryptic plasmid. Arch. Microbiol. 142= 303-305. 148 Vang, D., B.P. Kaine and C.R. Moese. 1985. The phytogeny of archaebacteria. System Appl. Microbiol. 6= 251-256. 149. Veats, S., P. McMilliam and M Zillig. 1982. A plasmid in the archaebacterium, Sulfglobus acidocaldarius EMBO J 1 1835-1838 158 Zillig, M. and K.O. Stetter. 1980 Distinction between the transcription system of archaebacteria and eubacteria. In, 6enetics and Evolution of RNA polymerase, tRNA and Riboscmes. 151 Zillig, M., K.O. Stetter, R. Schnabel, J. Madon and A. Gierl. 1982. Transcription in archaebacteria. Zbl. Bakt Hyg., 1 Abt. Orig. C3= 218-227 123 152 Zillig, W , K O. Stetter, R. Schnabel and M. Thomm. 1985 DNA-dependent RNA polymerase of the archaebacteria. In, The Bacteria vol. 8 Archaebacteria (C.R. Moese and R.S. kiolfe eds) p499-524 Academic Press, v 153. Zuckerkandl, E. and L. Pauling. 1965. "Evolutionary divergence and convergence in proteins" in Evolving Genes and Proteins (U Bryson and H.J Uogel eds.) p97~166 Academic Press, Neui York Could comparison of these genes add information to the phylogeny of the archaebacteria? 4) Were there sequences in the inter genic regions f lanking the ar qG genes similar to the putative transcription and translation control sequences described from studies of other methanogen^archaebacterial genes?