U N I V E R S I T Y O F C O P E N H A G E N F A C U L T Y O F S C I E N C E
PhD Thesis Xu Feng Identification of the archaeal regulatory network in DNA damage response
Supervisor: Qunxin She
Data of Submission: 12 June 2018
Name of department: Department of Biology
Author(s): Xu Feng
Title and subtitle: Identification of the archaeal regulatory network in DNA damage response
Topic description: The role of a paralog of TFB/TFIIB protein and an orthologue of Orc1/Cdc6 protein during DNA damage-induced transcriptional responses in Sulfolobus islandicus Rey15A was investigated.
Supervisor: Qunxin She
Submitted on: 12 June 2018
This thesis has been submitted to the PhD School of the Faculty of Science, University of Copenhagen
The cover image is an integrated model of the regulatory network of DNA damage response in Sulfolobus.
PREFACE
The thesis entitled ‘Identification of the archaeal regulatory network in DNA damage response’ was submitted to the Faculty of Science, University of Copenhagen. The work presented in the thesis was carried on at the Danish Archaea Center (DAC), Department of Biology, University of Copenhagen, Denmark, under the supervision of Dr. Qunxin She and with financial support from Danish Council for Independent Research (DFF-4181- 00274) and China Scholarship Council (CSC).
The thesis starts with a general overview of mechanisms employed by three domains of life to deal with DNA damage, including DNA damage repair and tolerance pathways. It is followed by an introduction of the current knowledge about the DNA damage response (DDR) across the tree of life. Subsequently, the results obtained during my PhD study are summarized. At last, the thesis goes to its end with the discussion of the results and perspectives for future research.
Two papers are included at the end of the thesis, with the first one addressing the function of TFB3 and the second focusing on the functional study of Orc1-2 during DDR regulation in Sulfolobus islandicus Rey15A. These papers are given below:
Xu Feng, Mengmeng Sun, Wenyuan Han, Yun Xiang Liang, Qunxin She; A transcriptional factor B paralog functions as an activator to DNA damage-responsive expression in archaea, Nucleic Acids Research, gky236, https://doi.org/10.1093/nar/gky236
Mengmeng Sun, Xu Feng, Zhenzhen Liu, Wenyuan Han, Yun Xiang Liang, Qunxin She; An Orc1/Cdc6 ortholog functions as a key regulator in the DNA damage response in Archaea, Nucleic Acids Research, gky487, https://doi.org/10.1093/nar/gky487
I
ACKNOWLEDGEMENTS
It has been a wonderful time during my stay in Denmark and I would like to say thanks to everybody that has supported me here.
Firstly, I would like to express my sincere gratitude to my supervisor Dr. Qunxin She for his support during my PhD study. His guidance has enlightened me in the past years and will be the most valuable thing for my scientific career.
I am grateful to all my current and former labmates in the Danish Archaea Centre and the Molecular Biology of Archaea Lab in Wuhan including Yunxiang Liang, Roger Garrett, Xu Peng, Yongmei Hu, Nan Peng, Yuxia Mei, Zhengjun Chen, Ling Deng, Changyi Zhang, Wenyuan Han, Wenfang Peng, Fei He, Daniel Stiefler-Jensen, Mariana Awayez, Thi Ngoc Hien Phan, Soley Gudbergsdottir, Carlos Leon, Laura Alvarez, Dongqing Jiang, Yingjun Li, Min Ren, Wenqing She, Jingzhong Lin, Yan Zhang, Mingxia Feng, Mengmeng Sun, Tong Guo, Anders Lynge Kjeldsen, Yuvaraj Bhoobalan, Anders Fuglsang, Pavlos Papathanasiou, Anne Louise Grøn Jensen, Weijia Zhang, Zhenzhen Liu and Saifu Pan. It has been a wonderful experience working together with them. My sincere thanks also go to Dr. Li Huang and all the lab members in his lab for their support during my stay in Institute of Microbiology, Chinese Academy of Science at Beijing.
My special thanks are dedicated to my best friends including Yingwei Feng, Liuquan Feng, Bin Li, Zhengkun Kuang, Jun Wang, Qishan Zhang and Kaisong Huang. Thanks to them for being there through all those tough times in my life.
Last, and most importantly, I would like to express my deep gratitude to all my beloved family members, especially my wife (Jianglan Liao) and my son (Muxin Feng) for their unconditional love and support.
II
TABLE OF CONTENTS
Preface ...... I
Acknowledgements ...... II
Table of contents ...... III
Summary ...... V
Sammendrag ...... VII
Abstracts ...... IX
Abbreviations ...... X
Objectives ...... XII
Introduction ...... 1
Universal strategies for DNA damage removal ...... 2 Direct reversal of DNA damage ...... 3 Excision of DNA damage ...... 5 Repair of double strand DNA breaks ...... 13
DNA damage tolerance by Translesion DNA synthesis ...... 16
DNA damage response in three domains of Life ...... 19 ATM/ATR mediated DNA damage signaling pathways in Eukarya ...... 19 Multilayer regulations of SOS response in Bacteria ...... 23 Cellular responses towards DNA damage in Archaea ...... 26
Transcriptional regulation in Archaea ...... 30
Summary of the results ...... 33 TFB3 functions as a transcriptional activator for DNA transfer pathway ...... 33 Orc1-2 functions as a global regulator essential for DDR in Sulfolobus ...... 35
Discussions and future perspectives ...... 38
III
References ...... 41
IV
SUMMARY
DNA damage response (DDR) is essential for the maintenance of genome integrity in all three domains of life, and the process is controlled by evolutionarily unrelated factors in Bacteria and in Eukarya. While DDR is primarily mediated by cleavage of the global repressor, LexA in the former, the process in the latter is mainly orchestrated by two evolutionally conserved kinases, ATM/ATR. Strikingly, none of these DDR regulators have a homologue in Archaea. As a result, it remains elusive as to how organisms in Archaea coordinate cellular processes in response to DNA damage signal (s). Nevertheless, investigation of genome expression upon UV light exposure in Sulfolobus species revealed a number of differentially expressed genes including genes encoding a paralogue of TFB protein (TFB3) and an orthologue of Orc1/Cdc6 protein (Orc1-2). Here, we apply a combination of genetic, biochemical, transcriptome and phylogenetic analysis to investigate their possible roles in archaeal DDR using Sulfolobus islandicus REY15A as the model.
Firstly, we constructed tfb3 gene deletion mutant and the transcriptome analysis of the resulting mutant (∆tfb3) revealed that TFB3 is essential for the transcriptional activation of a subset of DDR genes. Phenotypic characterization of ∆tfb3 showed that the mutant loses its ability to form cell aggregates upon DNA damage and is moderately sensitive to DNA damage. Interestingly, CHIP-qPCR analysis showed that TFB3 specifically binds to the promoter region of TFB3-dependent genes, suggesting that TFB3 directly modulates the transcriptional process upon DNA damage. Further, mutagenesis of the TFB3 protein and subsequent functional analysis indicated that the N terminal Zn ribbon and C terminal Coiled-Coil motif are essential for its function in the transcriptional activation. Furthermore, the phylogenetic analysis revealed a co-evolution of TFB3 with its target system (Ced, the Crenarchaeal system for exchange of DNA), suggesting that the TFB3- mediated transcriptional regulation may represent a well conserved DDR regulatory circuit for intercellular DNA transfer in Crenarchaeota.
Then, we showed that the previously constructed orc1-2 deletion mutant (∆orc1-2) is hypersensitive to NQO treatment and the transcriptome analysis of the mutant revealed that
V
Orc1-2 is essential for the global transcriptional regulation upon DNA damage. Orc1-2- dependent processes include the TFB3-controlled DNA transfer pathway, DNA replication initiation, cell cycle arrest and potential translesion DNA synthesis. Consistently, ∆orc1-2 is defective in DNA damage induced cell aggregation and cell cycle control. Furthermore, DNase I footprinting assay with Orc1-2 indicated that this protein is capable of protecting a conserved promoter element present in a number of Orc1-2-dependent genes and reporter gene assay demonstrated that this motif is responsible for the DNA-damage responsive expression, suggesting that Orc1-2 binds to the conserved DNA damage responsive element (DDRE) upon DNA damage and modulates transcriptions of DDR genes. Eventually, a promoter switch strain was constructed and analysis of the DDR in this strain showed that the induction of Orc1-2 is essential but not sufficient for activation of DDR, suggesting Orc1-2 could be posttranslationally modified upon DNA damage.
VI
SAMMENDRAG
DNA damage response (DDR) er essentiel for vedligeholdelse af genomets integritet i alle livsformer, og processen er styret af evolutionæ rt uafhæ ngige faktorer i bakterier og eukaryoter. Hvor DDR primæ rt er kontrolleret af den globale repressor LexA i bakterier, er processen i eukaryoter hovedsageligt styret af to bevarede kinaser, ATM/ATR. Ingen af disse DDR-regulatorer har homologer i arkæ er. Derfor er det fortsat ukendt hvordan organismer i det arkæ iske domæ ne koordinerer lignende cellulæ re processer som respons til DNA-skadesignaler. Ikke desto mindre har forskningen af genomekspressionen i Sulfolobus-arter afsløret et antal forskelligt udtrykte gener inklusivt gener kodende for en paralog af TFB-proteinet (TFB3) og en ortolog af Orc1/Cdc6-proteinet (Orc1-2). Her er en ræ kke genetiske, biokemiske, transkriptom- og phylogenanalyser blevet udført for at undersøge deres mulige funktion i arkæ isk DNA-skade reparation ved brug af Sulfolobus islandicus REY15A som modelorganisme.
Først konstruerede vi en tfb3-deletionsmutant og en transkriptomanalyse af den resulterende mutant afslørede, at TFB3 er nødvendig for transkriptionel aktivering af nogle af DDR-generne. Fæ notypekarakterisering af tfb3-deletionsmutanten viste at mutanten havde mistet evnen til at forme celleaggregater ved DNA-skade og er delvis sensitiv til DNA-skade. CHIP-qPCR-analyse viste at TFB3 associerer specifikt med promoterregionen af TFB3-afhæ ngige gener, hvilket tyder på at TFB3 modulerer den transkriptionelle proces direkte ved DNA-skade. Ydermere indikerer mutagenesen af TFB3-proteinet og den efterfølgende funktionelle analyse, at det N-terminale Zinkbånd og det C-terminale coiled-coil-motiv i TFB3 er essentielle for transkriptionel aktivering og at den sidstnæ vnte sandsynligvis medierer rekrutteringen af TFB3 til promoterregioner. Derudover påviste fylogenetisk analyse koevolution af TFB3 og dets target-system (Ced), hvilket tyder på at TFB3-medieret transkriptionsregulering kan repræ sentere et konserveret DDR-regulatorisk kredsløb ved intercellulæ r DNA-overførelse i Crenarchaeota.
Derefter viste vi, at den tidligere konstrueret orc1-2-deletions mutant er hypersensitiv til NQO-behandling og transskriptomanalyse af mutanten viste at Orc1-2 er nødvendig for
VII den globale transskriptionelle regulering ved DNA-skade. Orc1-2-afhæ ngige processer inkluderer den TFB3-kontrollerede DNA-overførsels-pathway, initiation af DNA- replikation, cellecyklusarrest og potentiel translesionsyntese af DNA. ∆orc1-2 er ikke funktionsdygtig til celleaggregering og cellecycluskontrol ved DNA-skade. Desuden indikerede DNase I footprinting assays med Orc1-2, at proteinet beskytter en konserveret promoterelement der findes i et antal Orc1-2-afhæ ngige gener og reportergen-assays demonstrerede at motivet er ansvarlig for DNA-skadereaktiv ekspression, hvilket tyder på at Orc1-2 binder til det bevarede DNA damage responsive element (DDRE) ved DNA- skade og modulerer transskription af DDR-generne. Til sidst viste vi ved konstruktion af en promoter-skiftet stamme og undersøgelse af DDR i denne, at induktion af Orc1-2 er essentiel, men ikke tilstræ kkelig, for aktivering af DDR, hvilket tyder på at Or4c1-2 kunne væ re posttranslationelt modificeret ved DNA-skade.
VIII
ABSTRACTS
To counteract the threat of genomic DNA lesions, organisms belonging to domains of Eukarya and Bacteria have evolved a sophisticated network of DNA damage response (DDR) systems. These include events that lead to activation of DNA repair, cell-cycle arrest and tolerance of DNA damage. In bacteria, these processes are coordinated by the global regulator, LexA and in eukarya the principal DDR regulators are two evolutionally conserved kinases, ATM and ATR. In contrast, comparative genomics analysis failed to detect any of the reported regulators mediating DDR in Archaea. In this work, we aim to investigate the function of two potential DDR regulators including a paralog of TFB family protein (TFB3) and an orthologue of archaeal/eukaryal Orc1/Cdc6 protein (Orc1-2) in DDR of Sulfolobus islandicus Rey15A, a model archaeon. We found that tfb3 deletion mutant (Δtfb3) is more sensitive to a DNA damaging agent, NQO, and transcriptome analysis of the response of WT and Δtfb3 to NQO treatment revealed that TFB3 is essential for the transcriptional activation of a subset of genes, including a number of genes implicated in intercellular DNA transfer. Consistently, we demonstrated that the deficiency of TFB3 leads to the loss of cell aggregation upon DNA damage. Furthermore, CHIP- qPCR analysis indicated that TFB3 is specifically associated with the promoter region of its target genes and functional analysis of TFB3 by mutagenesis demonstrated that the conserved Zn-ribbon and coiled-coil motif are essential for its function. These results indicate that TFB3 functions as a transcriptional activator for DDR genes probably by interacting with specific transcriptional regulator, thus facilitating PIC (Pre-initiation complex) formation. More strikingly, phenotypic characterization of the previously constructed Δorc1-2 revealed a hypersensitivity phenotype to NQO, and subsequent transcriptome analysis indicated that the deficiency of Orc1-2 abrogates the differential expression of all DDR genes, including those implicated in DNA replication initiation, cell division, translesion DNA synthesis and TFB3-dependent DNA transfer pathway. Furthermore, DNase I footprinting analysis and reporter gene assay demonstrated that Orc1-2 interacts with a conserved hexanucleotide motif present in the promoter regions of a number of DDR genes and regulates their expression. In addition, by manipulating the expression level of orc1-2, we showed that a high level of Orc1-2 is essential but not sufficient for the DDR activation.
IX
ABBREVIATIONS
DDR DNA damage response DDRE DNA damage responsive element Orc1/Cdc6 Origin recognition complex 1/cell division cycle 6 TBP TATA box binding protein TFB Transcription factor B AP Apurinic/apyrimidinic CPD Cyclobutane pyrimidine dimer 6-4PP 6-4 photoproduct DSB Double strand DNA break BPS Base pair substitution Indel Insertion and deletion ROS Reactive oxygen species BER Base excision repair NER Nucleotide excision repair GGR Global genomic repair TCR Transcription-coupled repair HRR Homologous recombination repair MMR DNA Mismatch repair NHEJ Non-homologous end joining TLS Translesion DNA synthesis PIKK Phosphatidyl inositol 3' kinase-related kinases ATM Ataxia-telangiectasia mutated ATR ATM and Rad3 related DNA-PKcs DNA dependent protein kinase catalytic subunit ATRIP ATR Interacting Protein MRN/X Mre11-Rad50-Nbs1/Xrs2 complex RPA Replication protein A XP Xeroderma pigmentosum TRCF Transcription-repair coupling factor NucS Nuclease S1
X
SSB Single strand DNA binding protein/ Single strand DNA break CDK Cyclin-dependent kinase PTM Posttranslational modification MMS Methyl methanesulfonate 4-NQO 4-Nitroquinoline 1-oxide UV Ultraviolet Ced Crenarchaeal system for exchange of DNA Ups UV-inducible pili operon of Sulfolobus FAD Flavin adenine dinucleotide UDG Uracil DNA glycosylase dRPase Deoxyribophosphodiesterase OGT O-linked β-N-acetylglucosamine transferase MGMT Methylguanine DNA methyltransferase XRCC4 X-ray repair cross-complementing protein 4 XLF XRCC4-like factor ExoI Exonuclease 1 BLM Bloom Syndrome RecQ Like Helicase Dna2 DNA replication helicase/nuclease 2 BRCA2 Breast cancer early-onset 2 MCM Minichromosome maintenance Hje Holiday junction endonuclease hjm Holiday junction migration PCNA Proliferating cell nuclear antigen Fen1 Flap endonuclease 1 MDC1 Mediator of DNA damage checkpoint 1 TopBP1 DNA Topoisomerase II Binding Protein 1) RNR Ribonucleotide reductase SMARCAL1 SWI/SNF-related, matrix-associated, actin-dependent regulator of chromatin, subfamily A-like 1 HGT Horizontal gene transfer
XI
OBJECTIVES
During my PhD study, I aimed to investigate the role of a paralog of TFB protein and an ortholog of Orc1/Cdc6 protein during DNA damage-induced response of Sulfolobus islandicus Rey15A. The following objectives have been addressed:
To determine whether TFB3 and Orc1-2 function in DNA damage-induced transcriptional responses To identify specific pathways that is regulated by TFB3 or Orc1-2. To provide insights into the mechanism of Orc1-2/TFB3 mediated transcriptional regulation To provide insights into how Sulfolobus respond to DNA damage in a global level
XII
INTRODUCTION
Since the recognition of Archaea as a separate domain of life based on the phylogenetic analysis of ribosomal RNA sequences (1), a wealth of knowledge in the field of archaeal biology has been obtained via phylogenetic, biochemical, structural and genetic studies on archaeal model organisms. These studies have revealed that archaeal organisms are featured with both eukaryal and bacterial characteristics. For instance, the archaeal DNA replication, transcription and translation system are more closely related to those in eukarya, whereas archaeal cells lack nucleus and contain the circular chromosome as that in bacteria (2).
Many archaeal organisms thrive in extremely high temperatures that facilitate DNA lesion formation (3), as spontaneous hydrolysis of purine and pyrimidine bases, sugar phosphate cleavage and deamination of cytosine in DNA accelerate as temperature increases (4). For this reason, a considerable amount of studies has been focusing on how these extremophiles maintain their genomic integrity (5). Strikingly, genetic analysis revealed that the spontaneous mutation rate of the thermoacidophilic archaeon Sulfolobus acidocaldarius is comparable to that of mesophilic bacterium Escherichia coli (6). This indicates that archaea possess a more efficient system by which DNA damage is repaired.
The availability of archaeal genome sequences has largely facilitated the research of DNA damage repair in Archaea during the past decades. A myriad of archaeal homologs have been identified based on sequence similarity to known bacterial or eukaryal DNA repair proteins, and subsequently characterized in details. These studies in turn provide insights into the function of their counterparts in other life domains. For example, the structural analysis of archaeal Y family polymerases has substantially increased our understandings of mechanisms of translesion DNA synthesis, due to their relative ease for crystallization (7). Moreover, a few archaeal specific enzymes implicated in DNA repair are also identified, which has expanded the spectrum of DNA repair machinery (3,8). The development of versatile genetic toolboxes in model archaeon further offers the possibility of studying the functionality of potential DNA repair systems in Archaea in vivo (9,10). Here our current understanding of how archaea deal with genomic insults and how they
1 respond to DNA damage signal in a global level, including the reported DNA repair pathways, DNA damage tolerance and other related cellular processes, will be summarized, with a comparison of those strategies have been evolved across the tree of life.
UNIVERSAL STRATEGIES FOR DNA DAMAGE REMOVAL The primary structure of DNA can be modified by both endogenous and environmental factors, giving rise to DNA lesions. For example, some DNA aberrations can arise via cellular processes, such as misincorporation of dUTPs during replication, abortive topoisomerase activity, spontaneous hydrolytic reactions and metabolic byproduct like reactive oxygen species (ROS). Some are resulted from modifications by exogenous factors like UV, ionizing irradiation and diverse chemicals.
Figure 1. Sources of DNA damage and DNA repair pathways Endogenous and exogenous factors (top) induce different types of DNA lesions (middle) on either DNA backbone or base. These DNA lesions can be removed by specific pathway or by the co- actions of several pathways (bottoms). The picture was taken from Genois et al., 2014 (11).
As reviewed by Sancar et al. (12), genomic insults occur on bases, the building blocks of DNA molecule, the sugar-phosphate backbone or sometimes in a form of crosslink of the two DNA strands. Common base damages include O6-methylguanine, thymine glycols, and other reduced, oxidized, or fragmented bases in DNA that are produced by ROS or by 2 ionizing radiation (12). Ultraviolet (UV) radiation also gives rise to these species indirectly by generating reactive oxygen species, and meanwhile, it also produces specific products such as cyclobutane pyrimidine dimers (CPD) and (6-4) photoproducts (12). In addition, various base adducts can also be induced by diverse chemicals. For instance, the formation of bulky adducts can be induced by large polycyclic hydrocarbons or simple alkyl adducts by alkylating agents. Backbone damages include abasic sites and single- and double-strand DNA breaks (SSB and DSB). Specifically, abasic sites are generated spontaneously by the formation of unstable base adducts or by base excision repair. SSB are produced by excision repair or directly by damaging agents. DSB are produced by ionizing radiation or induced by DNA-damaging agents. Some DNA damaging agents such as cisplatin and mitomycin D modify DNA in a form of interstrand or intrastrand DNA cross-links, which is the most complex form of DNA damage (12).
These DNA lesions can block genome replication, and if left unrepaired or are repaired improperly, they lead to genomic mutation or instability (13). As a result, all organisms have evolved a number of DNA repair pathways, to counteract the deleterious effects of DNA damage. Briefly, different types of DNA lesions generated by diverse cellular processes or environmental factors can be directly reversed or excised, or sometimes bypassed by specific pathways. As shown in Fig. 1, the DNA lesions occurred at base without strand-distortion are processed by direct reversal repair or BER (Base excision repair), while the DNA-helix-distorted lesion such as CPD (Cyclobutane pyrimidine dimmer) and 6-4 photoproducts are repaired by NER (Nucleotide excision repair), which also removes the bulky adducts on DNAs and are involved in the intrastrand DNA cross- link repair as well. The broken ends of DSB can be ligated by NHEJ (Non-homologous end joining) or subjected into HRR (homologous recombination repair). Post-replicative repair (mismatch repair, MMR) corrects those errors arising from DNA replication and the unrepaired replication-blocking lesion can be bypassed by translesion DNA synthesis (TLS) (14).
DIRECT REVERSAL OF DNA DAMAGE The most straightforward DNA repair pathway reported so far is the direct reversal of DNA lesions by specialized enzyme, including the photolyase that functions in the repair of UV- induced dimmers and methylguanine DNA methyltransferase (MGMT) that corrects with O-
3 alkylated DNA damage, by binding to DNA lesion and restoring the DNA to its normal states. Whereas the former is absent in placental mammals, the latter is ubiquitously present in nearly all organisms (15).
The DNA lesions induced by widespread UV light can block the DNA replication machinery if not removed properly. Two distinct enzymes, CPD photolyase and 6-4 photolyase have been identified for the direct removal of UV-induced DNA lesions. The former repairs CPD dimer and the latter specifically removes 6-4 photo products. So far, CPD photolyases have been found in most organisms from all three life domains, while 6-4 photolyases are mainly reported in Eukarya. Photolyase binds to DNA containing a pyrimidine dimer in a light- independent reaction and flips the dimer out into its active site pocket. The light harvesting cofactor in the photolyase absorbs a photon and transfers the excitation energy to the catalytic cofactor, FAD. Then, the excited state of FAD (FADH-) transfers an electron to the pyrimidine dimer, splitting the dimer into two pyrimidines. The electron returns to the flavin radical to regenerate FADH- and the enzyme then dissociates from the repaired DNA (16).
Archaea encode homologues of both CPD and 6-4 photolyases and the photoactivation activity has been reported in several model archaeon including Methanobacterium thermoautotrophicum, Halobacterium NRC1 and S. acidocaldarius (17,18). Biochemical and structural study of the photolyase from S. tokodaii demonstrated that it is capable of removing CPD and its overall structure superimposes very well with other known photolyases (19). Furthermore, analysis of the dynamics of CPD post UV treatment in S. solfataricus showed that light highly accelerate the removal of CPD, suggesting a solid role of photolyase in the repair of photoproduct in archaea (20). Interestingly, genetic study of two Sulfolobus photolyases indicated that only CPD photolyase is important for DNA repair, as deletion of the gene coding for 6-4 photolyase does not impair DNA repair activity (18).
Alkylating agents react with DNA bases leading to the formation of a variety of cytotoxic and mutagenic covalent adducts ranging from small methyl groups to bulky alkyl adducts. MGMT (also called AGT or OGT) is the enzyme that removes the alkylation of guanine bases at the O6 position or O4 position, which has been widely identified in different bacterial and eukaryal organisms. Similar to the photolyase, MGMT has also been proposed to recognize
4
DNA damage by three-dimensional diffusion. After forming a low-stability complex with the DNA backbone at the damage site, it then flips out the damaged base into its active site cavity, wherein the methyl group is transferred to the cysteine in the active site. The enzyme then dissociates from the repaired DNA, but the C-S bond of methylcysteine is stable, and therefore, after one catalytic event the enzyme becomes inactivated and is accordingly referred to as a suicide enzyme (12).
The archaeal enzymes for direct reversal of DNA alkylation are mostly reported in thermophilic archaea. These enzymes include a DNA alkyltransferase from Pyrococcus sp. KOD1 (pyrpMGMT) (21), an O6-alkylguanine DNA alkyltransferase from S. solfataricus (ssoOGT) (22) and the AGTendoV from Ferroplasma acidarmanus (23). Specifically, the pyrpMGMT expressed in E.coli exhibited a remarkable thermostability and this enzyme surprisingly restored the resistance of an E. coli ogt-deficient strain towards alkylating agent (21,24), suggesting the functional conservation of these enzymes. Similarly, ssoOGT protein also shows stability at high temperature and the encoding gene was induced upon alkylation agent treatment. More interestingly, the F. acidarmanus AGTendoV consists of a fusion of the C-terminal active site domain of O6-alkylguanine-DNA alkyltransferase (AGT) with an endonuclease V domain. Functional study of this protein showed that it is capable of removing O6-methylguanine lesions in DNA via alkyl transfer action and cleaving DNA substrates that contain deaminated bases, uracil, hypoxanthine, or xanthine in a similar manner to E. coli endonuclease V. This bifunctional enzyme has been found in a number of archaeal genomes and the functional association of AGT and other DNA repair pathway here may represent a general adaptation to the harsh environments for thermophilic archaea (25).
EXCISION OF DNA DAMAGE Three main pathways including BER, NER and MMR repair the DNA damage by the ‘cut and paste’ mode. They remove a single strand DNA containing DNA base damage, double helix distortion and mispaired bases individually and fill in the ssDNA gap by DNA synthesis. The mechanism of these pathways is highly conserved across the organisms in both Bacteria and Eukarya, though the proteins involved in bacterial and eukaryal NER pathway show little homology.
Base excision repair
5
Bases with small chemical alternation including oxidized bases, alkylated bases, deaminated bases and uracil that do not strongly affect the DNA helix structure are subjected to BER. BER is probably the most conserved pathway from bacteria to eukarya, by a series of coordinated reactions executed by glycosylases, AP lyases, AP endonucleases and DNA synthesis related proteins. As illustrated in Fig. 2, after the recognition of DNA lesion by specific DNA glycosylase, which catalyze the hydrolysis of the damaged base from the sugar- phosphate backbone to generate an AP site (apurinic/apyrimidinic sites), the AP site is then excised or replaced by DNA synthesis. During the long patch BER, AP site is processed by AP endonuclease, which hydrolyzes the phosphoester bond at the AP site, giving substrate suitable for stand replacement synthesis. The 5’flap is then cleaved by Fen1 endonuclease, generating a ligatable nick sealed by DNA ligase. In short patch BER, it is the AP lyase makes the first incision at the AP sites and the ribose-phosphate backbone is then removed by dRPase, giving the single gap that is filled in by the BER specific DNA polymerase and sealed by DNA ligase (13,26).
Figure 2. Schematic representation of canonical base excision repair (BER) pathway.
6
Two canonical subtypes of BER pathway including Short Patch BER and long Patch BER are illustrated. The DNA fragments in red represent the new inserted or synthesized DNA base pair(s). Picture adapted from Grasso and Tell, 2014 (27).
All Uracil DNA glycosylases (UDGs) or other damage-specific DNA glycosylases are conserved from bacteria to eukarya and they contain two core domains. One is responsible for the activation of the catalytic water molecule and the other interacts with the DNA minor groove after flipping out the base, thus stabilizing the DNA–UDG complex (27). In E. coli, the AP lyase activity is exerted by Endo III (also known as Nth) and the same activity is executed by Nth and OGG1 in human, which incise the DNA between the phosphate and the deoxyribose at the 3’ of the AP site. The main AP-endonuclease in human is APE1 and the most well studied ones in E. coli are Exo III (also known as Xth) and Exo IV (known also as Nfo) (27).
A number of archaeal BER proteins have been characterized and the fundamental pathway is well conserved in all three domains of life (27). As summarized by Grasso and Tell (27), about 20 archaeal uracil-DNA glycosylases (UDGs) have been characterized and they all fall into four of the six known UDGs families. Compared to those mesophilic UDGs belonging to class II and VI, the thermophilic ones are of class IV and V, suggesting these UDGs have co- evolved with the specific environment (5). In addition, two specific endonucleases that recognizes deaminated bases or abasic sites have also been identified, with Endonuclease Q (EndoQ) cutting at 5′ from deaminated bases or abasic sites in the DNA strand and Endonuclease V (EndoV) incising at 3′ towards deaminated adenine (28). Many archaeal genome encode EndoV, a nuclease found in all three domains of life (29), while in contrast, EndoQ is mainly distributed in Thermococcals and not present in most bacterial and eukaryal organisms (30). Meanwhile, several archaeal homologs of endonuclease IV (EndoIV) and exonuclease III (ExoIII) show APE activity among which the S. islandicus ExoIII and EndoIV have been characterized both in vitro and in vivo (31). The comparative analyses demonstrated that the S. islandicus EndoIV enzyme is much more active than the ExoIII enzyme in vitro, and genetic analysis indicated ΔendoIV is much more susceptible to the alkylating agent MMS than ΔexoIII (31).
Archaeal DNA polymerases harbor an uracil stalling pocket, which enable them to stop at
7
Uracil in the template DNA, representing a ‘read-ahead’ proofreading function (32). Interestingly, the interaction between PCNA and UDG has been reported in both crenarchaeota and euryarchaeota (33,34). It was proposed that such an interaction enable those UDGs being recruited to replication fork once DNAP stalled at uracil (25). In addition, PCNA also stimulates the activity of DNA polymerase, DNA ligase I, and flap endonuclease in the crenarchaeota S. solfataricus (35). The physical and functional interaction between PCNA and EndoQ, as well as the Mre11-Rad50 complex was also described in Pyroccus furiosus (36,37). Thus the slide clamp in archaea may function as a platform for the coordination between BER, DNA replication and other DNA repair pathways.
Nucleotide excision repair Nucleotide excision repair (NER) pathway removes a spectrum of single-strand DNA lesions that cause local helix-destabilization. Two subtypes of NER have been reported based on whether damage detection is linked to transcription (transcription-coupled repair, TCR) or not (global genome repair, GGR). TCR removes transcription stalling lesions, and GGR localizes and repairs genomic DNA lesions globally (38).
As shown in Fig. 3, during bacterial GGR, the genome is scanned by the UvrA-UvrB complex for damaged nucleotides causing large conformational changes. For TCR, the repair process is initiated by a stalled RNA polymerase on an actively transcribed gene, followed by the recruitment of UvrA-UvrB to the site of lesions by TRCF (transcription repair coupling factor, Mfd). Both mechanisms then converge into the same pathway. After damage recognition, UvrA then dissociates from the complex (by ATP hydrolysis), leaving a stable UvrB-DNA complex for damage verification, which further recruits UvrC to incise the damaged strand on both sides of the lesion. UvrC contains two nuclease domains that cleave the phosphodiester bonds 8 nucleotides 5’ and 4-5 nucleotides 3’ relative to the damaged site. UvrD helicase is required for the removal of excised DNA strand (12-13 nucleotides in length) and release of UvrB and UvrC form DNA. At last, the DNA repair is completed by gap filling by DNA polymerase I and nick sealing by DNA ligase (39).
8
Figure 3. Schematics of nucleotide excision repair (NER) in Bacteria and Eukarya The molecular mechanism of NER in E. coli (a) and Human (b). The DNA lesion is indicated by black rectangular, which can be detected by RNAP during TCR or bacterial/eukaryal specific damage sensors during GGR. In addition, photolyase (Phr) assists the recognition of CPDs in E. coli and accelerates the rate of CPD repair. After damage recognition and verification, the ssDNA containing DNA lesions is incised and the resulting gap is then filled by DNA re-synthesis. Picture taken from Hu, 2017 (40).
Eukaryal NER shares a similar process with the bacterial one but employs a much more complicated machinery (at least 25 proteins) to repair the similar lesions. A number of proteins implicated in human disease Xeroderma pigmentosum (XP) play parallel roles as the bacterial Uvr proteins. As reviewed by Giglia-Mari et al. (41), the damage sensing is achieved by stalled RNAP in TCR and Cockayne syndrome factors B/Rad26 that couples the stalled RNAP with NER machinery. Lesion discrimination in GGR is executed by the concerted actions of two complexes: XPC/hHR23B and UVDDB (DDB1 and DDB2/XPE). Subsequent steps of TCR and GGR convergent into the same pathway. TFIIH, a bi-directional helicase, is then recruited to open the damaged strand over a stretch of approximately 30 nucleotides. The unwound DNA is further stabilized by XPA and RPA. Two structure specific nucleases XPG and ERCC1-XPF then incise the damaged strand 3’ and 5’ respectively, giving a 25-30 nucleotides gap, which was then filled by DNA polymerase δ/ε. The NER repair is completed by sealing the final nick by DNA ligase 1 or DNA ligase 3 (42).
Phylogenetic analysis of NER proteins revealed that homologues of bacterial UvrABC proteins are only found in certain mesophilic archaea, which are probably acquired by the
9
HGT (horizontal gene transfer) from bacteria. In most archaea, especially for those thermophilic archaea, a number of homologues of eukaryal NER nuclease (XPF and Fen- 1(XPG)) and helicase (XPB and XPD), with an archaeal specific nuclease Bax have been identified, suggesting the existence of an eukaryal-type NER pathway in archaea (9). However, the proteins for damage recognition or verification, including the homologues of CSB, XPA and XPC, are absent in all archaea (38).
Biochemical characterization of these archaeal XP homologues showed that these proteins are competent to function as a nuclease or helicase. For example, archaeal XPD protein unwinds DNA from 5’ to 3’ and crystallographic analysis of the archaeal XPD revealed a striking similarity to the structure of eukaryal ones (43). Interestingly, archaeal XPB homologs form a complex with archaeal structure-specific nuclease Bax1, and the complex unwinds and cleaves model NER substrates in vitro (44). More recently, it was proposed that the XPB-bax1 complex may actually function as a dsDNA translocase-nuclease machinery by binding at the site of helix-destabilizing lesions, opening the bubble via XPB’s ATP-dependent translocase activity and cutting at the lesions by bax1(8).
In contrast, the results of the genetic analysis of archaeal NER system are particularly confusing. On the one hand, the photoproducts induced by UV light was efficiently removed under the dark environment, suggesting the existence of the dark repair pathway to deal with photoproducts (20,45). On the other, the deletion of individual xpb, bax1 and xpd genes hardly affect the host’s sensitivity towards DNA damage agent (10,46). While in contrast, the deletion of uvrA, uvrB or uvrC in Halobacterium NRC-1 leads to an increased sensitivity, though this organism also encodes a set of eukaryl-type NER proteins (47). Moreover, it was found that the removal of photoproducts in Sulfolobus lack strand specificity (20,45), suggesting the lack of TCR system or that GGR and other DNA repair pathway works much more efficient than that of TCR, so that the strand biased repair fails to be detected. Taken together, the debate about whether there is a functional NER in archaea will persist before more genetic studies are reported.
Mismatch repair DNA mismatch repair (MMR) is a highly conserved process in both bacteria and eukarya and
10 is primarily responsible for the repair of base-base mismatches and insertion/deletion generated during DNA replication and recombination. E. coli MutS and MutL and their eukaryal homologs, MutSα\β and MutLα\β\r, are key players in MMR-associated genome maintenance (48).
Figure 4. Schematic representation of DNA Mismatch repair (MMR) In E. coli, the mismatch-activated MutS-MutL-ATP complex licenses MutH to incise the nearest unmethylated GATC sequence. Then the joint action of nucleases and UvrD helicase are responsible for the removal of the DNA strand containing DNA lesions. The resulting gap is filled by DNA synthesis. The Picture was adapted from KEGG pathway: http://www.genome.jp/kegg- bin/show_pathway?ko03430
As illustrated in Fig. 4, the first step in bacterial MMR involves the damage recognition and binding by MutS, which then interacts with MutL to enhance mismatch recognition and distinguish between the template strand and nascent strand. In E. coli, the lack of adenine methylation in nascent strand allows MutH to specifically incise the unmethylated daughter strand at hemimethylated dGATC site. Following generation of the strand specific nick, UvrD
11 is loaded at the nick and unwinds the duplex from the nick towards the mismatch, generating single strand DNA breaks, which are rapidly protected by SSB proteins. The removal of mismatched bases are performed by single strand exonucleases from either direction by Exo I or Exo X (3’-5’ exonuclease) and Exo VII or Rec J (5’-3’ exonuclease)), which excise the nicked strand from the nicked site (the dGATC site) up to or slightly past the mismatch. The resulting single-stranded gap is filled by DNA re-synthesis by DNA polymerase III and the repair is completed by DNA ligase (48). MMR in eukarya shows strong similarities to that in bacteria, although it involves several different MutS and MutL homologs with more specialized roles individually. Importantly, In E. coli, hemi-methylated dGATC sites determine the strand specificity of the repair, and the mechanism for eukaryal MMR machinery to discriminate the template and nascent strand during the repair remains unclear (49).
So far, the MutS/L homologs are only identified in several mesophilic archaeal lineages (50) and these archaeal MutS/L homologs were reported to be dispensable for the maintenance of a low mutation rate in H. salinarum NRC-1 (51). However, the lack of the canonical MMR in archaea is not reflected in high mutation rates (52), suggesting the existence of an alternative pathway independent of MutS/L proteins for the maintenance of genomic fidelity in archaea. Recently, a NucS homolog from Pyrococcus abyssi was shown to be capable of cleaving mismatched DNA base pairs on both strands and was named EndoMS (53). The following structural analysis demonstrated that EndoMS flips out mismatched bases into its binding sites, and cleaves the DNA backbone in a manner reminiscent of type II restriction enzymes (54). Interestingly, the phylogenetic analysis revealed that EndoMS homologues are present in archaeal lineages that lack functional MutS homologs and also in the organisms from Actinobacteria, which also lack MutS homologs (53), suggesting that EndoMS may mediate the alternative MMR pathway. Consistently, a more recent study in actinobacteria revealed the function of NucS homologs from Mycobacterium and Streptomyces in preventing mutations and inactivation of the NucS homolog in M. smegmatis leads to a similar hypermutation phenotype as that observed in canonical MMR-null mutants (55).
As reviewed by M. White and T. Allers, the generation of DSB by EndoMS at the mismatched base pairs could be an advantage in terms of strand discrimination, as both
12 strands will then be processed for HRR repair (43). In consistence with that, the homologue of EndoMS has been found in an operon with RadA recombinase in P. abyssi (56). However, the generation of DSB is a risky strategy especially for those non-polypoid organisms and the in vivo mechanism of archaeal EndoMS-mediated pathway remains further investigation.
REPAIR OF DOUBLE STRAND DNA BREAKS DNA damaging agents, ionizing radiation and cell’s attempt to replicate the lesion-containing templates induce DNA double-strand breaks (DSBs). These lesions are extremely cytotoxic, as the cell cannot rely on simply copying the information from the undamaged strand. For example, it has been shown that a single DSB is sufficient to kill E. coli cell (57) and causes cell cycle arrest in human cell (58). Two main pathways including HRR (Homologous recombination repair) and NHEJ (non-homologous end-joining) have been evolved for the repair of DSBs, depending on whether homologous templates are required or not.
NHEJ is best studied in eukaryal models and is an error prone process, by connecting the two ends of DSB directly. As reviewed by Lieber et al. (59), the DNA ends of DSB are recognized by Ku70/80, which form a molecular scaffold for the recruitment of other core factors. In vertebrate cells, Ku then recruits DNA-PKcs forming the holoenzyme together with the DNA ends, which promotes DNA-end tethering and recruitment of additional NHEJ factors. The ligatable ends are often prerequisite for end ligation by NHEJ, which is facilitated by certain nucleases and polymerases. For example, Artemis nuclease is activated and recruited to the DNA ends upon DNA damage and facilitates the end processing in preparation for ligation. Yeast cells lack both DNA-PKcs and Artemis, instead the MRN complex plays a more substantial role in broken DNA end recognition and end processing. At last, the DNA ends will be sealed by protein complex consisting of XRCC4, XLF and DNA ligase IV (59).
Compared to NHEJ pathway, the HRR pathway is a highly faithful way to repair DSB, with a prerequisite of homologous chromosomal template. As a result, it acts exclusively in S- and G2-phase during the cell cycle. In contrast, those post-mitotic cells and cells in G1 phase have to seal DSBs by NHEJ (41). As demonstrated in Fig. 5, HRR starts with the resection of DNA ends at the DNA breaks by the MRN/X complex in eukarya or RecBCD/AddAB proteins in
13 bacteria (5’-3’ resection). In eukarya, there are additional factors responsible for further extensive end processing (ExoI or Dna2 in conjunction with BLM). End resection of DSB produces the 3’ single strand overhang, which is then coated by RecA/Rad51 recombinase (RecA in bacteria and Rad51 in Eukarya) forming the nucleoprotein filament (60). The loading of RecA/Rad51 to ssDNA is assisted by recombination mediators such as BRCA2 (breast cancer early-onset 2) or Rad52 in eukarya and RecBCD or RecFOR in bacteria. Then the homology search and strand invasion will be performed to form D-loop (displacement loop) or holiday junction structure upon second strand capture, which is then resolved by helicase and the structure-specific nuclease, to finally produce crossover or non-crossover products (61).
Figure 5. Schematic representation of DSB-initiated homologous recombination repair (HRR) HRR is initiated by end resection of DSB, giving a 3′ single-stranded end that then coated by RecA/Rad51. The DNA-RecA/Rad51 filament can invade a homologous template to initiate repair, with the assistance of recombination mediators. The stand invasion and subsequent DNA synthesis
14 lead to the formation of D-loop or holiday junction (HJ) structure upon second strand capture. These structures can be resolved by helicases and structure specific nucleases, giving a non-crossover or a crossover product. Picture adapted from Moynahan et al., 2010 (62).
Archaea encode an eukaryal-type machinery for HRR, so far homologs of the core factors of eukaryal HRR including Rad51 (RadA in Archaea), Rad50 and Mre11 have been identified in all archaeal genomes (60). However, the homologues of nuclease and helicase for extensive end processing in eukarya are not found in Archaea. Instead, in many archaea, genes coding for Mre11 and Rad50 are clustered with two other genes coding for a hexameric helicase of the FtsK superfamily (HerA) and a 5′-3′ exonuclease (NurA) (63). These proteins form a complex with Mre11-Rad50 and the resulting complex is capable of DSB end recognition and resection (8). In addition, a Rad54 homolog from S. solfataricus (64) and a RadA paralog (RadB) from H. volcanii (65) have been shown to interact with RadA and function as the recombination mediators. Meanwhile, several helicases and nucleases including Hjm (Hel308) helicase and Hjc (Holliday junction cleavage), Hje (Holliday junction endonucleases) nucleases implicated in branch migration or HJ cleavage were also identified (66-68). Specifically, Hjm (Hel308) was suggested to function in D- loop step and implicated in the restart of stalled DNA replication fork (69). Hjc is specific for the four-ways DNA structures and has been reported to interact with a number of DNA repair proteins including RadA paralogue RadC2, Hjm (Hel308) and a novel ATPase potentially function in strand migration (SisPINA) (43). Hjc and Hje share the same fold similar to the type II endonucleases, however, the deletion of hje, but not hjc in Sulfolobus islandicus renders the mutant more sensitive to DNA damaging agents (67).
More importantly, genetic analysis by gene deletion showed that mre11, mad50, herA, nurA, radA and hjm, or at least one of the hje and hjc are indispensable for cell survival of Sulfolobus (10,67,70), indicating that HR activity is essential for Sulfolobus. Interestingly, it was proposed that many of bacteria (for example, Mycobacterium tuberculosis, Mesorhizobium loti, Sinorhizobium loti) that possess NHEJ factors spend a significant portion of their life cycle in a stationary haploid phase, in which a template for recombination is not available (71). However, Sulfolobus species have two copies of chromosome for the majority of their life cycle, which is probably a result of adaptation on HRR for their survival.
15
DNA DAMAGE TOLERANCE BY TRANSLESION DNA SYNTHESIS Persisting DNA lesions not removed by any of the repair pathways can interfere with DNA replication. To mitigate the deleterious effect by arresting DNA replication machinery (Prolonged stalling of replication forks leads to fork collapse, generating cytotoxic DSBs), cells also evolved the machinery for tolerating DNA lesions, leaving the damage to be repaired at a later time point (41).
The key players of DNA damage tolerance pathways are enzymes performing translesion DNA synthesis (TLS). During TLS, the stalled replicative polymerase is replaced by TLS polymerase, which is featured by low-processivity but with capability to bypass DNA lesions. The principal polymerases in TLS are those belonging to Y family. As reviewed by Sale (72), though Y family polymerases show virtually no sequence homology to those from other families, they adopt similar overall structure as the replicative polymerase (replicases). An extra little finger domain is encoded in Y family polymerase, and the remaining domains are much smaller than those in replicases. As a results, the active site is much more spacious and solvent exposed, which enable it to accommodate large bulky DNA lesions (72). A thermodynamic model proposed that high fidelity polymerase enhances WC (Watson-Crick) base pairing by partially excluding bulky water from the active cleft (73). In contrast, those error prone Y family polymerases are believed lacking the ability to discriminate WC and non-WC base pairs, with an open active site that enable the water to compete with nucleobases in forming hydrogen bonds (7). In addition, Y family polymerases lack a 3-5’ proof-reading exonuclease domain and all these features together can result in an incorporation error rate up to 1/10 (74).
As described in detail by Sale (72), many of the 15 known polymerases in eukarya have the capability to promote some degree of TLS. The principal eukaryal TLS polymerases are Polη/Rad30 and Rev1 belonging to Y-family, along with the B-family Polζ. Polη is the most well studied one and plays a key role in bypassing CPDs by accommodating both bases of CPD in its active site and incorporating two A opposite to TT. By acting as a molecular splint to straighten the kinked DNA backbone, Polη accurately copy the covalently linked pyrimidines of CPD. However, Polη is incompetent to completely replicate the template with the more highly distorted lesion, 6-4 photoproduct. The ability of Polη to accommodate a
16 dinucleotide lesion in its active site also contributes its capability to replicate intrastrand crosslink at G-G caused by cisplatin (72). Rev1 was proposed to be a G template specific DNA polymerase and it inserts dC opposite an abasic site and N2-adducts of guanine. However, it does not directly pair the incoming dC with the template. Instead, it swings the template’s dG out of the helix and temporarily coordinates it with its little finger domain. At the place of the template dG, Rev1 places its own Arg324 residue, which form hydrogen bond with incoming dC. This mechanism allows it to bypass bulky dG adducts and act as a template-independent dC transferase. Rev1 has also been implicated in replicating DNAs forming secondary structure, and it is required for budding yeast’s survival after exposure to G-adducting agent 4-NQO (72). DNA polymerase ζ (Polζ) is a multi-subunit B family polymerase related to replicative ones, but lacks proofreading exonuclease activity. It was proposed to function in the extension step of translesion DNA synthesis (TLS). Besides, genetic studies suggest a role of Polζ to bypass 6-4 photoproduct and a function in the recombination associated DNA synthesis in S. cerevisiae, which is an error prone process dependent on Polζ (72).
Translesion DNA synthesis in E. coli is mainly performed by Pol IV (DinB), Pol V belonging to Y-family and Pol II from B-family. All three polymerases are induced upon DNA damage, and among them Pol V is the major TLS polymerase in E. coli. UmuC is one of the subunit of Pol V, but when it interacts with the other subunit UmuD, it forms an inhibitory complex that cannot perform translesion DNA synthesis. Upon DNA damage, RecA mediates the cleavage of UmuD2C to UmuD2’C, which leads to the activation of the Pol V polymerase (UmuD’2C complex). Pol II has been implicated in translesion synthesis of abasic sites, interstrand crosslinks, and 3, N4-ethenocytosine adducts, although it has a 3’-5’ exonuclease domain. However, the pol II mutant is not UV sensitive and the in vivo functionality of Pol II is under debate. Nevertheless, the DNA replication was blocked following UV treatment in the pol II mutant, but not the wild type strain at the early time during the cell’s response, suggesting it may has a function in replication restart (75). Pol IV (DinB) orthologue is the most ubiquitous Y-family polymerase found in all three domains of life. Pol IV efficiently and accurately bypasses adducts on the N2 position of deoxyguanosine and even bypasses N2-N2-guanine interstrand cross-link with high fidelity and its processivity dramatically increases upon interaction with the β-clamp (75,76). Strikingly, expression of noncleavable UmuD (together
17 with UmuC) contribute to the survival upon DNA damage by delaying the resumption of DNA replication and ovexpression of Pol IV was shown to rapidly block the replication fork movement, suggesting these two TLS polymerases may have a function in checkpoint control during SOS response (77).
The potential mutagenicity of TLS polymerases means that the activities of them must be under carefully controlled. As reviewed by Goodman and Woodgate (76), in E.coli, the transcriptional control is primarily used to modulate the activity of TLS. The promoter region of pol II and pol IV binds lexA relatively weakly and are induced early in the SOS response. In contrast, the promoter of the umuDC operon (polV) has one of the tightest LexA-binding sites and is induced later in the response (76). Given their early induction and relatively high basal expression level in the absence of DNA damage, it was proposed that Pol II and pol IV participate in most error-free TLS at specific DNA lesions. In contrast, Pol V undergoes very later induction, suggesting that E.coli only uses it as a last resort, once all other error-free repair pathways have been exhausted. In addition, the error-prone pol V undergoes post- translational regulation upon DNA damage. Firstly the UmuD activation is trigged by RecA- filament mediated auto-proteolysis. then, UmuD, UmuD’ and UmuC proteins are rapidly degraded by Lon and ClpXP protease (76). In contrast, much of the regulations of TLS in eukarya rely on post-translational modification (PTM) or specific protein interactions that target those TLS polymerases to sites of damage. For example, ubiquitination of human Polη (or RAD30 in yeast) leads to rapid degradation by proteasome and the interaction between Y family polymerase and PCNA is enhanced upon DNA damage by monoubiquitination of PCNA (76).
The DinB family polymerases is the only group that has been found in all three domains of life (78). Archaeal Y family polymerases are mainly found in Crenarchaea and are proposed to be only present in those archaeal organisms exposed to UV-light (50). Y family polymerases from S. solfataricus (Dpo4) and S. acidocaldarius (Dbh) have been crystallized with different DNA substrates and widely used as a model to study the mechanism of lesion bypass. The structural and biochemical characterization of Sulfolobus Dpo4 demonstrated that it is capable of bypassing a broad spectrum of DNA lesions including abasic site, (deoxyguanosin-8-yl)-1-aminopyrene, benzo (a) pyrene diol epoxide, 8-oxoguanine,
18 methylguanine and benzylguanine, and thymine dimers (79). However, disruption of the gene coding for the Dbh in S. acidocaldarius does not affect cell’s resistance towards several DNA-damaging agents and the overall rate of spontaneous mutation of a target gene (80). Nevertheless, the disruption of this gene does lead to the change of mutation spectrum. Specifically, there are lower frequencies of small indels (insertion and deletion), but higher frequencies of BPSs (Base pair substitution) in dbh- strains. It was thus proposed that Dbh play both mutagenic and anti-mutagenic roles in vivo (80). Apart from Dpo4, two PolB family proteins Dpo2 and Dpo3 from Sulfolobus also show DNA lesion bypass activity in vitro and Dpo3 strikingly bypasses CPD more efficiently than Dpo4 (81). In addition, archaeal PriS (primase small subunit) was shown to bypass the oxidative DNA lesions, such as 8-Oxo-dG (one of the major products of DNA oxidation) and faithfully bypass UV-induced CPD. It was thus concluded that, apart from the de novo primer synthesis, PriS may assist the major replicases during elongation step of TLS (82).
DNA DAMAGE RESPONSE IN THREE DOMAINS OF LIFE
ATM/ATR MEDIATED DNA DAMAGE SIGNALING PATHWAYS IN EUKARYA DNA repair activity in eukarya has been intricately linked to other cellular processes. Upon DNA damage, a well-coordinated signaling cascade that mediates diverse cellular responses is initiated. Briefly, DNA damage sensors recognize the DNA lesions and the damage signals is then transmitted to effectors by transducers, to prompt DNA repair activities, activate cell cycle checkpoint control and regulate other cellular processes to ensure sufficient DNA repair. These cellular responses are principally coordinated by three evolutionally conserved kinases including Ataxia-telangiectasia mutated (ATM), ataxia-telangiectasia and Rad3-related (ATR) and DNA-dependent protein kinase catalytic subunit (DNA-PKcs) (DNA-PKcs is only reported in vertebrate cells), belonging to phosphoinositide-3-kinase-relatedprotein kinase (PIKK) family. Once these kinases are activated, hundreds of substrates are phosphorylated at Ser/Thr-Glu motifs and additional sites in an ATM- or ATR-dependent manner, whereas DNA-PK appears to regulate a smaller subset of targets and mainly functions in NHEJ pathway. The functional regulations of downstream DDR effectors by ATM/ATR and by their downstream kinases including checkpoint kinase 1 (CHK1) and checkpoint kinase 2 (CHK2) largely shape the cellular responses important for genomic stability (83,84).
19
Figure 6. ATM/ATR signaling pathways in Eukarya. ATM mediated cellular responses towards DSB (a) and ATR orchestrated responses towards ssDNA or replication stress (b). Following DSBs ATM is predominantly activated through interactions with NBS1 of the MRN complex. Activated ATM initiates a signaling cascade at the site of damage, including PTM of chromatin proteins and others factors that important for DNA repair such as CtIP- BRCA1 and 53BP1. ATM also activates the function of CHK2 and p53, leading to cell cycle arrest. ATR is activated by a range of cellular processes that produce ssDNA. Activated ATR also activates cell cycle checkpoint by phosphorylating CHK1. Meanwhile ATR contributes the stability of replication fork and regulates DNA replication upon DNA damage by several mechanisms. Picture adapted from Brown et al., 2017 (85) and Blackford and Jackson, 2017 (86).
ATM, the master regulator orchestrates global cellular responses to DSBs ATM is best known for its role in response to DSB by coordinating the DNA repair, cell cycle checkpoint activation, alternation of chromatin structure and other cellular processes (Fig. 6). Under non-stressed condition, ATM exists as multimeric form, which is dissociated into active monomers upon DNA damage (87). Meanwhile, following DSB induction, a proportion of nuclear ATM is rapidly auto-phosphorylated and mobilizes to the sites of damage. MRN complex, which specifically recognizes DSBs, was proposed to recruit ATM
20 to DNA damage sites and activate ATM via the interaction between NBS1 and ATM (88,89). Nevertheless, MRN-independent activation of ATM also existed and there are several interesting insights into the mechanism of ATM activation. (a) The initial trigger of ATM activation is a chromatin conformational change that induced by DSB (87). (b) The mere tethering of ATM or several DDR players to undamaged chromatin is sufficient to induce the ATM-dependent DDR (87). (c) Direct interaction between ATM and broken DNA, specifically a contact with the single stranded stretches at DSBs is required for its activation (88,90).
Once activated, ATM leads to phosphorylation of hundreds of substrates implicated in diverse cellular processes. Furthermore, ATM phosphorylates and activates other protein kinases that phosphorylate yet more substrates, meaning that ATM-dependent signaling events are not just restricted to the factors directly phosphorylated by ATM (91). One of the best studied substrates of ATM is CHK2, which is rapidly phosphorylated by ATM upon DNA damage. Activated CHK2 phosphorylates downstream targets including CDC25A phosphatases, which dephosphorylate and thus activate the cyclin-dependent kinases (CDKs). The phosphorylated CDC25A is then subjected into proteasomal degradation, which leads to the inactivation of CDK, thus halting cell cycle progression. Meanwhile, ATM initiates a signaling cascade involving phosphorylation, ubiquitination and methylation of the chromatin proteins close to damaged sites. For example, the histone variant H2AX is phosphorylated by ATM in response to DSB, and Phosphorylated H2AX (γH2AX) can be recognized by MDC1, which is also phosphorylated and stabilized on chromatin by ATM upon DNA damage. Phosphorylated MDC1 is then recognized by NBS1, thus prompting the retention of MRN complex on
γH2AX sites. MRN complex further recruits ATM and this mechanism allows ATM to phosphorylate additional H2AXs, which in turn binds additional MDC1 in a repeated process that spread the focus. Importantly, γH2AX marked chromatin is transcriptionally inactive and it may represent a mechanism to prevent collision between the DNA replication and transcription machinery (83). Meanwhile, ATM also phosphorylates the key factors in HRR and NHEJ, such as the BRCA1 and CtIP that promote DNA end resection, and 53BP1 that functions in DNA-end bridging (92), thereby facilitating the repair of the DSBs.
Apart from the functional regulation of DDR effectors by phosphorylation, a significant
21 proportion of regulations by ATM upon DNA damage are achieved via transcriptional responses. Briefly, ATM directly phosphorylates p53 or other proteins that directly or indirectly regulate p53 stability, thereby affecting the protein level of this global transcriptional regulator. The functional regulation of p53 leads to the activation or repression of different transcriptional programs that collectively promotes cell survival by cell-cycle arrest/DNA repair activation at lower DNA damage or leads to senescence or apoptosis upon overwhelming DNA damage (91).
ATR, an essential Kinase for cell’s responses to replication stress In contrast to ATM and DNA-PKcs, which respond primarily to DSBs, ATR is activated by a much wider range of genotoxic stresses that induce the accumulation of ssDNA. ATR is recruited to RPA-ssDNA by its partner protein ATRIP. Such RPA-coated ssDNA is generated by nucleolytic processing of various forms of damaged DNAs or by helicase-polymerase uncoupling at stalled replication forks. The ability of ATR-ATRIP binding to ssDNA-RPA renders it as potent factor that senses diverse DNA damage and replication stress (86). However, ATR recruitment to RPA-ssDNA is not sufficient for optimal activation of ATR and a number of extra factors are also required. For example, as illustrated in Fig. 6, the junction of ssDNA and dsDNA is also important for ATR activation. SsDNA/dsDNA structures are recognized by Rad17-RFC clamp loader, which interacts with the Rad9-Rad1-Hus1 (9-1-1) clamp and loads the clamp into ssDNA-dsDNA junction. The 9-1-1 complex further recruits TopBP1, one of the best characterized ATR activators. The following phosphorylation of Rad9 of 9-1-1 complex facilitates the association of TopBP1 with ATR, leading to ATR activation. The recognition of ATR-ATRIP to RPA-ssDNA, as well as the loading of 911 at junction of ssDNA-dsDNA, are both required for TopBP1 to stimulate ATR activation at the site of DNA damage. These factors act together to ensure optimal ATR activation (86,93).
Interestingly, DSBs also activate ATR and it was suggested that ssDNA from end resection process may be the key trigger for ATR activation at DSB. The resection of DSB requires the action of several nucleases including MRN and its associated CtIP, ExoI and Dna2. More interestingly, an in vitro study revealed that during resection, the region of ssDNA is gradually increased and the DSB-induced DNA damage signaling switches from an ATM-activating mode to ATR-activating mode (93). Consistently, the in vivo study revealed that the change of
22 the amount of nucleases involved in resection process including CtIP and ExoI, also leads to the switch between the phosphorylation of CHK2 and CHK1, which are specific substrates of ATM and ATR respectively. Moreover, the inactivation/depletion of ATM, Mre11, CtIP, ExoI, and Dna2 all leads to diminished ATR responses to DSBs (93).
Activation of ATR initiates a signaling cascade that coordinates cell cycle progression and DNA metabolic processes. Similar to ATM, ATR also contributes to the proteasomal degradation of CDC25 phosphatases by activating its downstream kinase CHK1, thereby arresting cell cycle. Moreover, ATR activity is important for responses to DNA replication stress by following mechanisms. First, upon DNA replication stress, ATR-dependent phosphorylation of FANC1 and CHK1-dependent inhibition of CDKs inhibit the firing of dormant origins, which may avoid the exhaustion of replication and repair factor pools, especially RPA. Then ATR prevents replication fork collapse by phosphorylating SMARCAL1 helicase (SWI/SNF-related, matrix-associated, actin-dependent regulator of chromatin, subfamily A-like 1). Meanwhile, ATR also regulated the cellular dNTP level by transcriptional modulation and PTM of the RNR subunit RRM2 (86).
MULTILAYER REGULATIONS OF THE SOS RESPONSE IN BACTERIA DDR in Bacteria is best represented by the LexA mediated SOS-response in E.coli. As illustrated in Fig. 7, upon DNA damage, the accumulation of ssDNA will be protected by SSB protein, followed by the replacement with RecA and formation of RecA-ssDNA filaments, which then interact with LexA, a repressor binding to the SOS box in the promoter of SOS genes. The interaction leads to the conformational change of carboxyl terminal region of LexA, thereby facilitating the auto-lysis of the repressor. The cleavage of LexA decreases its affinity to DNA, thereby leading to the transcription activation of a cascade of SOS genes involved in DNA repair, cell cycle arrest and DNA damage tolerance (94,95).
As described in detail by Erill et al. (95), ssDNA generated by replication fork arrest or other processes is believed to be the main trigger of SOS response. recA and ssb genes are firstly induced to protect and stabilize the fork, followed by the induction of DNA repair genes (uvrAB, ydgQ, uvrD, recN and ruvAB) involved in NER and HRR pathway. For the unrepaired DNA lesion upon severe DNA damage, Translesion synthesis polymerases (Pol IV
23 and Pol V) will be induced to bypass DNA lesions. Meanwhile, the negative regulator for septum formation, SulA, is also induced, which inhibits cell division by interacting with FtsZ (95). The molecular explanation of the later induction of SulA and UmuCD is that the increasing level of DNA damage leads to progressively increasing RecA-promoted LexA proteolysis and thus decreasing LexA concentrations. As a result, a relatively high level of DNA damage is necessary to achieve a very low LexA concentration that allows induction of genes such as sulA and umuC/D, as they have very strong LexA binding sites. This arrangement allows the SOS response to be graded, for example, facilitating DNA repair without blocking cell division or inducing TLS polymerase activity at low levels of DNA damage (96).
Figure 7. Schematic representation of the SOS response in E. coli. The basic circuitry of the E. coli SOS response is illustrated. RecA protein-DNA filaments serve as a co-protease to activate self-cleavage of the LexA protein, the global repressor of SOS. The cleavage of LexA activates the expression of a cascade of SOS genes functioning in DNA damage repair, cell cycle regulation and TLS. Picture taken from Kreuzer, 2013 (96)
24
The induction fold also varies from gene to gene. For example, sulA gene is induced to ca.100 folds, while the NER genes are only induced to 4-5 folds. This level of regulation has been attributed to the diversity of SOS box present in the promoter of SOS genes, including the different locations of the motif, sequence variation of the motif and existence of additional regulatory elements. For example, SOS box has been mapped to -10 region of sulA and umuDC promoter, between -10 and -35 region of recA and uvrB, and -35 region of uvrA. This allows for another layer of regulation throughout SOS (94).
Similar to p53 in eukarya, the LexA level is also under the dedicated control. With the absence of DNA damage, LexA Protein limits the expression of itself by binding to the promoter region of lexA gene. LexA also regulates transcription of the recA gene, which is important for the overall function of the circuit. Upon DNA damage, increased RecA protein level promotes recombination repair and effective cleavage of LexA, whereas increased LexA expression allows the SOS response to be rapidly shut off when the inducing signal of ssDNA wanes (96).
The binding of RecA to ssDNA is also regulated to achieve the fine-tuning and temporal modulation of the SOS response. For example, RecBCD recognizes the double strand ends and performs end processing to generate the ssDNA substrate for RecA. RecFOR recognize DNA nicks and gaps during replication of damaged templates, and then it assists the loading of RecA onto ssDNA. In vitro data revealed that SSB has a strong affinity for ssDNA and thus prevents RecA binding. The replacement of SSB by RecA only occurs when RecO and RecR were added. Consistently, in vivo result showed that the E.coli strain lacking RecFOR exhibits delayed SOS induction. In addition, the SOS-inducible gene product, DinI, stabilizes the RecA/ssDNA filaments, Whereas the antagonistic function of RecX, and RdgC (recombination dependent growth) proteins prevents the formation of RecA-ssDNA filament (97).
In a broader view, the number and the type of genes found in the SOS regulon also vary among different bacterial species. For example, SOS regulation in E. coli consists of over 40 genes, while LexA regulon in gram-positive model organism (Bacillus subtilis) comprises 33 genes, among which only 8 are corresponding counterparts of those of the E. coil system. An
25 extreme example is represented by Pseudomonas aeruginosa, where the genes in SOS regulon is reduced to 15 genes and NER gene are no longer included any more. Comparison of SOS regulons among various bacterial species has uncovered the LexA-regulated core regulon, which only comprises RecA, UvrA, ruvAB and RecN (98), suggesting that beyond basic induction mechanism, many details of the SOS response may have been evolved to fit the specific needs of different species thriving in diverse environment.
CELLULAR RESPONSES TOWARDS DNA DAMAGE IN ARCHAEA Proposed DNA damage sensors in Archaea DNA damage sensing and subsequent signal transduction are of crucial importance for organisms in Eukarya to initiate the DNA damage response. However, how the signal of DNA damage was sensed in archaea remains largely unclear. Nevertheless, there are a few candidates that have been proposed to function as the potential DNA damage sensors.
The biochemical characterization of single strand DNA binding protein (SSB) from S. Solfataricus showed that it is competent to discriminate and destabilize the DNA with lesions or mismatched base pairs (99). Furthermore, it was proposed that the local binding of SSB proteins at damage sites may lead to the recruitment of other factors for DNA repair. Consistently, the carboxyl terminal tail of SSB has been shown to interact with a spectrum of proteins, including MCM, reverse gyrase, NurA and PirA helicase (100-103). The ssDNA in bacteria and eukarya has the potential to trigger SOS and ATR pathway correspondingly, as normally the large amount of ssDNA should not exist in a cell and its presence indicates the existence of a stalled replication fork, resection of damaged DNA, or occurrence of the recombination (9). For these reasons, SSB may also mediate the signal transduction of DNA damage response in archaea by functioning as a ssDNA sensor.
The ability of Mre11-Rad50 complex to specifically recognize free DNA ends enables it as the potential sensor for DSBs. Interestingly, both Mre11 and RadA in Sulfolobus are immediately recruited to DNA and remain DNA-bound during the course of DNA repair following γ-irradiation. It was thus proposed that these two factors could function as the DSB sensors (104). Moreover, Sulfolobus Mre11-Rad50 complex undergoes extra posttranslational methylations post γ-irradiation (105) and the complex in Haloferax volcanii was implicated in
26 both the repair of DSBs and the compaction of the nucleoid post DNA damage (106,107), suggesting this complex may mediate the damage sensing and subsequent DNA repair processes.
In addition, the uracil-stalling feature of archaeal polymerase enables it as the detector of uracil in DNA templates (108). Meanwhile, the replicative DNA polymerases will stall at the DNA lesions and thus it can also function as a general DNA damage sensor. In a recent model proposed by Grogan (9), it was suggested that once DNAP stalled at the bulky lesions, the stalled DNA replication fork could be cleaved by 3’-flap endonuclease (XPF or ssDNA endonuclease) or 5’-flap endonuclease (XPG/Fen1), leaving the bulky lesions at a position close to the double strand DNA end, which can be processed by end processing enzymes such as Mre11-Rad50. If proven to be true, this strategy will represent an adaptation for those organisms without a functional NER pathway to remove bulky DNA lesions.
Similar to DNAP, RNAP can also function in DNA damage sensing. RNAP from Thermococcus kadarenis has been reported to stall at DNA lesions on the template during transcription (109), indicating a general role of RNAP as the DNA damage sensor in all three domains of life. Nevertheless the coupling factors that link the stalled RNAP and downstream DNA repair pathway remain to be identified.
UV-induced genome-responsive expression in Archaea Though the lack of defined signal transduction pathway, previous microarray-based transcriptome analysis of model archaeon do reveal a global transcriptional change following UV irradiation in Halobacterium NRC1 (110) and Sulfolobus species (111-113), suggesting the existence of a cellular response towards UV light in archaea. These findings have concluded the absence of a SOS-like response in archaea, as very few genes that function in DNA repair were significantly induced during UV-responses. Specifically, analysis of UV- response in Halobacterium NRC-1 revealed that there is a moderate increase of radA gene expression, but a coordinated change in other DNA repair genes is not observed. Strikingly, nearly 12% of all genes in Halobacterium NRC-1 implicated in diverse cellular processes were down-regulated at 60 min post the UV treatment and it was proposed that the global repression of metabolism during DNA repair might be a general stress-response mechanism
27 shared by all three domains of life, to maintain internal homeostasis (110).
Whole-genome transcriptome analysis in Sulfolobus revealed that a number of genes implicated in cell cycle progression, transcriptional regulation, translesion DNA synthesis and diverse metabolism are dramatically induced or repressed during UV-response. For example, cell cycle related genes were downregulated post UV irradiation. These genes encode Orc1-1 and Orc1-3 that function in DNA replication initiation and the eukaryal-like ESCRT-III system that mediate cell division process. In contrast, Orc1-2, a paralog of Orc1/Cdc6 protein, was dramatically induced by UV, and it was thus suggested that Orc1-2 may function in the negative regulation of DNA replication initiation upon DNA damage. In addition, a paralog of TFIIB protein (TFB3) was also highly induced by UV-light, suggesting a role in UV-induced transcriptional regulation. Another top induced gene encodes Dpo2 that belong to PolB2 family, which is the only UV inducible DNA polymerase. Sequence analysis of Dpo2 homologues revealed that their catalytic residues are mutated, suggesting these proteins may represent the inactivated polymerases (114). Nevertheless, the in vitro assay indicated that the catalytic subunit of Dpo2 is capable of bypassing DNA templates with Uracil, hypoxanthine and 8-oxoguanine, suggesting that Dpo2 may function in TLS past deaminated and oxidized bases (81).
DNA damage induced cell aggregation and intercellular chromosomal DNA transfer in Sulfolobus It has long been known that S. acidocaldarius efficiently mediates chromosomal auxotrophic markers exchange (115) and the process is stimulated by UV treatment (116). Further characterization of a UV-inducible operon encoding a type IV pili system in Sulfolobus (ups) revealed that ups genes are essential for UV-induced cell aggregation, exchange of chromosomal auxotrophic markers and important for cell survival upon UV treatment (117,118). In addition, two UV-inducible genes encoding membrane components were also revealed to be essential for UV-induced chromosomal DNA transfer, and probably function in DNA import process (Crenarchaeal DNA import system, Ced) (119). Interestingly, cell aggregation was shown to be species-specific and can also be triggered by other DSB inducing agents (117,118). These findings, together with the UV-induced upregulation of HRR genes, suggest that the DNA transfer among cells of the same species has an important
28 role in DNA repair by providing intact templates of homologous DNA (120).
The potential cell cycle checkpoint control upon stressed conditions in Sulfolobus
Fugure 8. The schematics of cell cycle for exponentially growing Sulfolobus spp. The cell cycle phases of Sulfolobus spp. are illustrated. The potential cell cycle checkpoint controls are indicated. The details are described in main text and the figure was taken from Lindås and Bernander, 2013 (2)
Most of cell cycle studies in archaea have been performed in Sulfolobus genus, a haploid crenarchaeote. Specifically, Sulfolobus spp. has three replication origins that fire once per cell cycle. The binding of Orc1/Cdc6 protein to the ORB (origin recognition boxes) is the main regulatory step during replication initiation and this process was proposed to be modulated by the switch between ATP and ADP bound status of Orc1/Cdc6 proteins. The replication is then terminated by fork collision and if homologous recombination occurred during replication, the resulting dimmer can be resolved by the homologue of bacterial XerCD recombinase, XerA. Chromosome replication is followed by genome segregation and cell division, which is mediated by the bacterial like ParA-ParB system and a system with homology to the eukaryal endosomal sorting compelx required for transport III (ESCRT-III), respectively (2).
29
The progression of cell cycle in Sulfolobus is regulated by a yet-to-be defined mechanism. As summarized by Lindås and Bernander (2), and shown in Fig. 8, an exponentially growing Sulfolobus cell goes through a short pre-replicative G1 phase (it occupies <5% of the cell cycle) and is followed by the S phase, where the genome replication occurs (it lasts for 30–35% of the cell cycle). Post-replicative G2 phase occupies over half of the cell cycle, and M and D phase each lasts ca. 5% of the cell cycle, during which the genome segregation and cell division happens. Interestingly, when Sulfolobus culture is growing in a stationary phase, under amino acid starvation stress or diluted into fresh medium, temporary cycle arrest in D (G2) was observed (121,122). In addition, several drugs including daunomycin that suppresses DNA replication and transcription by targeting topoisomerase, N1-guanyl-1,7-diaminoheptane (GC7) that suppresses translation by inhibiting posttranslational modification of protein synthesis initiation factor 5A and acetic acid induce cell cycle arrest in G2 phase (123-125). More interestingly, UV light treatment of S. solfataricus and S. acidocaldarius leads to an increased population with DNA contents as those in G1 and early S phase cells and it was proposed to be a result of the activation of a G1 checkpoint system upon DNA damage (112,113). The ability of Sulfolobus to provoke the cell cycle arrest under stressed conditions suggests the presence of a cell cycle checkpoint system.
TRANSCRIPTIONAL REGULATION IN ARCHAEA
Figure 9. A schematic diagram of transcription regulation in Archaea Binding of TBP to TATA box and TFB to BRE site are essential for the recruitment of RNAP to form PIC. Transcriptional activator or repressor can facilitate or block this process by binding to different regions of the promoter. Activators usually bind to DNA upstream of the BRE whereas repressors bind to TATA/BRE-overlapping sequences. BRE: transcriptional factor B (TFB)
30 recognition element. TATA: the binding site for TBP. Picture adapted from Peeters et al., 2015 (126)
The archaeal transcription machinery shares strong similarity to the eukaryal RNA polymerase II system in subunit composition and general mechanism (127). As illustrated in Fig.9a, archaeal transcription is initiated by the binding of TATA-binding protein (TBP) and transcriptional factor B (TFB) to promoter region, and then, RNAP is recruited to the promoter along with the general transcriptional factor E (TFE) to form the pre-initiation complex (PIC). While TBP and TFB binds to TATA and BRE site of the promoter respectively in a sequence specific manner, TFE interacts with the non-template strand in a sequence-independent manner (126).
Archaeal transcriptional regulation is primarily mediated by specific or general transcription factors that interact with the basal transcription machinery at promoter regions. By binding to different promoter regions, these transcriptional factors can enhance or inhibit the sequential assembly of PIC. For example, as indicated in Fig. 9b, transcriptional activator usually binds to upstream of BRE site and facilitates the recruitment of TBP or TFB. While in contrast, the binding site of most repressors typically overlaps with TATA box and BRE, thereby blocking the formation of PIC. So far, a number of transcriptional factors of both bacterial- and eukaryal-types have been identified in archaea and they function in gene-specific regulation or global transcriptional regulation (128). Interestingly, approximately 70% of known archaeal genomes encode two or more copies of TFB and/or TBP family proteins (128) and functional implications of the paralogs of TFB in transcriptional regulation have been reported. For instance, as many as seven TFBs are encoded in H. salinarum. Among them TFBb may function in heat shock response (129) and TFBf is the exclusive TFB regulator for genes implicated in ribosome biogenesis (130).
Three homologues of TFB protein are encoded by Sulfolobus genus, two of them (TFB1 and TFB2) are in full-length form and the third (TFB3) is severely short. Specifically, TFB3 lacks the carboxyl-terminal core domain that interacts with TBP and binds the BRE, and the B-finger domain that stimulate transcription (113,131). Investigation of the relative abundances of these three TFB-encoding genes in S. solfataricus and S. acidocaldarius
31 demonstrated that only tfb3 is significantly induced upon UV treatment (113) and this regulator is subsequently shown to facilitate the in vitro transcription (132), suggesting it could function in DNA damage-induced transcriptional regulation.
32
SUMMARY OF THE RESULTS
In this work, I have investigated the function of TFB3 and Orc1-2 from S. islandicus Rey15A during DNA damage-induced cellular responses. To introduce DNA damage, 4-NQO was used. It has been reported that the metabolic activation of 4-NQO leads to the formation of DNA adducts, 8-hydroxyguanine and strand breakage (13) and our previous results indicated that NQO induces a similar replication stress as that observed in UV irradiation (133).
TFB3 FUNCTIONS AS A TRANSCRIPTIONAL ACTIVATOR FOR DNA TRANSFER PATHWAY tfb3 is among the top upregulated genes following exposure to UV light in S. solfataricus and S. acidocaldarius, proposing a role for this potential transcriptional regulator in DDR of Sulfolobus. In vitro transcription assay demonstrated that TFB3, together with the TFB1 and TBP are capable of facilitating the in vitro transcription. However, such an stimulatory effect is observed for all tested promoters including non-UV-responsive ones (132) and the in vivo function of TFB3 remain elusive.
In this work, we showed that tfb3 gene was also dramatically induced in S. islandicus by a number of DNA damaging agents including MMS, NQO, cisplatin and UV light, indicating that the induction of tfb3 is triggered specifically by DNA damage. Then we constructed the tfb3 deletion mutant (∆tfb3) and phenotypic characterization of ∆tfb3 revealed that the mutant is more sensitive to 1-2µM NQO treatment than the WT strain. The increased susceptibility of the mutant to NQO prompts us to analyze the transcriptome change in the WT and mutant upon NQO treatment, which led to the identification of 139 upregulated genes and 174 downregulated genes (>2 folds) in WT strain. These genes include most of the highly UV- inducible genes and again no significant induction of DNA repair genes was observed. In contrast, only 10 of those 61 top induced genes (>4 folds) was upregulated in ∆tfb3, indicating that TFB3 does function in transcriptional activation in vivo. Furthermore, many of those TFB3-dependent genes are implicated in the intercellular DNA transfer process, including previously reported ups genes, ced genes and a large number of genes encoding membrane-associated components, indicating TFB3 may specifically function in the transcriptional activation of the genes in DNA transfer pathway upon DNA damage.
To provide insights into the mechanism of TFB3-mediated transcriptional activation, we
33 performed CHIP-qPCR analysis and found that TFB3 is associated with the promoters of its target genes and essentially no enrichment was observed for the promoters of TFB3- independent genes, indicating that TFB3 is specifically recruited onto the promoter of those TFB3-dependent genes upon DNA damage and activate their expression, probably by recruiting RNAP to the promoter. However, different from the canonical TFB family protein, TFB3 lacks the DNA-binding domain that is normally present in the C-terminal cyclin fold of TFBs. As a result, the recruitment of TFB3 to the promoter region is probably mediated by protein-protein interaction. It has been proposed that the Zn ribbon domain of TFB/TFIIB protein functions in RNAP recruitment (134) and the four conserved cysteines in Zn ribbon coordinate the Zn ions (135). Intriguingly, TFB1 in Sulfolobus species lack the first and fourth cysteines that are conserved in TFB/TFIIB family including Sulfolobus TFB3. It was proposed that TFB1 and TFB3 functionally interact with each other to provide the full capacity of transcription (132). In this work, we further performed sequence analysis of TFB3 and identified a coiled-coil (CC) motif at the C terminal region. By performing the site- mutagenesis of TFB3 protein, we showed that both the conserved cysteines in N-terminal Zn ribbon region and the conserved residues in C terminal CC motif are essential for TFB3’s function as a transcriptional activator upon DNA damage. These results suggest that the canonical Zn ribbon of TFB3 may complement the function of TFB1 to provide full transcriptional capacity for those TFB3-dependent genes and the specificity of TFB3 can be achieved by interacting with a sequence-specific regulator via CC motif.
Interestingly, phylogenetic analysis revealed a co-occurrence of TFB3, non-canonical TFB1 and Ced system in a broader range of species in Crenarchaeota, suggesting TFB3 regulated DNA transfer pathway may represent a well conserved DDR regulatory circuit upon DNA damage in Crenarchaeota.
To conclude, this part of work solved the longstanding question about the functional significance of the TFB3’s induction upon DNA damage. Though lack of a DNA binding domain, TFB3 is recruited onto the promoter of its target genes and this process is probably mediated by protein-protein interaction via coiled-coil motif. Most of TFB3-dependent genes are implicated in DNA transfer process, together with the co-occurrence between TFB3 and Ced system, suggesting this regulator has co-evolved specifically with the DNA transfer
34 systems.
ORC1-2 FUNCTIONS AS A GLOBAL REGULATOR ESSENTIAL FOR DDR IN SULFOLOBUS To investigate the function of Orc1-2 during DNA damage response, in this work, we analysed the sensitivity of the orc1-2 deletion mutant previously constructed in S. islandicus Rey15A, towards NQO and a hypersensitivity phenotype was observed. To provide insights into how the deficiency of Orc1-2 affects cell’s survival after DNA damage treatment, the orc1-2 mutant and WT were subjected to transcriptome analysis. Strikingly, transcriptome data revealed that the Orc1-2 deficiency has abolished the differential expression of the majority of NQO responsive genes. Specifically, the Orc1-2 dependent upregulated genes include all those TFB3-dependent genes. In addition, genes involved in HRR including mre11-rad50 operon and Dpo2 operon also exhibits Orc1-2 dependent upregulation. Those Orc1-2-dependent downregulated genes are implicated in DNA replication initiation (Orc1-1 and Orc1-3), genome segregation (SegA and SegB) and cell division (CdvA, ESCRT-III orthologues and Vps4), suggesting that NQO imposes cell cycle arrest in S. islandicus and the regulation is dependent on Orc1-2.
In consistent with the deficiency in transcriptional responses upon DNA damage, Δorc1-2 was shown defective in cell aggregation. Meanwhile, flow cytometry analysis revealed that Δorc1-2 is also defective in cell cycle regulation upon DNA damage, as the population with one chromosome equivalent increased more dramatically in the mutant. Upon DNA damage, cell division in WT happens in a lower frequency than that in Δorc1-2, as a result of transcriptional repression of cell division genes in WT. While in contrast, though genes encoding DNA replication initiators were not repressed in Δorc1-2, the frequent initiation of DNA replication can be readily blocked by genomic DNA lesions. As a result, these processes collectively lead to a dramatic increase of the cell population with a similar genome content equivalent to one chromosome in Δorc1-2.
Interestingly, a conserved motif previously that has been reported in UV-responsive expression in S. acidocaldarius (136) is also present in a number of Orc1-2-dependent genes. Reporter gene assay showed that this motif also mediates NQO-responsive expression in WT strain of S. islandicus E234, but not in Δorc1-2, indicating that both
35
Orc1-2 protein and the motif are essential for DNA damage induced transcriptional regulation. Furthermore, DNaseI footprinting assay demonstrated that the recombinant Orc1-2 protein protects the motif (conducted by Mengmeng Sun), suggesting that Orc1-2 may bind to this DNA damage responsive element (DDRE) in vivo. Interestingly, such a motif also exists in the promoter of orc1-2 gene, suggesting an autoregulation of Orc1-2 protein level upon DNA damage and a higher expression of Orc1-2 may be important for the DDR process.
To investigate the influence of the protein level of Orc1-2 on Sulfolobus DDR, we constructed a strain in which the original promoter of the orc1-2 gene was replaced with the araS promoter that confers arabinose-inducible expression in this archaeon. By culturing the strain in non-inducible media and inducible condition, we are able to control the protein level of Orc1-2 in a basal or high level in a constant manner individually. Phenotypic characterization of the resulting mutant strains revealed that cells expressing a low level of Orc1-2 protein exhibited hypersensitivity to NQO treatment as orc1-2 deletion mutant cells. While in clear contrast, a constant high level of Orc1-2 protein allows the strain responding to NQO treatment more promptly. In parallel with that, quantitative analysis of the expression level DDR genes demonstrated that a constant high level of Orc1-2 protein enabled immediate induction of DDR genes upon NQO treatment. Nevertheless, DDR cascade was not activated by constantly inducing the expression of orc1-2, suggesting that other factors or post-translational modification of Orc1-2 is also required to activate the DDR cascade.
Based on the above summarized results, one could envisage the following scenario of the regulatory network in the DDR of Sulfolobus. As illustrated in Fig. 10, DSBs generated by DNA replication across genomic DNA lesions or directly from NQO treatment can be recognised by Mre11-Rad50 complex, which then recruits NurA and HerA and initiates end resection. Meanwhile, Mre11-Rad50 complex may also transmits the DNA damage signal to certain Orc1-2 modifying factor that activates the function of Orc1-2 upon DNA damage. Once activated, Orc1-2 enhances the expression of itself thus comprising a positive feedback loop. Meanwhile, Orc1-2 binds to the DDRE or other promoter regions
36 of its target genes to modulate transcriptional process, thereby orchestrating the global transcriptional response towards DNA damage.
Figure 10. An Orc1-2 centered DDR regulatory network in Sulfolobus DNA damage agents yield lesions on DNA that will be converted into double-stranded breaks, which activate the DNA damage signal transduction pathway. Then Orc1-2 is probably activated by posttranslational modifications, such as phosphorylation and/or acetylation. The activated form of Orc1-2 then recognizes DDRE present in the promoters of DDR genes and activates/represses their expression, including several different cellular processes as well as its own gene. AAA+: ATPases associated with diverse cellular activities; wH: wing-helix DNA binding domain; DDRE: DNA damage-responsive element; TTS: transcription start site; Ups: UV-responsive pili of Sulfolobus; Ced: Crenarchaeal system for exchange of DNA.
37
DISCUSSIONS AND FUTURE PERSPECTIVES
CHIP-qPCR analysis showed that TFB3 is recruited onto the promoter of TFB3-dependent DDR genes. However, the underlying mechanism remains to be elucidated. As TFB3 lacks any recognizable DNA-binding domain, its recruitment onto the promoter must be indirect, such as by protein-protein interaction with a transcriptional factor that binds to the DDR promoters. Here, Orc1-2 represent a candidate for such a role, as it was shown that Orc1-2 binds to DDRE in vitro. Consistently, deletion of either the orc1-2 or tfb3 abolish the DNA damage induced cell aggregation and our unpublished data suggested that TFB3 cannot restore the function of Orc1-2 in activating gene expression of ups operon upon DNA damage, indicating that both TFB3 and Orc1-2 are required for this process. The ability of Orc1-2 to bind DDRE at the promoter region may assist TFB3 recruitment via the interaction with TFB3’s CC motif. Indeed, the coiled-coil motif is conserved in TFB3 paralogues and was shown to be essential for the function of TFB3, suggesting that it may mediate the recruitment of TFB3.
It was shown that the conserved DDRE motif (ANTTTC) mediates NQO-responsive expression, in the presence of Orc1-2. However, among those top regulated genes, dpo2 and orc1-2 also contain the DDRE motif but is TFB3-independent, suggesting that other co- activators are recruited after Orc1-2 binding, or Orc1-2 itself is sufficient for the activation of dpo2 gene or the encoding gene of itself upon DNA damage.
More interestingly, the variation of the DDRE sequence and location at different promoter was observed. For example, tfb3 and orc1-2 gene promoter contain the most conserved motif that present between -50 and -80 region, while in contrast, most of the DDRE locate at -30 to -50 region of the promoter. This arrangement is reminiscent of the LexA-mediated SOS response, in which the location and sequence variation of SOS box determine the affinity to LexA and thus modulates the transcriptional strength of SOS genes. Similarly, the multilayer regulations of DDR genes may also exist in Sulfolobus.
In contrast, motif searching for the promoters of those downregulated ones failed to identify a conserved regulatory motif (apart from the TATA box region), suggesting a distinctive
38 mechanism for transcription repression of these genes. For example, the TFB3/Orc1-2 dependent upregulation of certain repressors could lead to the repression of certain genes. Another possibility is that activated Orc1-2 competes with TBP for the binding of TATA box region, thereby blocking the PIC formation on the promoter of those downregulated genes.
Interestingly, the induction of Orc1-2 is essential but not sufficient for the activation of DDR in Sulfolobus, suggesting the existence of a yet-to-be defined mechanism to activate Orc1-2 upon DNA damage. It was proposed that the ADP/ATP bound forms may function as a switch for the functional status of Sulfolobus Orc1/Cdc6 proteins and newly synthesized ATP-bound proteins will be inactivated by one round of ATP hydrolysis (137). However, the activation of Orc1-2 upon DNA damage is less likely to be mediated by such a mechanism, as the constant induction of orc1-2 gene in the Orc1-2araS strain, failed to initiate DDR cascade. One possibility is that the newly synthesized Orc1-2 has to interact with certain DNA-damage- activated partner to achieve the conformation change and thus fulfill its role in transcriptional regulation. However, a more attractive model is that certain Orc1-2 modifying factor activates Orc1-2 upon DNA damage by posttranslational modification, a prevailing mode employed by eukaryal organisms to modulate the function of their DDR regulators.
∆orc1-2 shows a hypersensitivity phenotype following the incubation with NQO, while ∆tfb3 is only moderately sensitive to NQO treatment. The difference between the susceptibility of ∆tfb3 and ∆orc1-2 to NQO treatment suggests that those TFB3-independent processes play a key role in cell survival upon DNA damage. Notably, Dpo2 encoding gene is the most top induced genes that are independent of TFB3. It thus represents a candidate that contributes to the survival of ∆tfb3, but not ∆orc1-2 upon DNA damage, by performing TLS. More importantly, the failure of ∆orc1-2 in blocking the expression of those genes involved in cell cycle progression upon DNA damage probably renders the strain more sensitive to NQO agent, as the NQO-induced lesions will be converted into the more deleterious DSBs upon DNA replication.
Based on the discussions above, the further research can be focused on the following aspects.
As the overexpression of orc1-2 does not trigger the activation of DDR cascade, we
39 hypothesized that PTM events mediated by certain kinases or other protein-modifying enzymes may occur on Orc1-2 upon DNA damage, which probably activate the function of Orc1-2. As a result, it is of crucial importance to determine whether Orc1-2 undergoes posttranslational modification upon DNA damage by mass spectrometry, either the phosphorylation or acetylation/methylation. If it proven to be true, then the identification of the Orc1-2 modifier will provide unprecedented insights into the DNA damage induced signal transduction pathway in archaea.
It was suggested those unrepaired lesions in Sulfolobus can block DNA replication and lead to the DSBs formation, which may trigger the cellular signal cascade towards DNA damage (138). So far, it is generally accepted that Mre11-Rad50 is the early factor that recognizes the DSBs and it may recruits either NurA-HerA for end resection or other factors for DNA damage signal transduction. In eukarya, the MRN complex was the main factor for the activation of the ATM kinase by protein-protein interaction and this complex is conserved across archaeal-eukaryal lineages, suggesting Mre11-rad50 could also transmit the DNA damage signal to downstream transducers in archaea, which probably is a kinase. By analogy to the mechanism in Eukarya, the identification of the factors that interact with Mre11-Rad50 under the condition of DNA damage will provide interesting insights into the mechanism of DDR activation in archaea.
In addition, how Orc1-2 regulates the transcriptional process for the target genes without a DDRE motif remains unclear. One possibility is that Orc1-2 interacts with specific or general transcriptional regulators to modulate the expression of certain subset of genes. The identification and characterization of Orc1-2 interactome under the normal growth and upon DNA damage may provide new clues for the mechanism of Orc1-2-mediated transcriptional responses.
40
REFERENCES
1. Woese, C.R. and Fox, G.E. (1977) Phylogenetic structure of the prokaryotic domain: the primary kingdoms. Proc Natl Acad Sci U S A, 74, 5088-5090. 2. Lindas, A.C. and Bernander, R. (2013) The cell cycle of archaea. Nat. Rev. Microbiol., 11, 627-638. 3. Zatopek, K.M., Gardner, A.F. and Kelman, Z. (2018) Archaeal DNA replication and repair: new genetic, biophysical, and molecular tools for discovering and characterizing enzymes, pathways, and mechanisms. FEMS Microbiol. Rev. 4. Lindahl, T. (1993) Instability and decay of the primary structure of DNA. Nature, 362, 709. 5. She, Q., Feng, X. and Han, W. (2017), Biocommunication of Archaea. Springer, pp. 305-318. 6. Grogan, D.W., Carver, G.T. and Drake, J.W. (2001) Genetic fidelity under harsh conditions: analysis of spontaneous mutation in the thermoacidophilic archaeon Sulfolobus acidocaldarius. Proc Natl Acad Sci U S A, 98, 7928-7933. 7. Goodman, M.F. and Woodgate, R. (2013) Translesion DNA polymerases. Csh Perspect Biol, 5, a010363. 8. White, M.F. and Allers, T. (2018) DNA Repair in the Archaea - an emerging picture. FEMS Microbiol. Rev. 9. Grogan, D.W. (2015) Understanding DNA repair in hyperthermophilic Archaea: persistent gaps and other reasons to focus on the fork. Archaea, 2015. 10. Zhang, C., Tian, B., Li, S., Ao, X., Dalgaard, K., Gokce, S., Liang, Y. and She, Q. (2013) Genetic manipulation in Sulfolobus islandicus and functional analysis of DNA repair genes. Biochem. Soc. Trans., 41, 405-410. 11. Genois, M.-M., Paquet, E.R., Laffitte, M.-C.N., Maity, R., Rodrigue, A., Ouellette, M. and Masson, J.-Y. (2014) DNA repair pathways in trypanosomatids: from DNA repair to drug resistance. Microbiol. Mol. Biol. Rev., 78, 40-73. 12. Sancar, A., Lindsey-Boltz, L.A., Unsal-Kacmaz, K. and Linn, S. (2004) Molecular mechanisms of mammalian DNA repair and the DNA damage checkpoints. Annu. Rev. Biochem., 73, 39-85. 13. Friedberg, E.C., Walker, G.C., Siede, W. and Wood, R.D. (2005) DNA repair and mutagenesis. American Society for Microbiology Press. 14. Ciccia, A. and Elledge, S.J. (2010) The DNA damage response: making it safe to play with knives. Mol. Cell, 40, 179-204. 15. Morita, R., Nakane, S., Shimada, A., Inoue, M., Iino, H., Wakamatsu, T., Fukui, K., Nakagawa, N., Masui, R. and Kuramitsu, S. (2010) Molecular mechanisms of the whole DNA repair system: a comparison of bacterial and eukaryotic systems. J Nucleic Acids, 2010, 179594. 16. Yi, C. and He, C. (2013) DNA repair by reversal of DNA damage. Csh Perspect Biol, 5, a012575. 17. Kiener, A., Husain, I., Sancar, A. and Walsh, C. (1989) Purification and properties of Methanobacterium thermoautotrophicum DNA photolyase. J. Biol. Chem., 264, 13880-13887.
41
18. Sakofsky, C.J., Runck, L.A. and Grogan, D.W. (2011) Sulfolobus mutants, generated via PCR products, which lack putative enzymes of UV photoproduct repair. Archaea, 2011. 19. Fujihashi, M., Numoto, N., Kobayashi, Y., Mizushima, A., Tsujimura, M., Nakamura, A., Kawarabayasi, Y. and Miki, K. (2007) Crystal structure of archaeal photolyase from Sulfolobus tokodaii with two FAD molecules: implication of a novel light-harvesting cofactor. J. Mol. Biol., 365, 903-910. 20. Dorazi, R., Götz, D., Munro, S., Bernander, R. and White, M.F. (2007) Equal rates of repair of DNA photoproducts in transcribed and non‐transcribed strands in Sulfolobus solfataricus. Mol. Microbiol., 63, 521-529. 21. Leclere, M.M., Nishioka, M., Yuasa, T., Fujiwara, S., Takagi, M. and Imanaka, T. (1998) The O6-methylguanine-DNA methyltransferase from the hyperthermophilic archaeon Pyrococcus sp. KOD1: a thermostable repair enzyme. Mol. Gen. Genet., 258, 69-77. 22. Perugino, G., Vettone, A., Illiano, G., Valenti, A., Ferrara, M.C., Rossi, M. and Ciaramella, M. (2012) Activity and regulation of archaeal DNA alkyltransferase: conserved protein involved in repair of DNA alkylation damage. J. Biol. Chem., 287, 4222-4231. 23. Kanugula, S., Pauly, G.T., Moschel, R.C. and Pegg, A.E. (2005) A bifunctional DNA repair protein from Ferroplasma acidarmanus exhibits O6-alkylguanine-DNA alkyltransferase and endonuclease V activities. Proc Natl Acad Sci U S A, 102, 3617-3622. 24. Nishikori, S., Shiraki, K., Okanojo, M., Imanaka, T. and Takagi, M. (2004) Equilibrium and kinetic stability of a hyperthermophilic protein, O6- methylguanine-DNA methyltransferase under various extreme conditions. Journal of biochemistry, 136, 503-508. 25. Messling, J.-E. and Williams, A. (2016), Genome Stability. Elsevier, pp. 51-67. 26. Krokan, H.E. and Bjoras, M. (2013) Base excision repair. Csh Perspect Biol, 5, a012583. 27. Grasso, S. and Tell, G. (2014) Base excision repair in Archaea: back to the future in DNA repair. DNA Repair, 21, 148-157. 28. Ishino, S., Makita, N., Shiraishi, M., Yamagami, T. and Ishino, Y. (2015) EndoQ and EndoV work individually for damaged DNA base repair in Pyrococcus furiosus. Biochimie, 118, 264-269. 29. Kiyonari, S., Egashira, Y., Ishino, S. and Ishino, Y. (2014) Biochemical characterization of endonuclease V from the hyperthermophilic archaeon, Pyrococcus furiosus. Journal of biochemistry, 155, 325-333. 30. Shiraishi, M., Ishino, S., Yamagami, T., Egashira, Y., Kiyonari, S. and Ishino, Y. (2015) A novel endonuclease that may be responsible for damaged DNA base repair in Pyrococcus furiosus. Nucleic Acids Res., 43, 2853-2863. 31. Yan, Z., Huang, Q., Ni, J. and Shen, Y. (2016) Distinct catalytic activity and in vivo roles of the ExoIII and EndoIV AP endonucleases from Sulfolobus islandicus. Extremophiles, 20, 785-793. 32. Greagg, M.A., Fogg, M.J., Panayotou, G., Evans, S.J., Connolly, B.A. and Pearl, L.H. (1999) A read-ahead function in archaeal DNA polymerases detects promutagenic template-strand uracil. Proc Natl Acad Sci U S A, 96, 9045-9050. 33. Dionne, I. (2005) Characterization of an archaeal family 4 uracil DNA glycosylase and its interaction with PCNA and chromatin proteins. Biochem. J., 387, 859-863.
42
34. Kiyonari, S., Tahara, S., Uchimura, M., Shirai, T., Ishino, S. and Ishino, Y. (2009) Studies on the base excision repair (BER) complex in Pyrococcus furiosus. Biochem. Soc. Trans., 37, 79-82. 35. Pan, M., Kelman, L.M. and Kelman, Z. (2011) The archaeal PCNA proteins. Biochem. Soc. Trans., 39, 20-24. 36. Shiraishi, M., Ishino, S., Yoshida, K., Yamagami, T., Cann, I. and Ishino, Y. (2016) PCNA is involved in the EndoQ-mediated DNA repair process in Thermococcales. Scientific reports, 6, 25532. 37. Hogrel, G., Lu, Y., Laurent, S., Henry, E., Etienne, C., Phung, D.K., Dulermo, R., Bossé, A., Pluchon, P.-F. and Clouet-d’Orval, B. Physical and functional interplay between PCNA DNA clamp and Mre11–Rad50 complex from the archaeon Pyrococcus furiosus. Nucleic Acids Res. 38. Rouillon, C. and White, M.F. (2011) The evolution and mechanisms of nucleotide excision repair proteins. Res. Microbiol., 162, 19-26. 39. Kisker, C., Kuper, J. and Van Houten, B. (2013) Prokaryotic nucleotide excision repair. Csh Perspect Biol, 5, a012591. 40. Hu, J., Selby, C.P., Adar, S., Adebali, O. and Sancar, A. (2017) Molecular mechanisms and genomic maps of DNA excision repair in Escherichia coli and humans. J. Biol. Chem., 292, 15588-15597. 41. Giglia-Mari, G., Zotter, A. and Vermeulen, W. (2011) DNA damage response. Csh Perspect Biol, 3, a000745. 42. Marteijn, J.A., Lans, H., Vermeulen, W. and Hoeijmakers, J.H. (2014) Understanding nucleotide excision repair and its roles in cancer and ageing. Nat. Rev. Mol. Cell Biol., 15, 465-481. 43. White, M.F. and Allers, T. (2018) DNA Repair in the Archaea–an emerging picture. FEMS Microbiol. Rev. 44. Rouillon, C. and White, M.F. (2010) The XBP-Bax1 helicase-nuclease complex unwinds and cleaves DNA: implications for eukaryal and archaeal nucleotide excision repair. J. Biol. Chem., 285, 11013-11022. 45. Romano, V., Napoli, A., Salerno, V., Valenti, A., Rossi, M. and Ciaramella, M. (2007) Lack of strand-specific repair of UV-induced DNA lesions in three genes of the archaeon Sulfolobus solfataricus. J. Mol. Biol., 365, 921-929. 46. Fujikane, R., Ishino, S., Ishino, Y. and Forterre, P. (2010) Genetic analysis of DNA repair in the hyperthermophilic archaeon, Thermococcus kodakaraensis. Genes Genet. Syst., 85, 243-257. 47. Crowley, D.J., Boubriak, I., Berquist, B.R., Clark, M., Richard, E., Sullivan, L., DasSarma, S. and McCready, S. (2006) The uvrA, uvrB and uvrC genes are required for repair of ultraviolet light induced DNA photoproducts in Halobacterium sp. NRC-1. Saline Systems, 2, 11. 48. Li, G.-M. (2008) Mechanisms and functions of DNA mismatch repair. Cell Res., 18, 85. 49. Larrea, A.A., Lujan, S.A. and Kunkel, T.A. (2010) SnapShot: DNA mismatch repair. Cell, 141, 730-730. e731. 50. Kelman, Z. and White, M.F. (2005) Archaeal DNA replication and repair. Curr. Opin. Microbiol., 8, 669-676. 51. Busch, C.R. and DiRuggiero, J. (2010) MutS and MutL are dispensable for maintenance of the genomic mutation rate in the halophilic archaeon Halobacterium salinarum NRC-1. PLoS One, 5, e9045.
43
52. Grogan, D.W. (2004) Stability and repair of DNA in hyperthermophilic Archaea. Curr. Issues Mol. Biol., 6, 137-144. 53. Ishino, S., Nishi, Y., Oda, S., Uemori, T., Sagara, T., Takatsu, N., Yamagami, T., Shirai, T. and Ishino, Y. (2016) Identification of a mismatch-specific endonuclease in hyperthermophilic Archaea. Nucleic Acids Res., 44, 2977-2986. 54. Nakae, S., Hijikata, A., Tsuji, T., Yonezawa, K., Kouyama, K.-i., Mayanagi, K., Ishino, S., Ishino, Y. and Shirai, T. (2016) Structure of the EndoMS-DNA complex as mismatch restriction endonuclease. Structure, 24, 1960-1971. 55. Castaneda-Garcia, A., Prieto, A.I., Rodriguez-Beltran, J., Alonso, N., Cantillon, D., Costas, C., Perez-Lago, L., Zegeye, E.D., Herranz, M., Plocinski, P. et al. (2017) A non-canonical mismatch repair pathway in prokaryotes. Nature communications, 8, 14246. 56. Ren, B., Kühn, J., Meslet‐Cladiere, L., Briffotaux, J., Norais, C., Lavigne, R., Flament, D., Ladenstein, R. and Myllykallio, H. (2009) Structure and function of a novel endonuclease acting on branched DNA substrates. The EMBO journal, 28, 2479-2489. 57. Bonura, T., Town, C.D., Smith, K.C. and Kaplan, H.S. (1975) The influence of oxygen on the yield of DNA double-strand breaks in X-irradiated Escherichia coli K-12. Radiat. Res., 63, 567-577. 58. Huang, L.C., Clarkin, K.C. and Wahl, G.M. (1996) Sensitivity and selectivity of the DNA damage sensor responsible for activating p53-dependent G1 arrest. Proc Natl Acad Sci U S A, 93, 4827-4832. 59. Lieber, M.R. and Wilson, T.E. (2010) SnapShot: Nonhomologous DNA end joining (NHEJ). Cell, 142, 496-496. e491. 60. Blackwood, J.K., Rzechorzek, N.J., Bray, S.M., Maman, J.D., Pellegrini, L. and Robinson, N.P. (2013) End-resection at DNA double-strand breaks in the three domains of life. Biochem. Soc. Trans., 41, 314-320. 61. Jasin, M. and Rothstein, R. (2013) Repair of strand breaks by homologous recombination. Csh Perspect Biol, 5, a012740. 62. Moynahan, M.E. and Jasin, M. (2010) Mitotic homologous recombination maintains genomic stability and suppresses tumorigenesis. Nat. Rev. Mol. Cell Biol., 11, 196-207. 63. Constantinesco, F., Forterre, P., Koonin, E., Aravind, L. and Elie, C. (2004) A bipolar DNA helicase gene, herA, clusters with rad50, mre11 and nurA genes in thermophilic archaea. Nucleic Acids Res., 32, 1439-1447. 64. Haseltine, C.A. and Kowalczykowski, S.C. (2009) An archaeal Rad54 protein remodels DNA and stimulates DNA strand exchange by RadA. Nucleic Acids Res., 37, 2757-2770. 65. Patoli, B.B., Winter, J.A., Patoli, A.A., Delahay, R.M. and Bunting, K.A. (2017) Co-expression and purification of the RadA recombinase with the RadB paralog from Haloferax volcanii yields heteromeric ring-like structures. Microbiology, 163, 1802-1811. 66. Zhai, B., DuPrez, K., Doukov, T.I., Li, H., Huang, M., Shang, G., Ni, J., Gu, L., Shen, Y. and Fan, L. (2017) Structure and Function of a Novel ATPase that Interacts with Holliday Junction Resolvase Hjc and Promotes Branch Migration. J. Mol. Biol., 429, 1009-1029.
44
67. Huang, Q., Li, Y., Zeng, C., Song, T., Yan, Z., Ni, J., She, Q. and Shen, Y. (2015) Genetic analysis of the Holliday junction resolvases Hje and Hjc in Sulfolobus islandicus. Extremophiles, 19, 505-514. 68. White, M.F. (2011) Homologous recombination in the archaea: the means justify the ends. Biochem. Soc. Trans., 39, 15-19. 69. Northall, S.J., Ivančić-Baće, I., Soultanas, P. and Bolt, E.L. (2016) Remodeling and control of homologous recombination by DNA helicases and translocases that target recombinases and synapsis. Genes, 7, 52. 70. Huang, Q., Liu, L., Liu, J., Ni, J., She, Q. and Shen, Y. (2015) Efficient 5′-3′ DNA end resection by HerA and NurA is essential for cell viability in the crenarchaeon Sulfolobus islandicus. BMC Mol. Biol., 16, 2. 71. Weller, G.R., Kysela, B., Roy, R., Tonkin, L.M., Scanlan, E., Della, M., Devine, S.K., Day, J.P., Wilkinson, A., d'Adda di Fagagna, F. et al. (2002) Identification of a DNA nonhomologous end-joining complex in bacteria. Science, 297, 1686-1689. 72. Sale, J.E. (2013) Translesion DNA synthesis and mutagenesis in eukaryotes. Csh Perspect Biol, 5, a012708. 73. Petruska, J. and Goodman, M.F. (1995) Enthalpy-entropy compensation in DNA melting thermodynamics. J. Biol. Chem., 270, 746-750. 74. McCulloch, S.D. and Kunkel, T.A. (2008) The fidelity of DNA synthesis by eukaryotic replicative and translesion synthesis polymerases. Cell Res., 18, 148. 75. Fuchs, R.P. and Fujii, S. (2013) Translesion DNA synthesis and mutagenesis in prokaryotes. Csh Perspect Biol, 5, a012682. 76. Goodman, M.F. and Woodgate, R. (2013) Translesion DNA polymerases. Cold Spring Harbor perspectives in biology, 5, a010363. 77. Kreuzer, K.N. (2013) DNA damage responses in prokaryotes: regulating gene expression, modulating growth patterns, and manipulating replication forks. Csh Perspect Biol, 5, a012674. 78. Wu, Y., Wilson, R.C. and Pata, J.D. (2011) The Y-family DNA polymerase Dpo4 uses a template slippage mechanism to create single-base deletions. J. Bacteriol., 193, 2630-2636. 79. Trakselis, M.A. and Bauer, R.J. (2014), Nucleic Acid Polymerases. Springer, pp. 139-162. 80. Sakofsky, C.J., Foster, P.L. and Grogan, D.W. (2012) Roles of the Y-family DNA polymerase Dbh in accurate replication of the Sulfolobus genome at high temperature. DNA Repair, 11, 391-400. 81. Choi, J.-Y., Eoff, R.L., Pence, M.G., Wang, J., Martin, M.V., Kim, E.-J., Folkmann, L.M. and Guengerich, F.P. (2011) Roles of the four DNA polymerases of the crenarchaeon Sulfolobus solfataricus and accessory proteins in DNA replication. J. Biol. Chem., 286, 31180-31193. 82. Jozwiakowski, S.K., Gholami, F.B. and Doherty, A.J. (2015) Archaeal replicative primases can perform translesion DNA synthesis. P Natl Acad Sci USA, 112, E633- E638. 83. Blackford, A.N. and Jackson, S.P. (2017) ATM, ATR, and DNA-PK: The Trinity at the Heart of the DNA Damage Response. Mol. Cell, 66, 801-817. 84. Sirbu, B.M. and Cortez, D. (2013) DNA Damage Response: Three Levels of DNA Repair Regulation. Csh Perspect Biol, 5. 85. Brown, J.S., O'Carrigan, B., Jackson, S.P. and Yap, T.A. (2017) Targeting DNA repair in cancer: beyond PARP inhibitors. Cancer discovery, 7, 20-37.
45
86. Blackford, A.N. and Jackson, S.P. (2017) ATM, ATR, and DNA-PK: The trinity at the heart of the DNA damage response. Molecular cell, 66, 801-817. 87. Bakkenist, C.J. and Kastan, M.B. (2003) DNA damage activates ATM through intermolecular autophosphorylation and dimer dissociation. Nature, 421, 499. 88. Lee, J.-H., Mand, M.R., Deshpande, R.A., Kinoshita, E., Yang, S.-H., Wyman, C. and Paull, T.T. (2013) Ataxia telangiectasia-mutated (ATM) kinase activity is regulated by ATP-driven conformational changes in the Mre11/Rad50/Nbs1 (MRN) complex. J. Biol. Chem., 288, 12840-12851. 89. Schiller, C.B., Lammens, K., Guerini, I., Coordes, B., Feldmann, H., Schlauderer, F., Mockel, C., Schele, A., Strasser, K., Jackson, S.P. et al. (2012) Structure of Mre11-Nbs1 complex yields insights into ataxia-telangiectasia-like disease mutations and DNA damage signaling. Nat. Struct. Mol. Biol., 19, 693-+. 90. Schiller, C.B., Lammens, K., Guerini, I., Coordes, B., Feldmann, H., Schlauderer, F., Möckel, C., Schele, A., Strässer, K. and Jackson, S.P. (2012) Structure of Mre11–Nbs1 complex yields insights into ataxia-telangiectasia–like disease mutations and DNA damage signaling. Nature Structural and Molecular Biology, 19, 693. 91. Shiloh, Y. and Ziv, Y. (2013) The ATM protein kinase: regulating the cellular response to genotoxic stress, and more. Nat Rev Mol Cell Bio, 14, 197-210. 92. Escribano-Díaz, C., Orthwein, A., Fradet-Turcotte, A., Xing, M., Young, J.T., Tkáč, J., Cook, M.A., Rosebrock, A.P., Munro, M. and Canny, M.D. (2013) A cell cycle-dependent regulatory circuit composed of 53BP1-RIF1 and BRCA1-CtIP controls DNA repair pathway choice. Mol. Cell, 49, 872-883. 93. Marechal, A. and Zou, L. (2013) DNA Damage Sensing by the ATM and ATR Kinases. Csh Perspect Biol, 5. 94. Simmons, L.A., Foti, J.J., Cohen, S.E. and Walker, G.C. (2008) The SOS regulatory network. EcoSal Plus, 2008. 95. Erill, I., Campoy, S. and Barbe, J. (2007) Aeons of distress: an evolutionary perspective on the bacterial SOS response. FEMS Microbiol. Rev., 31, 637-656. 96. Kreuzer, K.N. (2013) DNA Damage Responses in Prokaryotes: Regulating Gene Expression, Modulating Growth Patterns, and Manipulating Replication Forks. Cold Spring Harbor Perspectives in Biology, 5. 97. Soutoglou, E. and Misteli, T. (2008) Activation of the cellular DNA damage response in the absence of DNA lesions. Science, 320, 1507-1510. 98. You, Z., Bailis, J.M., Johnson, S.A., Dilworth, S.M. and Hunter, T. (2007) Rapid activation of ATM on DNA flanking double-strand breaks. Nat. Cell Biol., 9, 1311. 99. Cubeddu, L. and White, M.F. (2005) DNA damage detection by an archaeal single- stranded DNA-binding protein. J. Mol. Biol., 353, 507-516. 100. Napoli, A., Valenti, A., Salerno, V., Nadal, M., Garnier, F., Rossi, M. and Ciaramella, M. (2005) Functional interaction of reverse gyrase with single-strand binding protein of the archaeon Sulfolobus. Nucleic Acids Res., 33, 564-576. 101. Carpentieri, F., De Felice, M., De Falco, M., Rossi, M. and Pisani, F.M. (2002) Physical and functional interaction between the mini-chromosome maintenance- like DNA helicase and the single-stranded DNA binding protein from the crenarchaeon Sulfolobus solfataricus. J. Biol. Chem., 277, 12118-12127. 102. Wei, T., Zhang, S., Zhu, S., Sheng, D., Ni, J. and Shen, Y. (2008) Physical and functional interaction between archaeal single-stranded DNA-binding protein and the 5′–3′ nuclease NurA. Biochem. Biophys. Res. Commun., 367, 523-529.
46
103. Cadman, C.J. and McGlynn, P. (2004) PriA helicase and SSB interact physically and functionally. Nucleic Acids Res., 32, 6378-6387. 104. Quaiser, A., Constantinesco, F., White, M.F., Forterre, P. and Elie, C. (2008) The Mre11 protein interacts with both Rad50 and the HerA bipolar helicase and is recruited to DNA following gamma irradiation in the archaeon Sulfolobus acidocaldarius. BMC Mol. Biol., 9, 25. 105. Kish, A., Gaillard, J.C., Armengaud, J. and Elie, C. (2016) Post‐translational methylations of the archaeal Mre11: Rad50 complex throughout the DNA damage response. Mol. Microbiol., 100, 362-378. 106. Delmas, S., Shunburne, L., Ngo, H.-P. and Allers, T. (2009) Mre11-Rad50 promotes rapid repair of DNA damage in the polyploid archaeon Haloferax volcanii by restraining homologous recombination. PLoS Genet., 5, e1000552. 107. Delmas, S., Duggin, I.G. and Allers, T. (2013) DNA damage induces nucleoid compaction via the Mre11‐Rad50 complex in the archaeon Haloferax volcanii. Mol. Microbiol., 87, 168-179. 108. Greagg, M.A., Fogg, M.J., Panayotou, G., Evans, S.J., Connolly, B.A. and Pearl, L.H. (1999) A read-ahead function in archaeal DNA polymerases detects promutagenic template-strand uracil. Proceedings of the National Academy of Sciences, 96, 9045-9050. 109. Gehring, A.M. and Santangelo, T.J. (2017) Archaeal RNA polymerase arrests transcription at DNA lesions. Transcription, 8, 288-296. 110. Baliga, N.S., Bjork, S.J., Bonneau, R., Pan, M., Iloanusi, C., Kottemann, M.C., Hood, L. and DiRuggiero, J. (2004) Systems level insights into the stress response to UV radiation in the halophilic archaeon Halobacterium NRC-1. Genome Res., 14, 1025-1035. 111. Salerno, V., Napoli, A., White, M.F., Rossi, M. and Ciaramella, M. (2003) Transcriptional response to DNA damage in the archaeon Sulfolobus solfataricus. Nucleic Acids Res., 31, 6127-6138. 112. Fröls, S., Gordon, P.M., Panlilio, M.A., Duggin, I.G., Bell, S.D., Sensen, C.W. and Schleper, C. (2007) Response of the hyperthermophilic archaeon Sulfolobus solfataricus to UV damage. J. Bacteriol., 189, 8708-8718. 113. Götz, D., Paytubi, S., Munro, S., Lundgren, M., Bernander, R. and White, M.F. (2007) Responses of hyperthermophilic crenarchaea to UV irradiation. Genome biology, 8, R220. 114. Rogozin, I.B., Makarova, K.S., Pavlov, Y.I. and Koonin, E.V. (2008) A highly conserved family of inactivated archaeal B family DNA polymerases. Biology direct, 3, 32. 115. Grogan, D.W. (1996) Exchange of genetic markers at extremely high temperatures in the archaeon Sulfolobus acidocaldarius. J. Bacteriol., 178, 3207-3211. 116. Schmidt, K.J., Beck, K.E. and Grogan, D.W. (1999) UV stimulation of chromosomal marker exchange in Sulfolobus acidocaldarius: implications for DNA repair, conjugation and homologous recombination at extremely high temperatures. Genetics, 152, 1407-1415. 117. Fröls, S., Ajon, M., Wagner, M., Teichmann, D., Zolghadr, B., Folea, M., Boekema, E.J., Driessen, A.J., Schleper, C. and Albers, S.V. (2008) UV‐inducible cellular aggregation of the hyperthermophilic archaeon Sulfolobus Solfataricus is mediated by pili formation. Mol. Microbiol., 70, 938-952.
47
118. Ajon, M., Fröls, S., van Wolferen, M., Stoecker, K., Teichmann, D., Driessen, A.J., Grogan, D.W., Albers, S.V. and Schleper, C. (2011) UV‐inducible DNA exchange in hyperthermophilic archaea mediated by type IV pili. Mol. Microbiol., 82, 807- 817. 119. van Wolferen, M., Wagner, A., van der Does, C. and Albers, S.V. (2016) The archaeal Ced system imports DNA. P Natl Acad Sci USA, 113, 2496-2501. 120. Wagner, A., Whitaker, R.J., Krause, D.J., Heilers, J.H., van Wolferen, M., van der Does, C. and Albers, S.V. (2017) Mechanisms of gene flow in archaea. Nat. Rev. Microbiol., 15, 492-501. 121. Han, W. and Institut, K.U.D.N.-o.B.F.B. (2015) Studies on DNA Damage Response in Sulfolobus Islandicus. University of Copenhagen, Faculty of Science, Department of Biology. 122. Hjort, K. and Bernander, R. (1999) Changes in cell size and DNA content in Sulfolobus cultures during dilution and temperature shift experiments. J. Bacteriol., 181, 5669-5675. 123. Jansson, B.M., Malandrin, L. and Johansson, H.E. (2000) Cell cycle arrest in archaea by the hypusination inhibitor N 1-guanyl-1, 7-diaminoheptane. J. Bacteriol., 182, 1158-1161. 124. Hjort, K. and Bernander, R. (2001) Cell cycle regulation in the hyperthermophilic crenarchaeon Sulfolobus acidocaldarius. Mol. Microbiol., 40, 225-234. 125. Lundgren, M., Andersson, A., Chen, L.M., Nilsson, P. and Bernander, R. (2004) Three replication origins in Sulfolobus species: Synchronous initiation of chromosome replication and asynchronous termination. P Natl Acad Sci USA, 101, 7046-7051. 126. Peeters, E., Driessen, R.P.C., Werner, F. and Dame, R.T. (2015) The interplay between nucleoid organization and transcription in archaeal genomes. Nat. Rev. Microbiol., 13, 333-341. 127. Werner, F. and Grohmann, D. (2011) Evolution of multisubunit RNA polymerases in the three domains of life. Nat. Rev. Microbiol., 9, 85. 128. Martinez-Pastor, M., Tonner, P.D., Darnell, C.L. and Schmid, A.K. (2017) Transcriptional Regulation in Archaea: From Individual Genes to Global Regulatory Networks. Annu. Rev. Genet., 51, 143-170. 129. Lu, Q., Han, J., Zhou, L., Coker, J.A., DasSarma, P., DasSarma, S. and Xiang, H. (2008) Dissection of the regulatory mechanism of a heat-shock responsive promoter in Haloarchaea: a new paradigm for general transcription factor directed archaeal gene regulation. Nucleic Acids Res., 36, 3031-3042. 130. Facciotti, M.T., Reiss, D.J., Pan, M., Kaur, A., Vuthoori, M., Bonneau, R., Shannon, P., Srivastava, A., Donohoe, S.M., Hood, L.E. et al. (2007) General transcription factor specified global gene regulation in archaea. P Natl Acad Sci USA, 104, 4630-4635. 131. Werner, F. (2007) Structure and function of archaeal RNA polymerases. Mol. Microbiol., 65, 1395-1404. 132. Paytubi, S. and White, M.F. (2009) The crenarchaeal DNA damage‐inducible transcription factor B paralogue TFB3 is a general activator of transcription. Mol. Microbiol., 72, 1487-1499. 133. Han, W.Y., Xu, Y.Q., Feng, X., Liang, Y.X., Huang, L., Shen, Y.L. and She, Q.X. (2017) NQO-Induced DNA-Less Cell Formation Is Associated with Chromatin
48
Protein Degradation and Dependent on A(0)A(1)-ATPase in Sulfolobus. Frontiers in Microbiology, 8, 1-12. 134. Chen, H.-T. and Hahn, S. (2003) Binding of TFIIB to RNA polymerase II: mapping the binding site for the TFIIB zinc ribbon domain within the preinitiation complex. Mol. Cell, 12, 437-447. 135. Zeng, Q.D., Lewis, L.M., Colangelo, C.M., Dong, J. and Scott, R.A. (1996) A transcription factor IIB homolog from the hyperthermophilic archaeon Pyrococcus furiosus binds Zn or Fe in N-terminal Cys(4) motif. J Biol Inorg Chem, 1, 162-168. 136. Le, T.N., Wagner, A. and Albers, S.-V. (2017) A conserved hexanucleotide motif is important in UV-inducible promoters in Sulfolobus acidocaldarius. Microbiology, 163, 778-788. 137. Samson, R.Y., Xu, Y., Gadelha, C., Stone, T.A., Faqiri, J.N., Li, D., Qin, N., Pu, F., Liang, Y.X., She, Q. et al. (2013) Specificity and function of archaeal DNA replication initiator proteins. Cell reports, 3, 485-496. 138. Frols, S., White, M.F. and Schleper, C. (2009) Reactions to UV damage in the model archaeon Sulfolobus solfataricus. Biochem. Soc. Trans., 37, 36-41.
49
Nucleic Acids Research, 2018 1 doi: 10.1093/nar/gky236 A transcriptional factor B paralog functions as an activator to DNA damage-responsive expression in archaea Xu Feng1,2, Mengmeng Sun2, Wenyuan Han2, Yun Xiang Liang1 and Qunxin She1,2,*
1State Key Laboratory of Agricultural Microbiology and College of Life Science and Technology, Huazhong Agricultural University, 430070 Wuhan, China and 2Archaea Centre, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, DK-2200 Copenhagen N, Denmark
Received February 27, 2018; Revised March 17, 2018; Editorial Decision March 19, 2018; Accepted March 20, 2018
ABSTRACT are larger than the bacterial counterparts consisting of 5 subunits (1). Studies on transcription initiation show that Previously it was shown that UV irradiation induces a the two types of RNAP use different mechanisms to initi- strong upregulation of tfb3 coding for a paralog of the ate gene transcription. In bacteria, the bacterial RNAP and archaeal transcriptional factor B (TFB) in Sulfolobus sigma factor form the holoenzyme, in which the sigma sub- solfataricus, a crenarchaea. To investigate the func- unit recognizes promoters and binds there to form a pre- tion of this gene in DNA damage response (DDR), tfb3 initiation complex (PIC) (2). RNA transcription in archaea was inactivated by gene deletion in Sulfolobus is- is more related to the process by RNAP II, the enzyme that landicus and the resulting tfb3 was more sensitive is responsible for synthesis of mRNA in eukaryotes. The ar- to DNA damage agents than the original strain. Tran- chaeal transcription starts with recognition of a promoter scriptome analysis revealed that a large set of genes by the TATA-binding protein and transcriptional factor B show TFB3-dependent activation, including genes of (TFB), and then, RNAP is recruited to the promoter along with the general transcriptional factor E to form the PIC the ups operon and ced system. Furthermore, the (3,4). TFB3 protein was found to be associated with DDR Current investigation on archaeal transcriptional regu- gene promoters and functional dissection of TFB3 lation has revealed that the eukaryotic-like transcriptional showed that the conserved Zn-ribbon and coiled-coil machinery is primarily controlled by the promoter-centered motif are essential for the activation. Together, the re- mode of regulation; transcription factors specifically bind sults indicated that TFB3 activates the expression of to DNA motifs present on gene promoter regions and reg- DDR genes by interaction with other transcriptional ulate the gene transcription by affecting the PIC formation factors at the promoter regions of DDR genes to fa- on the promoters (5,6). Transcriptional factors of both bac- cilitate the formation of transcription initiation com- terial and eukaryotic types have been identified in archaea plex. Strikingly, TFB3 and Ced systems are present and function in gene-specific regulation (7–9). In addition, / in a wide range of crenarchaea, suggesting that the many archaeal genomes encode multiple TBP and or TFB (10), and for this reason, archaea have the potential to ex- Ced system function as a primary DNA damage re- plore these basal transcriptional factors to exert global reg- pair mechanism in Crenarchaeota. Our findings fur- ulation in analogy to sigma factors in bacterial transcrip- ther suggest that TFB3 and the concurrent TFB1 form tion. a TFB3-dependent DNA damage-responsive circuit Indeed, early works on two TFB paralogs (TFB1 and with their target genes, which is evolutionarily con- TFB2) of Thermococcus kodakarensis reveals that each of served in the major lineage of Archaea. them can support transcription in vitro without any ap- parent selectivity on promoter, and neither of them is es- sential for cell growth (11). Nevertheless, tfb1 is expressed INTRODUCTION to a higher level in T. kodakarensis and supports better RNA transcription is the first step of decoding genetic infor- cell growth, relative to tfb2 (12). Furthermore, characteri- mation from DNA, and RNA polymerase (RNAP), a multi- zation of Pyrococcus furiosus TFB1 and TFB2 shows that protein complex that is responsible for the process, is evolu- the two factors have different capability to further RNA tionarily conserved in all three domains of life. RNAPs in transcription in vitro (13). Sulfolobus solfataricus and Sul- archaea and eukaryotes have 12 or more subunits, which folobus acidocaldarius encode three paralogs of TFB pro-
*To whom correspondence should be addressed. Tel: +45 532 2013; Fax: +45 3532 2128; Email: [email protected]