A Genome-Scale RNA Interference Screen Identifies Novel Regulators of DNA Double-Strand Break Repair
by
Jordan Young
A thesis submitted in conformity with the requirements for the degree of Doctor of Philosophy Department of Molecular Genetics University of Toronto
© Copyright by Jordan Young (2016)
A Genome-Scale RNA Interference Screen Identifies Novel Regulators of DNA Double-Strand Break Repair
Jordan Young
Doctor of Philosophy
Department of Molecular Genetics, University of Toronto, 2016 Abstract
All living organisms are continuously challenged by agents in their normal cellular environment that can inflict damage to their genetic material. DNA damage can have negative implications on cellular fitness by disrupting genomic processes such as DNA replication and gene expression. In addition, DNA lesions can lead to gene mutation and gross chromosomal rearrangements, events that are implicated in the development of cancer. DNA double-strand breaks (DSBs) are considered one of the most severe types of DNA damage. To protect genome integrity, organisms have evolved several DSB repair mechanisms. Homologous recombination (HR) is an essential DSB repair pathway that is of critical importance during the S- and G2-phases of the cell cycle. HR plays a central role in promoting genome stability and preventing tumourigenesis. For example, mutations in the genes coding for the HR factors BRCA1 and BRCA2 are responsible for familial breast and ovarian cancer. The limiting step in the HR pathway is the generation of single-stranded DNA (ssDNA) by DNA end resection. In this thesis, I describe the establishment of an immunofluorescence-based assay that monitors end resection by quantitative image-based cytometry (QIBC). Employing this assay, I conducted a plate-based RNA interference (RNAi) screen using a library that targets 18,452 genes. As expected, the top hits in my screen were known regulators of end resection including CtIP and all three subunits of the MRE11-RAD50-NBS1 (MRN) complex. I also outline the validation of one the strongest candidate end resection activators identified in the screen, the zinc finger protein, ZNF335. I demonstrate that ZNF335 promotes DSB repair by HR through the enhancement of DNA end resection. In the final chapter of this thesis, I describe the establishment of a cell microarray- based platform for conducting RNAi screens. The platform circumvents the long liquid-handling robotic procedures and large quantities of reagents that are common for plate-based screens.
ii
Acknowledgements
The completion of the work presented in this thesis has been the most challenging but rewarding endeavor of my young scientific career. I would like to thank, first and foremost, my supervisor Daniel Durocher for giving me the opportunity to work in his lab. My doctoral research had its ups and downs, but Dan always displayed an infectious enthusiasm that kept me moving forward with a positive attitude. I could not have asked for a more brilliant and creative scientific mentor to guide me through this work and I feel so privileged to have been trained in Dan’s lab. The knowledge and skills I have acquired in the Durocher lab are invaluable and I know they will endow me with a successful career in science.
I would also like to thank all of my committee members past and present: Jason Moffat, Bret Pearson, and Corey Nislow. Their unwavering support of my scientific goals, even when the project did not go as planned, was commendable. Furthermore, their thoughtful suggestions were essential to the completion of this thesis.
Thinking back at what I have learned during my time in the Durocher lab is astonishing. I could not have acquired these skills without the help of numerous Durocher lab members both past and present. Throughout my time in the Durocher lab, I have met so many talented and brilliant scientists from all over the globe. I would like to thank two post-doctoral fellows that have especially mentored me: Cristina Escribano-Diaz and Lara O’Donnell.
I would also like to extend acknowledgement to my colleagues at the Lunenfeld- Tanenbaum Research Institute that were essential to this research: Gagan Gupta (Pelletier lab), Thomas Sun and Alessandro Datti (Robotics facility), Mikhail Bashkurov and Cyrus Handy (High-content imaging facility), and Zhen-Yuan Lin (Gingras lab). Their technical expertise, guidance, and patience were remarkable.
Last (but not least) I would like to thank my family and friends for their support and encouragement. I would especially like to acknowledge those that eventually stopped asking the following questions: “when are you finishing school?” and “how long have you been in school for?” I would like to thank Nicole and Chazz for putting up with my absence on many evenings and weekends. Love you guys.
iii
Table of Contents
Acknowledgements ...... iii
Table of Contents ...... iv
List of Tables ...... ix
List of Figures ...... x
List of Appendices ...... xii
List of Abbreviations ...... 1
Chapter I: Introduction
1.1 Statement of contributions, rights, and permissions ...... 4
1.2 Damage to the DNA double helix ...... 5
1.3 DNA double-strand breaks and how they arise ...... 5
1.3.1 Endogenous sources of DNA double-strand breaks ...... 7
1.3.2 Exogenous sources of DNA double-strand breaks ...... 8
1.3.3 Developmentally programmed DNA double-strand breaks ...... 9
1.4 The cellular response to DNA double-strand breaks ...... 10
1.4.1 DNA damage-induced signal transduction by ATM, DNA-PKcs, and ATR ...... 11
1.4.2 DNA double-strand break-induced cell cycle checkpoints ...... 13
1.4.3 DNA double-strand break-induced senescence and apoptosis ...... 17
1.5 Repair of DNA double-strand breaks ...... 18
1.5.1 Non-homologous end joining ...... 18
1.5.2 Mitotic homologous recombination ...... 21
1.5.3 DNA end resection is the rate-limiting step of homologous recombination ...... 27
1.5.4 Cell cycle-regulated suppression of homologous recombination in G1 cells downstream of end resection ...... 29
1.6 Post-translational modifications and the regulation of DNA double-strand break repair 31
iv
1.6.1 Ubiquitin-dependent signaling at DNA double-strand breaks ...... 31
1.6.2 SUMOylation at DNA double-strand breaks ...... 32
1.6.3 Protein acetylation at DNA double-strand breaks ...... 32
1.6.4 PARylation is one of the earliest protein modifications detected at DNA double-strand breaks ...... 33
1.7 Relevance of DSB repair to human physiology ...... 33
1.7.1 DSB repair promotes tumour suppression ...... 34
1.7.2 DSB repair is important during cerebral cortical development ...... 34
1.8 Rationale and research objective ...... 37
1.9 High-throughput functional discovery utilizing RNA interference screens ...... 37
Chapter II: Genome-scale siRNA screen for regulators of DNA end resection
2.1 Statement of contributions, rights, and permissions ...... 41
2.2 Summary ...... 42
2.3 Introduction ...... 43
2.4 Results ...... 44
2.4.1 Establishment of immunofluorescence-based assays to monitor DNA end resection ...... 44
2.4.2 Quantitative image-based cytometry to monitor DNA end resection ...... 48
2.4.3 Automating DNA end resection assays using liquid-handling robotics ...... 49
2.4.4 Genome-scale siRNA screen utilizing a pooled siRNA library ...... 52
2.4.5 Secondary confirmation screen utilizing cherry-picked siRNA pools ...... 55
2.4.6 Re-screening deconvolved siRNAs for the top resection activator candidates .... 57
2.5 Discussion ...... 59
Chapter III: The zinc finger protein, ZNF335, promotes DNA end resection
3.1 Statement of contributions, rights, and permissions ...... 65
3.2 Summary ...... 66
3.3 Introduction ...... 67 v
3.4 Results ...... 71
3.4.1 Analysis of four siRNA duplexes targeting the ZNF335 messenger RNA ...... 71
3.4.2 ZNF335 depleted cells have defective pRPA32 (S4/S8), RPA32, and BrdU focus formation ...... 73
3.4.3 U2OS cells depleted of ZNF335 are sensitive to DSB-inducing agents ...... 73
3.4.4 Depletion of ZNF335 decreases the efficiency of DSB repair by HR ...... 75
3.4.5 ZNF335 promotes the phosphorylation of CHK1 at a characterized ATR consensus site ...... 78
3.4.6 Defective end resection in ZNF335 depleted cells can be rescued by expressing an RNAi-resistant open reading frame ...... 80
3.4.7 The function of ZNF335 in DNA end resection is dependent on its four C- terminal C2H2 zinc finger domains ...... 80
3.4.8 ZNF335 localizes to sites of DNA damage generated by laser microirradiation . 83
3.4.9 ZNF335 depletion does not affect the expression of the core end resection activators ...... 87
3.4.10 Immunoprecipitation coupled to mass spectrometry (IP-MS) identifies ZNF335 candidate interaction partners ...... 90
3.5 Discussion ...... 93
Chapter IV: Solid-phase transfections in arrayed drops (SPIDR): a cell microarray- based platform for high-content screening
4.1 Statement of contributions, rights, and permissions ...... 98
4.2 Summary ...... 99
4.3 Introduction ...... 100
4.4 Results ...... 101
4.4.1 Design and fabrication of cell microarrays ...... 101
4.4.2 Establishment of solid-phase siRNA transfections in human cultured cells ...... 101
4.4.3 Testing for cell and siRNA cross contamination between samples ...... 104
4.4.4 Solid-Phase transfections In arrayed DRops (SPIDR): a novel method to prevent sample cross contamination on cell microarrays ...... 109
4.4.5 Pilot RNAi screen utilizing the SPIDR platform ...... 109 vi
4.5 Discussion ...... 111
Chapter V: Future directions
5.1 Validation of candidate end resection activators ...... 116
5.2 Validation of candidate end resection inhibitors ...... 117
5.3 Elucidating the mechanism by which ZNF335 promotes end resection ...... 118
5.3.1 Generation of a ZNF335 knockout cell line by genome editing ...... 118
5.3.2 ZNF335 recruitment to DSB sites ...... 118
5.3.3 Investigation of end resection factor recruitment to DSB sites ...... 119
5.3.4 Candidate ZNF335 protein interaction partners ...... 119
5.4 ZNF335 and microcephaly ...... 120
Chapter VI: Materials and methods
6.1 Tissue culture ...... 122
6.1.1 Cell lines ...... 122
6.1.2 RNA interference ...... 122
6.1.3 Generating stable cell lines by lentiviral transduction ...... 122
6.2 Fluorescence microscopy ...... 124
6.2.1 Immunofluorescence ...... 124
6.2.2 Quantitative image-based cytometry ...... 125
6.2.3 Laser microirradiation ...... 125
6.3 Automated genome-scale RNAi screen ...... 127
6.4 Reverse transcription and quantitative PCR ...... 127
6.5 Western blot analysis ...... 128
6.5.1 Whole cell extract preparation ...... 128
6.5.2 SDS-PAGE and immunoblotting ...... 128
6.6 Cell cycle analysis by flow cytometry ...... 128
6.7 Clonogenic survival assays ...... 129 vii
6.8 Homologous recombination assay utilizing the DR-GFP reporter system ...... 129
6.9 Plasmids ...... 130
6.10 Immunoprecipitation coupled to mass spectrometry ...... 130
6.11 Solid-phase reverse siRNA transfections on 1536-well cell microarrays ...... 131
6.12 Solid-phase transfections in arrayed drops (SPIDR) ...... 132
Chapter VII: References……………………………………………………………………..132
Appendix I: Genome-scale RNAi screen data……………………………………………….165
Appendix II: Secondary confirmation screen data………………………………………….167
Appendix III: 53BP1 focus formation SPIDR pilot screen data……………………………169
viii
List of Tables
Table 1.1. Clinical features of some genome instability syndromes associated with defective DSB repair……………………………………………………………………………………….34 Table 2.1. Number of candidate resection activator deconvolved siRNAs that decreased DNA end resection in S and G2 phase U2OS cells……………………………………………….……60 Table 3.1. DSB repair proteins harbouring zinc finger domains………………………………..68 Table 3.2. Candidate protein interaction partners for full-length ZNF335………………...…...90 Table 3.3. Candidate protein interaction partners for ZNF335 Δ1-1014………………………..91 Table 6.1. List of siRNA duplexes used in this study………………………………………….122 Table 6.2. List of primary antibodies used in this study……………………………………….125
ix
List of Figures
Figure 1.1. Mechanisms of DNA double-strand break formation………………………………..5 Figure 1.2. Activation of DNA damage responsive kinases at DNA double-strand breaks…….11 Figure 1.3. Cell cycle checkpoints activated in response to DNA double-strand breaks……….14 Figure 1.4. DNA double-strand break repair by non-homologous end joining…………………18 Figure 1.5. DNA double-strand break repair by mitotic homologous recombination…………..22 Figure 1.6. The regulation of DNA double-strand break repair pathway choice……………….29 Figure 2.1. DNA end resection assay monitoring RPA32 focus formation…………………….44 Figure 2.2. DNA end resection assay monitoring pRPA32 (S4/S8) focus formation…………..45 Figure 2.3. DNA end resection assay monitoring BrdU focus formation………………………46 Figure 2.4. Measuring end resection by quantitative image-based cytometry………………….49 Figure 2.5. Application of the Kolmogorov-Smirnov test to analyze end resection……………50 Figure 2.6. Automating DNA end resection assays using liquid-handling robotics……………52 Figure 2.7. Genome-scale siRNA screen for regulators of DNA end resection………………...53 Figure 2.8. Pathway enrichment analysis for candidate resection activators…………………...55 Figure 2.9. Secondary confirmation screen utilizing the fluorescent ubiquitylation-based cell cycle indicator (FUCCI) system………………………………………………………………....57 Figure 2.10. Re-screening deconvolved siRNA pools for top candidate resection activators….59 Figure 3.1. Knockdown efficiency, growth, and cell cycle position analysis for siRNA duplexes targeting ZNF335 messenger RNA……………………………………………………………...71 Figure 3.2. ZNF335 is an activator of DNA end resection……………………………………...73 Figure 3.3. ZNF335 deficient cells are sensitive to DSB-inducing agents……………………...75 Figure 3.4. ZNF335 promotes DSB repair by HR………………………………………………76 Figure 3.5. ZNF335 promotes CHK1 serine 345 phosphorylation……………………………...78 Figure 3.6. Expression of siRNA-resistant ZNF335 rescues the observed end resection defect in ZNF335 depleted cells…………………………………………………………………………...80 Figure 3.7. The four C-terminal C2H2 zinc finger domains of ZNF335 are sufficient for its function in end resection…………………………………………………………………………81 Figure 3.8. ZNF335 accumulates at sites of laser microirradiation……………………………..84 Figure 3.9. The accumulation of ZNF335 at laser stripes is PARP-dependent………………...85
x
Figure 3.10. The PARP-dependent accumulation of ZNF335 at sites of microirradiation is not required for its function in DNA end resection………………………………………………....87 Figure 3.11. ZNF335 depletion does not affect the expression or protein stability of the core end resection activators……………………………………………………………………………...88 Figure 4.1. Formats for high-content screening………………………………………………..101 Figure 4.2. Solid-phase reverse siRNA transfections in HeLa cells…………………………...102 Figure 4.3. Cell migration between samples is minimal on 1536- and 3456-well cell microarrays……………………………………………………………………………………..104 Figure 4.4. Cross contamination of siRNA complexes is evident on cell microarrays………..105 Figure 4.5. Pre-soaking cell microarrays before cell flooding decreases knockdown efficiency……………………………………………………………………………………….107 Figure 4.6. Solid-phase transfections in arrayed drops (SPIDR)……………………………..109 Figure 4.7. High-content siRNA screen utilizing the SPIDR platform………………………..111
xi
List of Appendices
All appendices are presented in an electronic format and are written on the included DVD.
Appendix I: Genome-scale RNAi screen data……………………………………………….165
Appendix II: Secondary confirmation screen data………………………………………….167
Appendix III: 53BP1 focus formation SPIDR pilot screen data……………………………169
xii 1
List of Abbreviations
DNA deoxyribonucleic acid RNA ribonucleic acid mRNA messenger RNA DSB double-strand break SSB single-strand break ssDNA single-stranded DNA ROS reactive oxygen species HR homologous recombination CPT camptothecin IR ionizing radiation NCS neocarzinostatin ETOP etoposide HU hydroxyurea AID activation-induced deaminase NHEJ non-homologous end joining PIKK phosphatidylinositol 3-kinase-related kinase MMEJ microhomology-mediated end joining D-loop displacement loop SDSA synthesis-dependent strand annealing dHJ double Holliday junction SSA single-strand annealing LOH loss of heterozygosity PARP poly(ADP-ribose) polymerase PAR poly(ADP-ribose) PARG poly (ADP-ribose) glycohydrolase A-T ataxia-telangiectasia NBS Nijmegen breakage syndrome RNAi RNA interference siRNA small interfering RNA esiRNA endoribonuclease-prepared siRNA shRNA short hairpin RNA BrdU bromodeoxyuridine SEM standard error of the mean QIBC quantitative image-based cytometry KS Kolmogorov–Smirnov test IPA Ingenuity pathway analysis FUCCI fluorescence ubiquitination cell cycle indicator GESS genome-wide enrichment of seed sequences lncRNA long non-coding RNA
2
IP immunoprecipitation MS mass spectrometry PI propidium iodide PE plating efficiency SF surviving fraction GFP green fluorescent protein RFP red fluorescent protein DR-GFP direct repeat GFP ORF open reading frame PCR polymerase chain reaction NLS nuclear localization signal 4-OHT 4-hydroxytamoxifen DMSO dimethyl sulfoxide DOX doxycycline SPIDR solid-phase transfections in arrayed drops SBS Society for Biomolecular Screening Gy gray DAPI 4',6-diamidino-2-phenylindole CV coefficient of variation FDR false discovery rate ChIP chromatin immunoprecipitation IB immunoblotting IF immunofluorescence
3
Chapter I
Introduction
4
1.1 Statement of contributions, rights, and permissions
There are no statements to report regarding contributions, rights, and permissions.
5
1.2 Damage to the DNA double helix
A cells genetic material is continually altered through reacting with molecules present in the normal cellular environment. DNA damage can take many forms and it is estimated that replicating cells encounter approximately 106 DNA lesions per cell cycle (Lindahl and Barnes, 2000). Many types of DNA damage can be encountered, including base and sugar damage as well as breaks on one or both strands of the sugar-phosphate backbone. These DNA lesions can have severe effects on cellular fitness by disrupting genomic processes like transcription and replication. Furthermore, DNA damage that is not properly repaired can result in mutation and even genome rearrangements, leading to cell death or neoplastic transformation. Genome integrity is maintained by recognition, signaling, and subsequent repair of DNA damage, which reverse the deleterious consequences of DNA lesions and inhibit their transmission to daughter cells (Hoeijmakers, 2001). These signaling-based DNA damage responses have the ability to activate cell cycle checkpoints, coordinate DNA repair, regulate gene expression, and induce apoptosis if the damage load is too high (Jackson and Bartek, 2009).
1.3 DNA double-strand breaks and how they arise
The research presented in this thesis will center on the cellular response to one particular type of DNA lesion, the DNA double-strand break (DSB). DSBs are generated when the two complementary strands of DNA are severed in close proximity (within 10 base pairs) such that the remaining base-pairing and chromatin structure are no longer able to keep the broken strands together (Fig. 1.1A). Indeed, DSBs are often a consequence of two single-strand breaks (SSBs) that are on opposite strands and in close proximity to each other. Dividing cells encounter 10-50 DSBs during each cell cycle, which is substantially less than the frequency of other DNA lesions (Lieber, 2010). However, DSBs are the most deleterious type of DNA lesion because they do not leave an intact complementary strand to be used as a template to restore lost or damaged nucleotides. Studies in budding yeast have demonstrated that a single unrepaired DSB can lead to permanent cell cycle arrest and subsequent programmed cell death (Bennett et al., 1993). DSBs are also required for the formation of gross chromosomal rearrangements that can lead to amplifications, deletions, and gene fusions. These genomic events have been demonstrated to cause malignant transformation in several cancer types (van Gent et al., 2001).
6
Figure 1.1. Mechanisms of DNA double-strand break formation.
(A) “Two-ended” DSB that has formed as a consequence of close proximity SSBs on opposite DNA strands. Therefore, two-ended DSBs can have small overhang structures at both DNA ends. (B) A leading strand encounters a bulky DNA lesion (i.e. alkylated base) during replication. The DNA lesion will stall the replication fork which can be converted by a nuclease into a “one-ended” DSB. (C) If the leading strand collides with a SSB a “one-ended” DSB will form without the requirement of a nuclease. (D) Topoisomerase and topoisomerase-like (i.e. SPO11- TOPOVIB) enzymes can generate DSBs where the enzyme is covalently linked to the DSB end. (E) Chromosome ends (telomeres) can be recognized as DSBs if not properly protected by the Shelterin complex. Therefore, Shelterin dysfunction can be considered an endogenous source of DSB formation.
7
1.3.1 Endogenous sources of DNA double-strand breaks
1.3.1.1 Reactive oxygen species
DSBs can arise as a consequence of reactive oxygen species (ROS) that are generated during normal aerobic metabolism. Oxygen is an essential element for energy production but is also dangerous because of the high susceptibility of DNA to attack by ROS, an observation referred to as the “oxygen paradox” (Davies, 1995). A common by-product of several metabolic reactions (including mitochondrial respiration) is the highly reactive hydroxyl free radical. The two main modes of attack to the DNA molecule by a hydroxyl radical is the addition to a double bond or the abstraction of a hydrogen atom from either a DNA base or a deoxyribose sugar (Davies, 1995). Oxidation of the deoxyribose sugar can lead to strand breaks in the sugar-phosphate backbone of DNA and if two of these breaks occur on opposite strands and in close proximity a DSB can form.
1.3.1.2 DNA replication
Another predominant endogenous source of DSBs occurs during S-phase when replication forks encounter DNA lesions. Replicative DNA polymerases, which carry out the bulk of DNA synthesis, are unable to use damaged DNA strands as a template and, consequently, are stopped at most DNA lesions. A stalled replication fork can be re-established by nuclease-mediated conversion into a one-ended DSB which is also called a collapsed fork (Fig. 1.1B) (Fricke and Brill, 2003; Kaliraman et al., 2001). In addition, collapsed replication forks can form when a SSB on the leading strand is encountered during DNA synthesis (Fig. 1.1C). One-ended DSBs are repaired by a DSB repair pathway called homologous recombination (HR) which will be the focus of section 1.6.3.
1.3.1.3 Topoisomerase action
Topoisomerase enzymes create strand breaks to alleviate supercoiled DNA topologies and facilitate genomic processes including replication, transcription, and DNA repair (Vos et al., 2011; Wang, 2002). Topoisomerases can be assigned as either type I or II, depending on their cleavage of one or two strands of DNA, respectively (Liu et al., 1979, 1980; Poccia et al., 1978). DNA cleavage is linked to the formation of a transient but covalent enzyme-DNA adduct at the break terminus (Fig. 1.1D) (Lynn and Wang, 1989). Topoisomerase-DNA linkage prevents the
8
release of nicked or broken DNA strands before the enzyme can ligate the ends. However, topoisomerase I cleavage in the vicinity of a SSB can induce DSB formation (Jaxel et al., 1988). Moreover, abortive topoisomerase I activity can lead to one-ended DSB formation if encountered by a replication fork. Abortive topoisomerase II activity after cleavage of both DNA strands and before ligation can directly result in a DSB (Fig. 1.1D) (Brown et al., 1979). Remarkably, a class of chemotherapeutic drugs exploits abortive topoisomerase activity by binding to the enzyme, stabilizing it on cleaved DNA ends, and inhibiting ligation. For example, the topoisomerase type I poison, camptothecin (CPT), binds to the enzyme and after cleavage it intercalates within DNA ends to inhibit ligation (Hsiang et al., 1985; Staker et al., 2002).
1.3.1.4 Telomere dysfunction
Telomeres are repetitive DNA sequences present at the ends of eukaryotic chromosomes and are bound by components of the Shelterin complex (de Lange, 2005). Shelterin protects telomeres from being identified as DSB ends and therefore dysfunction of this complex can lead to aberrant DSB recognition by the cell (Fig. 1.1E) (Karlseder et al., 1999). Furthermore, telomeres shorten every cell division cycle as a consequence of normal replication at the end of chromosomes (Harley et al., 1990; Makarov et al., 1997). Telomere shortening can eventually eliminate telomeric repeats, ablating Shelterin binding and enabling the cell to recognize the chromosome end as a DSB (Harley et al., 1990; Karlseder et al., 2002). Therefore, as a population of cells continues to divide, there will be a greater risk of telomeres being recognized as DSBs.
1.3.2 Exogenous sources of DNA double-strand breaks
DSBs can also arise due to the action of exogenous sources including ionizing radiation (IR), heavy metals, and radiomimetic chemical clastogens such as bleomycin and neocarzinostatin (NCS) (Povirk, 1996; Ward, 1988). These agents can react with water within cells to form hydroxyl free radicals. DSBs occur as a consequence of oxidative attack on the sugar-phosphate backbone of DNA. Chemical inhibitors of topoisomerase enzymes including CPT and etoposide (ETOP) are also potent inducers of DSBs (Hsiang et al., 1985; Minocha and Long, 1984). Finally, exogenous chemicals that cause replication stress can induce the formation of one-ended DSBs. For example, the ribonuclease reductase inhibitor, hydroxyurea (HU), depletes the free pool of deoxyribonucleotides used for DNA synthesis, resulting in replication fork stalling
9
(Krakoff et al., 1968). Stalled replication forks can eventually be processed by nucleases into DSBs (Fricke and Brill, 2003; Kaliraman et al., 2001).
1.3.3 Developmentally programmed DNA double-strand breaks
1.3.3.1 Formation of DNA double-strand breaks in meiosis
Paradoxically, DSBs are not always a deleterious genomic event, and can be formed in a programmed manner to promote several developmental processes required for normal organism physiology. For instance, DSBs are generated in cells of reproductive organs during meiosis I and increases genome diversity in offspring and is required for proper gametogenesis (Kolodkin et al., 1986; Romanienko and Camerini-Otero, 2000). The second meiotic division is similar to mitosis in that it separates the centromeres of sister chromatids, whereas the first meiotic division separates homologous maternal and paternal chromosomes. Meiosis I poses a challenge because homologous chromosomes, unlike sister chromatids, are not necessarily in close proximity. Homologs must locate each other and ‘pair up’ before segregation. Direct association between homologous chromosomes promotes each spindle pole body to attach to one homolog, so that each daughter cell receives only one copy of every chromosome. Therefore, defects in homologous chromosome pairing are associated with aneuploidy and aberrant gametogenesis (Sherman et al., 1994). Homologous chromosome pairing is initiated by developmentally programmed DSBs that are generated at ‘hot spots’ across the length of each chromosome in early meiosis I (Kolodkin et al., 1986). DSBs are generated by the SPO11-TOPOVIBL complex that functions as a heterotetramer and introduces coordinated single nicks on opposite strands leading to a covalent protein-DNA intermediate (Fig. 1.1D) (Keeney et al., 1997; Liu et al., 1995; Robert et al., 2016; Vrielynck et al., 2016). DSBs trigger recombination between homologous chromosomes, which not only keep them physically connected but also promote genetic diversity by permitting the exchange of genetic information between maternal and paternal alleles. Not surprisingly, SPO11-null mice are defective in meiotic recombination and have a severe deficiency in gametogenesis that results in infertility (Romanienko and Camerini- Otero, 2000). The repair of meiotic DSBs must be completed before chromosome segregation and this process is the focus section 1.6.3.5.
10
1.3.3.2 Formation of DNA double-strand breaks in developing lymphocytes
Programmed DSBs also occur in developing B- and T-lymphocytes to help promote immunoglobulin and T-cell receptor diversity (Bassing and Alt, 2004). Sequence variability at immunoglobulin and T-cell receptor loci is critical for the recognition of diverse pathogens by the adaptive immune response. Immunoglobulin and T-cell receptor proteins contain variable regions that specify antigen binding (Edelman et al., 1969). Through a process termed V(D)J recombination, exons that encode variable regions contain variable (V), diversity (D), and joining (J) segments that can be combined in different ways to generate mature immunoglobulin and T-cell receptor genes (Hesse et al., 1987; Titani et al., 1965). Each segment is flanked by signal sequences that are recognized by the RAG1-RAG2 nuclease, which generates DSBs (Oettinger et al., 1990; Schatz et al., 1989). RAG1-RAG2-generated DSBs are recognized and repaired by a specific DSB repair pathway called non-homologous end joining (NHEJ; introduced in section 1.5.1) (Taccioli et al., 1994).
DSBs are also generated in developing B-lymphocytes to trigger a process called class- switch recombination. During B-cell differentiation, class-switch recombination can fuse different immunoglobulin constant regions to a specific variable region (Kataoka et al., 1980; Nossal et al., 1971). For example, after recognition of an epitope, B-cells mature by changing their immunoglobulin constant domain from membrane-bound to soluble. Differential constant domains also allow the immunoglobulin to interact with a variety of effector molecules and promote an effective adaptive immune response. During class-switch recombination, DNA strand breaks are generated by the concerted action of activation-induced deaminase (AID) and transcription in the immunoglobulin switch regions (Muramatsu et al., 2000). AID triggers the deamination of cytosine to uracil resulting in U-G mismatches that are processed by nucleases to yield DSBs. NHEJ or alternative end-joining (the focus of section 1.5.1 and 1.5.2, respectively) can ligate variable exons to specific constant exons during class-switch recombination (Casellas et al., 1998; Manis et al., 2002; Manis et al., 2004; Ward et al., 2004; Yan et al., 2007).
1.4 The cellular response to DNA double-strand breaks
The response to DSBs is a classical signal transduction pathway in which a signal, in this case a DSB, is first detected by sensor proteins and then transduced to downstream effectors. The focus
11
of this section will be on the mechanisms of DSB end detection and how this initiates signal transduction. I will outline the targets of DSB-induced signal transduction activity and how these effector proteins modulate cell cycle checkpoints, senescence, and apoptosis.
1.4.1 DNA damage-induced signal transduction by ATM, DNA-PKcs, and ATR
The critical signal transducers in the DSB response are three phosphatidylinositol-3 kinase-like kinases (PIKKs): ATM, DNA-PKcs, and ATR. Upon activation, they phosphorylate a vast set of proteins on serine or threonine residues present within serine/threonine-glutamine (S/T-Q) consensus motifs (Kim et al., 1999). Phosphoproteomic studies have identified hundreds of putative PIKK targets that are phosphorylated in response to DNA damage (Matsuoka et al., 2007; Mu et al., 2007; Roitinger et al., 2015; Smolka et al., 2007; Stokes and Comb, 2008). Although all three kinases have some overlapping substrates, each PIKK also has distinct functions in the response to DNA damage.
1.4.1.1 ATM activation
In response to DSBs, ATM is recruited to and activated by the DSB sensor complex MRE11- RAD50-NBS1 (MRN), a multifunctional complex that is critical for the early stages of the DSB response (Fig. 1.2A) (Carson et al., 2003; Nakada et al., 2003; Uziel et al., 2003). ATM autophosphorylation results in its dissociation from inactive dimers into active monomers that can bind to damaged chromatin (Bakkenist and Kastan, 2003). Furthermore, ATM is acetylated by TIP60 which further stimulates its activation (Kaidi and Jackson, 2013; Sun et al., 2005). The histone H2A variant H2AX is a particularly important substrate for ATM which phosphorylates it on the conserved C-terminal serine 139 residue (Burma et al., 2001; Downs et al., 2000; Rogakou et al., 1998). Phosphorylated H2AX (also called γH2AX) marks a chromatin domain that is recognized by the checkpoint mediator protein MDC1 (Stucki et al., 2005). In a positive- feedback loop, MDC1 promotes further ATM activation through interactions with NBS1 which enhances the accumulation of the MRN complex and of activated ATM on damaged chromatin (Chapman and Jackson, 2008; Melander et al., 2008; Spycher et al., 2008; Wu et al., 2008a).
1.4.1.2 DNA-PKcs activation
The KU70/80 heterodimer is a DSB end sensor complex that recruits and actives DNA-PKcs
12
Figure 1.2. Activation of DNA damage responsive kinases at DNA double-strand breaks.
(A) ATM activation. The DSB is first detected by the MRN complex which can directly bind to DNA ends. ATM is recruited to DSB sites by the MRN complex which results in its dissociation from inactive dimers to active monomers. ATM acetylation by TIP60 is also important for its activation. (B) DNA-PKcs activation. The DSB ends are first detected by KU70/80. DNA-PKcs is recruited to DSB sites through a direct association with KU70/80. The presence of DNA ends and KU70/80 stimulates DNA-PKcs activation. Two DNA-PKcs molecules can dimerize across a DSB to tether the broken ends together in a so-called ‘synaptic complex’ which holds the broken ends in close proximity and enhances re-joining. DNA-PKcs dimerization also stimulates its kinase activity. (C) ATR responds to the accumulation of ssDNA in the genome which can arise as a consequence of replication stress or DSB end resection during HR. First, ATR is recruited to RPA bound ssDNA through an interaction with ATRIP. The checkpoint clamp 9-1-1 and TOPBP1 are also required for ATR activation.
13
(Gottlieb and Jackson, 1993; Hartley et al., 1995). The complex formed at the break site consisting of DNA, KU70/80, and DNA-PKcs is referred to as simply “DNA-PK” (Fig. 1.2B). Once bound to KU70/80, the catalytic activity of DNA-PKcs is activated. DNA-PKcs kinase activity is essential for DSB repair by NHEJ but the exact mechanism for why this is the case is currently unknown. DNA-PK has been shown to phosphorylate core NHEJ factors such as KU70/80, XRCC4, and XLF, but surprisingly these phosphorylation events are not required for NHEJ (Douglas et al., 2005; Yu et al., 2008; Yu et al., 2003). The most important NHEJ DNA- PK target appears to be DNA-PKcs itself. In response to DSBs, upwards of 40 autophosphorylation sites have been documented for DNA-PKcs (Davis et al., 2014). One critical autophosphorylation site appears to be serine 2056 as ablating this site resulted in less efficient NHEJ (Chen et al., 2005).
1.4.1.3 ATR activation
ATR is activated in response to replication stress, when stalled forks result in the accumulation of ssDNA bound by the ssDNA binding heterotrimeric replication protein A (RPA14-RPA32- RPA70) complex (Zou and Elledge, 2003). ATR recognition of RPA-ssDNA is carried out by an ATR-associated factor called ATRIP (Cortez et al., 2001; Zou and Elledge, 2003). However, ATR/ATRIP localization to RPA-ssDNA is not sufficient for kinase activation. ATR activation also requires the RAD9-RAD1-HUS1 (or 9-1-1) complex and TOPBP1 (Delacroix et al., 2007; Lee et al., 2007; Majka et al., 2006; Roos-Mattjus et al., 2002). DSBs formed in the S- and G2- phases of the cell cycle can be repaired by HR. The first step of HR involves the generation of ssDNA overhangs at DSB ends through a process called DNA end resection. RPA binds ssDNA formed as a consequence of end resection providing another platform for ATR activation (Fig. 1.2C). Therefore, ATR is activated as a consequence of both replication stress and DSBs in S and G2 cells.
1.4.2 DNA double-strand break-induced cell cycle checkpoints
Activated ATM and ATR kinases are critical for transducing signals to effector proteins that halt cell cycle progression (Liu et al., 2000; Matsuoka et al., 1998). In contrast, DNA-PKcs appears to play a less pivotal role in the activation of DSB-induced cell cycle checkpoints (Burma et al., 1999; Jhappan et al., 2000). During the response to DSBs, it is paramount for the cell to arrest the cell cycle and provide time for repair machineries to re-join the breaks before the start of
14
processes like replication or mitosis. The replication or segregation of broken chromosomes can have deleterious outcomes for the cell, including the generation of daughter cells with aneuploidy and genome rearrangements (Cahill et al., 1998; Chan et al., 1999). DNA damage- induced checkpoints occur at entry into S-phase (the G1/S checkpoint), within S-phase (the S- phase checkpoint), and entry into mitosis (the G2/M checkpoint).
1.4.2.1 G1/S checkpoint
The G1/S checkpoint is triggered by ATM-dependent signaling and subsequent enrichment of MDC1 on damaged chromatin in G1 cells (Goldberg et al., 2003; Lou et al., 2003; Stewart et al., 2003). Although the precise mechanism of how MDC1 promotes checkpoint activation remains unclear, it appears to be at the level of enhancing ATM activity (Lou et al., 2006; Mochan et al., 2003). ATM targets that are required for G1/S checkpoint activation include the transcription factor p53 and the checkpoint kinases CHK1 and CHK2 (Fig. 1.3A) (Canman et al., 1998; Chen et al., 1999; Gatei et al., 2003; Matsuoka et al., 1998). P53 is stabilized upon phosphorylation on serine 15 by ATM, as it can no longer be inhibited by MDM2 (Canman et al., 1998; Dulic et al., 1994; Shieh et al., 1997). P53 then activates the transcription of p21 which is a potent inhibitor of the G1/S-promoting cyclin E/cyclin-dependent kinase 2 (CDK2) complex and therefore, induces a G1 arrest (Harper et al., 1993; Xiong et al., 1993). ATM also phosphorylates and activates CHK1 and CHK2 in G1 cells which in turn can phosphorylate the phosphatase CDC25A (Falck et al., 2001; Mailand et al., 2000). Phosphorylation of CDC25A induces its ubiquitylation by SCFβ-TRCP and subsequent degradation in the proteasome, potentiating the phosphorylation of CDK2 at threonine 14 and 15 (Busino et al., 2003; Costanzo et al., 2000; Jin et al., 2003; Mailand et al., 2000). Phosphorylated CDK2 is unable to promote DNA synthesis and entry into S-phase, thereby causing a G1 arrest. In addition, activated CHK2 can also phosphorylate p53 to further enhance p21 expression and the G1/S checkpoint (Hirao et al., 2000).
1.4.2.2 S-phase checkpoint
Eukaryotes replicate their genomes from multiple origins that are distributed across each chromosome. Origins of replication are activated throughout S-phase of the cell cycle such that some origins fire early and others fire late. In response to DNA damage during S-phase, cells
15
Figure 1.3. Cell cycle checkpoints activated in response to DNA double-strand breaks.
ATM and ATR kinases are activated in response to DSBs and transduce signals to effector proteins that halt cell cycle progression. In response to DNA damage, the cell cycle can be arrested at various positions including at the G1/S transition (G1/S checkpoint), within S-phase (the S-phase checkpoint), and at the G2/M transition (the G/M checkpoint). Interestingly, the mechanisms that control cell cycle arrest are different for the three checkpoints. (A) The G1/S checkpoint relies on the inhibition of the S-phase-promoting cyclin E/CDK2 kinase by either p21 or through the inactivation of the CDC25A phosphatase. (B) The S-phase checkpoint relies on the inhibition of Treslin by CHK1 and CHK2. Treslin promotes origin firing during S-phase by directly interacting with TOPBP1 and CDC45. In response to DNA damage, CHK1/2-mediated phosphorylation inhibits the function of Treslin in promoting DNA replication. (C) The G2/M checkpoint relies on the inhibition of the mitosis-promoting cyclin B/CDK1 kinase by the inactivation of the CDC25C phosphatase. Various mechanisms have been described for inhibiting CDC25C in G2 cells exposed to agents that induce DSBs (see main text). Initiation of the G2/M checkpoint does not dependent on p53 but a slower transcriptional program controlled by p53 is required to maintain the checkpoint.
16
activate a checkpoint that can inhibit later firing origins of replication that have not yet been initiated. The S-phase checkpoint is best understood in the budding yeast Saccharomyces cerevisiae and is controlled by Mec1 (ATR in humans) and the checkpoint kinase Rad53 (CHK2 in humans) (Santocanale and Diffley, 1998; Shirahige et al., 1998). In response to DNA damage, Sld3 (Treslin/TICRR in humans) is phosphorylated by Rad53 which inhibits its function in promoting the firing of late replication origins (Fig. 1.3B) (Boos et al., 2013; Lopez-Mosqueda et al., 2010; Zegerman and Diffley, 2010). Phosphorylation of Sld3 (Treslin) abrogates its CDK- dependent interaction with the origin firing factors Dpb11 (TOPBP1 in humans) and Cdc45 (Boos et al., 2011; Zegerman and Diffley, 2010).
1.4.2.3 G2/M checkpoint
The G2/M checkpoint prevents G2 cells from entering mitosis when DSBs are present. Similar to cells in S-phase, G2 cells can activate both the ATM and ATR kinases in response to DSBs (Fig. 1.3C). The critical target of the G2/M checkpoint is the mitosis-promoting activity of the cyclin B/CDK1 kinase (Lundgren et al., 1991). The ATM/ATR-dependent activation of CHK1 and CHK2 in G2 cells results in the phosphorylation of the phosphatase CDC25C on serine 216, creating a 14-3-3 binding site (Peng et al., 1997; Sanchez et al., 1997). Binding of 14-3-3 proteins to phosphorylated CDC25C sequesters it in the cytoplasm and inhibits its phosphatase activity towards nuclear cyclin/CDK substrates (Dalal et al., 1999; Yang et al., 1999). Inhibition of the CDC25C phosphatase enables the nuclear accumulation CDK1 that is phosphorylated on tyrosine 15 by WEE1 (Krek and Nigg, 1991; Lundgren et al., 1991; Norbury et al., 1991). WEE1-dependent phosphorylation of CDK1 inhibits its kinase activity and arrests cells in G2. The checkpoint mediator proteins 53BP1, RNF8, BRCA1 are also critical for establishing a robust G2/M checkpoint, likely through the promotion of ATM/ATR and CHK1/CHK2 activity (Fernandez-Capetillo et al., 2002; Huen et al., 2007; Kolas et al., 2007; Mochan et al., 2004; Wang et al., 2002; Yarden et al., 2002). One particular 14-3-3 family member, 14-3-3σ, is expressed in a p53-dependent manner and is required for the maintenance of the G2/M checkpoint (Hermeking et al., 1997). Interestingly, 14-3-3σ, unlike other family members, cannot bind to phosphorylated CDC25C and inhibits entry into mitosis by an alternative mechanism (Chan et al., 1999). In response to DNA damage, 14-3-3σ inhibits mitotic entry by directly sequestering cyclin B/CDK1 in the cytoplasm. The G2/M checkpoint can also be activated in a CHK1/2-independent manner by the p38 and MAPKAP kinase-2 (MK2) kinases (Bulavin et al.,
17
2001; Manke et al., 2005). In response to DNA damage, p38 and MK2 are activated by ATM/ATR and, like CHK1/2, can control the checkpoint response through the phosphorylation- dependent inhibition of CDC25C. MK2 can directly phosphorylate CDC25C on serine 216 which creates a binding site for 14-3-3 proteins. MK2 can also promote the G2/M checkpoint in the cytoplasm where it controls the posttranscriptional modulation of gene expression (Reinhardt et al., 2010). MK2 can phosphorylate the RNA binding proteins hnRNPA0, TIAR, and PARN, which then bind to and stabilize the messenger RNA of the cell cycle inhibitor gene GADD45α. The maintenance of the G2/M checkpoint relies on a transcriptional program regulated by p53, leading to the upregulation of cell cycle inhibitors including p21, GADD45α, and 14-3-3 proteins (Agarwal et al., 1995; Hermeking et al., 1997; Wang et al., 1999). However, unlike the G1/S checkpoint, cells can initiate a robust G2/M checkpoint in the absence of p53 (Kastan et al., 1991; Levedakou et al., 1995). This observation spurred efforts to therapeutically interfere with the G2/M checkpoint as a potential strategy to sensitize p53-deficient cancer cells to radiation- or chemotherapy-induced DNA damage (McNeely et al., 2014; Russell et al., 1995; Wang et al., 1996; Wang et al., 2001). The most promising therapeutic strategies for inhibiting the G2/M checkpoint in p53-deficient tumours have involved small molecule inhibitors of the ATR, CHK1, and WEE1 kinases.
1.4.3 DNA double-strand break-induced senescence and apoptosis
Cells that experience a degree of DNA damage that is beyond repair can undergo permanent cell cycle arrest (senescence) or programmed cell death. DSB-induced activation of ATM and ATR leads to the phosphorylation and activation of p53 (Canman et al., 1998; Siliciano et al., 1997; Tibbetts et al., 1999). The p53 transcription factor not only mediates the G1/S transient checkpoint but also initiates senescence or apoptotic programs if too many DSBs are present or if their repair is delayed or defective (Di Leonardo et al., 1994; Lowe et al., 1993). The molecular mechanisms that control these p53-dependent cell fate decisions in response to genotoxic stress are largely unknown. It has been suggested that cell fate decisions may be controlled by the level of p53 protein (Batchelor et al., 2011; Batchelor et al., 2008; Loewer et al., 2010). For example, in response to DSBs by ionizing radiation, the levels of p53 exhibit a series of pulses with fixed amplitude and frequency. Higher doses of ionizing radiation increase the number of pulses without affecting their amplitude. Remarkably, precisely timed drug additions that produce a sustained p53 protein pulse can push cells towards senescence and apoptosis rather than a
18
transient cell cycle arrest (Purvis et al., 2012). Controlling the p53 pulse can have critical therapeutic implications when treating tumours that overexpress the oncogenic p53 inhibitor MDMX (Chen et al., 2016). Tumour cells treated with a MDMX inhibitor initiate a rapid pulse in p53 protein levels followed by low-amplitude oscillations. Remarkably, the exposure of cells with ionizing radiation during the p53 pulse coincides with apoptosis. In stark contrast, programmed cell death is inhibited when cells are treated with ionizing radiation after the pulse, when p53 levels demonstrate low-amplitude oscillations.
1.5 Repair of DNA double-strand breaks
In addition to the modulation of cell cycle progression and apoptosis, the PIKK-mediated cellular response to DSBs also involves the coordination of repair enzymes to promote the re-joining of DSB ends. Eukaryotic cells have evolved two major pathways to re-ligate DSBs: non- homologous end joining (NHEJ) and homologous recombination (HR).
1.5.1 Non-homologous end joining
The fastest way to re-join a DSB is to ligate it back together utilizing NHEJ (Fig. 1.4). First, DSB ends are detected and bound by the KU70/80 heterodimer (Mimori and Hardin, 1986). Structural studies have shown that KU70/80 forms a ring with a hole that fits double-stranded DNA ends (Walker et al., 2001). It is thought that KU70/80 is the first factor to bind DSBs due to its high nuclear abundance and strong affinity for DNA ends (Mimori and Hardin, 1986). KU70/80 acts as a molecular scaffold for the recruitment of NHEJ effectors including nucleases, polymerases, and at least one ligase. As discussed previously, the extreme C-terminus of KU80 is required for the recruitment of DNA-PKcs (Gell and Jackson, 1999). An interaction between two DNA-PKcs molecules across a DSB can tether the broken ends together in a so-called ‘synaptic complex’ which holds the broken ends in close proximity and inhibits attack by nucleases (DeFazio et al., 2002). DNA-PKcs dimerization across a DSB also stimulates its kinase activity which is required for DSB repair by NHEJ (Kurimasa et al., 1999; Meek et al., 2007). In the final step of NHEJ, the DSB ends are re-joined by the action of DNA ligase IV in complex with XRCC4, XLF, and PAXX (Ahnesorg et al., 2006; Grawunder et al., 1997; Ochi et al., 2015; Wilson et al., 1997). For ligation to occur there must be undamaged nucleotides at both break ends. DSBs generated by ROS often have complex chemical structures and the multi-
19
Figure 1.4. DNA double-strand break repair by non-homologous end joining.
Repair of DSBs by NHEJ relies on the detection of DSB ends by the KU70/80 heterodimer. The DNA-PKcs kinase is recruited to DSB sites by KU70/80 where it dimerizes across a DSB to form a ‘synaptic complex’. DNA-PKcs dimerization and binding to KU70/80 stimulates its kinase activity. Next, the DSB ends can be processed by enzymes to ensure they are compatible for ligation. The DNA ligase IV complex is responsible for the final re- joining step.
20
functional nuclease Artemis is responsible for removing damaged nucleotides and secondary structures that may inhibit DSB re-joining (Ma et al., 2002; Moshous et al., 2001). Polynucleotide kinase/phosphatase (PNKP) also plays an important role in promoting DNA ligase IV-dependent joining by ensuring that 5’ DNA termini are phosphorylated and 3’ termini are not (Chappell et al., 2002; Koch et al., 2004). Moreover, ligation is inhibited when adenylate groups are covalently linked to the 5’ termini of breaks. Aprataxin (APTX) catalyzes the nucleophilic release of adenylate groups resulting in termini that can again serve as a substrate for DNA ligase IV (Rass et al., 2007). Several DNA polymerases have important roles in NHEJ. Polymerase µ is a highly versatile enzyme that has both template-dependent and template- independent synthesis capabilities and can add nucleotides to DSB ends (Mahajan et al., 2002a). When the resulting short 3’ overhangs share even one nucleotide of complementarity, ligation by the DNA ligase IV complex is enhanced. Remarkably, polymerase µ, together with KU70/80 and the DNA ligase IV complex, can polymerize across a discontinuous template strand, thereby crossing from one DNA end to another (Nick McElhinny et al., 2005). In addition, polymerase λ only has template-dependent synthesis activity and fills in short gaps that form after annealing of 1-4 base pairs of homology at the DSB end (Garcia-Diaz et al., 2009; Lee et al., 2004). Through a largely unknown mechanism, 53BP1 and its effector proteins RIF1 and MAD2L2 have also been shown to bind near to the sites of DSB ends and promote long-range NHEJ reactions, particularly during V(D)J and class-switch recombination (Boersma et al., 2015; Chapman et al., 2013; Di Virgilio et al., 2013; Difilippantonio et al., 2008; Dimitrova et al., 2008; Escribano- Diaz et al., 2013; Ward et al., 2004; Xu et al., 2015; Zimmermann et al., 2013).
1.5.1.1 Alternative end-joining
DSBs can also be repaired by an alternative end-joining mechanism termed microhomology- mediated end joining (MMEJ). Annealing of 5-25 base pair microhomologous sequences at each of the broken DSB ends is a requirement for MMEJ (Roth and Wilson, 1986). MMEJ can be KU70/80-indepedent but requires the activity of poly(ADP-ribose) polymerase 1 (PARP-1) (Audebert et al., 2004). In order for microhomologies to be exposed for annealing, limited DNA end resection is required to generate short 3’ ssDNA overhangs (Moore and Haber, 1996). Furthermore, small segments of microhomology can be introduced to the DSB ends by the template-independent activity of polymerase θ (Ceccaldi et al., 2015; Mateos-Gomez et al., 2015). Next, the exposed ssDNA homologies at each DSB end can anneal together and the
21
template-dependent activity of polymerase θ fills in the gaps before ligation. In contrast to canonical NHEJ which employs DNA ligase IV, ligation during MMEJ is conducted by DNA ligase III (Audebert et al., 2004). If microhomologies are present internally at the DSB ends 3’ flaps will be generated and will need to be resolved by nucleases which can result in deletions (Ma et al., 2003). Therefore, MMEJ is a mutagenic DSB repair pathway and the cellular relevance of such an error-prone mechanism still remains unclear. However, it is conceivable that MMEJ may contribute to the stability of small repetitive elements in genomes such as centromeres and telomeres (Capper et al., 2007). In addition, MMEJ is particularly relevant in developing B-cells where it plays an important role in class-switch recombination (Boboila et al., 2012).
1.5.2 Mitotic homologous recombination
NHEJ can re-join DSBs in the G1-, S-, and G2-phases of the cell cycle. In contrast, HR occurs mainly in S and G2 cells, after replication has generated an identical sister chromatid. Indeed, the two sister chromatids, by virtue of being identical copies of each other and for being in close proximity, are the preferred template for HR. As a consequence of this cell cycle-phase preference, HR occurs primarily in dividing cells, whereas terminally differentiated cells rely on NHEJ (Gao et al., 1998). As most mammalian cell types are non-dividing, the majority of DSBs are likely repaired by NHEJ. However, HR is a critical DNA repair process during proliferative stages of development and somatic cell renewal in mammals. HR can re-join DSBs generated in both mitotically and meiotically dividing cells. Although the mechanistic details of mitotic and meiotic HR are similar, some differences do exist and will be highlighted in section 1.6.3.5. In contrast to mammalian cells, HR is the primary DSB repair pathway in the budding yeast Saccharomyces cerevisiae. The mechanistic details of HR were first uncovered in budding yeast but are highly conserved throughout evolution. For this section, I will focus mainly on primary research that was conducted in budding yeast. A description of yeast HR genes will be given but all corresponding human homologs with different names will be supplied in parentheses.
1.5.2.1 DNA end resection initiates homologous recombination
The first step in HR is the detection of DSB ends by the Mre11-Rad50-Xrs2 (MRE11-RAD50- NBS1) or MRX (MRN) complex (Fig. 1.5) (Raymond and Kleckner, 1993). In addition to activating Tel1 (ATM), MRX promotes a process termed DNA end resection that generates 3’
22
ssDNA tails at each end of the DSB (Moreau et al., 2001; Nairz and Klein, 1997; Paull and Gellert, 1998). End resection is activated during the S- and G2-phases of the cell cycle by CDK- mediated phosphorylation of Sae2 (CtIP) which physically associates with the MRX complex and promotes the nuclease activity of Mre11 (Huertas et al., 2008; Huertas and Jackson, 2009; Sartori et al., 2007). Therefore, in addition to the proximal availability of the sister chromatid, the CDK-dependent initiation of end resection by Sae2 (CtIP) also restricts HR to S and G2 cells. The predicted nuclease activity required for processing DSB ends into 3’ ssDNA overhangs is a 5’-3’ exonuclease. However, Mre11 lacks this activity and is instead a bifunctional nuclease, containing both endonuclease and 3’-5’ exonuclease activity (Paull and Gellert, 1998). It was later determined that Mre11 utilizes both its endonuclease and exonuclease activity to promote end resection (Garcia et al., 2011). First, using its endonuclease activity, Mre11 nicks the strand to be resected up to 300 base pairs from the 5’ terminus of the DSB end. The nick enables end resection in a bidirectional manner, whereby Mre11 exonuclease activity promotes 3’-5’ resection towards the DSB end and the exonucleases Exo1 and Dna2 carry out resection in the 5’-3’ direction away from the DSB end (Garcia et al., 2011; Zhu et al., 2008).
End resection can be separated into two functional steps: short-range (or initiation) and long-range. As Mre11/Sae2 (CtIP) initiate end resection proximal to the DSB end and in the direction towards the break, only short tracts of 3’ ssDNA tails are generated. Therefore, Mre11/Sae2 (CtIP)-dependent 3’ to 5’ processing is termed short-range end resection. The downstream events for HR require long stretches of ssDNA and this is achieved by the exonuclease activity of Exo1 and Dna2, in collaboration with the RecQ helicase Sgs1 (BLM) (Zhu et al., 2008). These nucleases resect DNA in the 5’ to 3’ direction (away from the DSB end) in a process called long-range end resection. In human cells, an additional nuclease, the 3’- 5’ exonuclease EXD2, has been implicated in end resection and can process DSB ends in parallel with MRE11 (Broderick et al., 2016). End resection generates long stretches of ssDNA that are rapidly coated by the RPA complex (Longhese et al., 1994). Through a positive feedback mechanism, the RPA complex stimulates further end resection by Exo1 and Dna2 while also inhibiting the degradation of ssDNA tails and the formation of hairpin structures (Chen et al., 2013).
23
Figure 1.5. DNA double-strand break repair by mitotic homologous recombination.
Repair of DSBs by HR in mitotically dividing cells primarily occurs in the presence of a sister chromatid (in the S- and G2-phases of the cell cycle). The rate-limiting step for HR is the generation of 3’ ssDNA overhangs at each end of a DSB by DNA end resection. Bi-directional end resection is conducted by nucleases including MRE11, EXO1, and DNA2. The defining step of HR is the loading of the RAD51 recombinase onto ssDNA which facilitates strand invasion and homology search in the sister chromatid.
24
1.5.2.2 Rad51 assembly and the search for homology
After end resection, RPA bound to ssDNA is exchanged for the Rad51 recombinase (Sugiyama et al., 1997). Rad51 catalyzes the defining step of HR, strand exchange, during which ssDNA invades the sister chromatid, displacing the complementary strand of the duplex to form a displacement loop (D-loop) (Petukhova et al., 2000; Sugawara et al., 1995). However, RPA impedes Rad51 loading to ssDNA and the recombinase needs accessory factors, called recombination mediators. The prototypical recombination mediator is Rad52 which is recruited to resected DSBs through an interaction with RPA (New et al., 1998). Rad52 also interacts with Rad51 to stimulate RPA removal from ssDNA and replacement with Rad51 (Song and Sung, 2000; Stasiak et al., 2000). Rad51 forms a right-handed nucleoprotein filament when loaded onto ssDNA (Ogawa et al., 1993). Formation of filaments is also stimulated by Rad52 (McIlwraith et al., 2000; Sung and Robberson, 1995). In contrast to budding yeast, mammalian cells lacking RAD52 do not display DNA damage sensitivity and only have minor defects in HR (Rijkers et al., 1998). In mammalian cells, the BRCA1-PALB2-BRCA2 complex targets RAD51 to ssDNA and thereby promotes RAD51 to replace RPA and form a nucleoprotein filament (Buisson et al., 2010; Carreira et al., 2009; Dray et al., 2010; Jensen et al., 2010; Moynahan et al., 1999; Sy et al., 2009; Yang et al., 2005). HR is also stimulated by a group of mediator proteins called the RAD51 paralogs. Five RAD51 paralogs have been identified in mammalian species and they interact with one another to form two distinct complexes: RAD51B-RAD51C-RAD51D-XRCC2 (BCDX2) and RAD51C-XRCC3 (CX3) (Masson et al., 2001). Loss of the RAD51 paralogs leads to severe HR defects, DNA damage sensitivity, chromosome abnormalities, and aberrant RAD51 focus formation (French et al., 2002; Godthelp et al., 2002; Johnson et al., 1999; Pierce et al., 1999). The Caenorhabditis elegans homologs of the BCDX2 and CX3 complexes, RFS-1/RIP-1, can bind to RAD51 nucleoprotein filaments to promote their stability and flexible confirmation, which facilitates strand exchange with the sister chromatid (Taylor et al., 2015). Rad51 nucleoprotein filaments also stimulate searching for homologous sequence after strand invasion (Sugawara et al., 1995). If homology is found, repair synthesis by polymerases will replace missing nucleotides at the break site by using the invading sister chromatid as a primer (Holmes and Haber, 1999).
25
1.5.2.3 Mechanisms for disengaging from the sister chromatid and completing repair
At this step in the pathway, two sub-pathways of HR emerge: synthesis-dependent strand annealing (SDSA) and double Holliday junction (dHJ) formation (Fig. 1.5). The predominant sub-pathway appears to be SDSA, in which the D-loop is ablated, leading to annealing of the newly synthesized strand with the resected strand of the other DSB end (Ira et al., 2006; Kurkulos et al., 1994; Nassif et al., 1994). Gaps are filled in by polymerases and ends are ligated to re-establish the integrity of the chromatid. Another potential outcome when a sister chromatid is invaded by a nucleoprotein filament is that the newly synthesized strand can be captured by the other resected DSB end forming a dHJ (Parsons and West, 1988; West et al., 1983). Two mechanisms exist for processing dHJs: resolution and dissolution. Resolution involves specialized nucleases that cleave dHJs, allowing the strands to pass over each other and anneal to their respective chromatids (Ip et al., 2008). Crossover products, where a strand destined for one chromatid is annealed to the other, may arise as a consequence of resolution. Crossovers have the potential to cause chromosome rearrangements when recombination occurs between non-allelic repetitive sites (i.e. gene paralogs) (Montgomery et al., 1991). In the second mechanism, dissolution, the dHJs are migrated toward each other by the RecQ helicase Sgs1 (BLM) before cleavage (Karow et al., 2000). Junction migration results in a hemicatenane structure that is eliminated by the Sgs1-Top3-Rmi1 (BLM-TOPIIIα-RMI1-RMI2) complex resulting in non- crossover products (Cejka et al., 2010; Chang et al., 2005; Ira et al., 2003; Singh et al., 2008; Wu et al., 2006; Wu and Hickson, 2003; Xu et al., 2008; Yin et al., 2005). HR is rarely associated with crossovers so the preferred HR sub-pathways are likely SDSA and dHJ dissolution as both promote the formation of non-crossover products. An in vitro biochemical study demonstrated that the formation of dHJs is actively blocked by RAD51 (Wu et al., 2008b). RAD51-dependent inhibition of dHJ formation could act as a simple mechanism for promoting SDSA and preventing crossovers during mitotic HR.
1.5.2.4 Single-strand annealing is an alternative homology-based DNA double-strand break repair mechanism
An alternative HR-based DSB repair pathway is called single-strand annealing (SSA). Akin to the alternative end joining pathway MMEJ, SSA also involves the hybridization of homologous regions at each end of a resected DSB (Fishman-Lobell et al., 1992). In contrast to MMEJ, SSA
26
involves the base pairing of longer homologous regions. Therefore, SSA requires end resection and is mainly active in the S- and G2-phases of the cell cycle (Clerici et al., 2005; Huertas et al., 2008). Moreover, a key difference between MMEJ and SSA is that SSA requires the help of Rad52 for annealing longer regions of ssDNA (Fishman-Lobell et al., 1992; Ivanov et al., 1996). SSA may be important for the repair of larger repetitive regions in the genome such as ribosomal DNA loci (Sfeir and Symington, 2015). Like MMEJ, SSA is also mutagenic because annealing of repetitive sequences that are internal to DSB ends will generate 3’ flap structures that are cleaved by nucleases, resulting in potentially harmful deletions (Fishman-Lobell et al., 1992).
1.5.2.5 Meiotic homologous recombination
SPO11/TOPOVIBL-generated DSBs during meiosis promote the exchange of genetic material between homologous chromosomes and are essential for proper meiotic chromosome segregation (Kolodkin et al., 1986). DSBs formed by SPO11/TOPOVIBL are repaired by HR. The steps of meiotic HR are similar to that of mitotically dividing cells but with several key differences. First, Spo11 forms a covalent protein-DNA linkage at DSB ends which needs to be removed for efficient strand invasion (Keeney et al., 1997). Utilizing their nuclease activities, the short-range end resection machinery composed of the MRX complex and Sae2 (CtIP) can remove Spo11 from DSB ends (Neale et al., 2005). Secondly, two recombinase proteins, Rad51 and its meiosis- specific paralog Dmc1, are required for meiotic HR (Bishop et al., 1992; Shinohara et al., 1992). Both Rad51 and Dmc1 can form nucleoprotein filaments on ssDNA (Gupta et al., 2001; Ogawa et al., 1993). Interestingly, Rad51 mutants proficient at filament formation but defective in strand invasion can effectively complete meiotic HR (Cloud et al., 2012). In contrast, Dmc1 mutants with defective strand invasion activity cannot repair Spo11-generated DSBs. Therefore, Dmc1 appears to be important for the strand invasion step of meiotic HR whereas Rad51 is required for nucleoprotein filament formation. In mammalian cells, loading of RAD51 and DMC1 onto resected DSB ends in meiosis is BRCA2-dependent (Siaud et al., 2004; Thorslund et al., 2007). However, the requirement of BRCA1 and PALB2 in recombinase loading at SPO11/TOPOVIBL-generated DSBs has not been established.
27
1.5.3 DNA end resection is the rate-limiting step of homologous recombination
1.5.3.1 Control of DNA end resection length dictates recombination efficiency
DNA end resection is the main focus of my doctoral research and I will further describe the current knowledge regarding its importance and regulation in eukaryotic cells. The first essential step for all homology-based repair pathways is the generation of ssDNA by end resection. Long ssDNA tracts at DSB ends are required to form Rad51 nucleoprotein filaments of the appropriate length. When the length of the filament is increased, homology is found faster (Forget and Kowalczykowski, 2012). In contrast, short filaments are ineffective at sister chromatid strand invasion and homology searching. Therefore, resection length is correlated to HR efficiency. In budding yeast, resection length away from a DSB end can vary from 2000 to 4000 nucleotides (Chung et al., 2010). These long ssDNA tracts are dependent on the processive 5’-3’ nucleases responsible for long-range end resection, Exo1 and Dna2 (Zhu et al., 2008). In cells lacking both Exo1 and Dna2, resection lengths are substantially shorter which inhibits the completion of HR (Chung et al., 2010).
However, hyper-resection or the over-production of ssDNA in the genome can also have adverse cellular consequences. Hyper-resection can increase the proportion of ectopic HR reactions where inappropriate genomic regions with similar nucleotide sequences (i.e. gene paralogs) are used as a template resulting in genome rearrangements (Montgomery et al., 1991). Hyper-resection can also increase the likelihood of exposing repetitive regions that can engage in mutagenic SSA reactions (Fishman-Lobell et al., 1992). Not surprisingly, mechanisms have evolved to curtail end resection, keeping HR in check. Compared to proteins that promote end resection, less is known about inhibitory factors. One such factor, mammalian DNA helicase B (HELB), is recruited to ssDNA by interacting with RPA and it uses its 5’-3’ ssDNA translocase activity to curtail long-range end resection mediated by EXO1 and DNA2 (Tkac et al., 2016). HELB is recruited in an RPA-dependent manner illustrating an elegant feedback mechanism to inhibit on-going end resection. End resection can also be inhibited by controlling CtIP protein levels through the action the prolyl isomerase PIN1 (Steger et al., 2013). PIN1 inhibits CtIP by modulating the isomerization of prolines within several of its CDK sites. PIN1 activity promotes CtIP ubiquitylation and subsequent proteasomal degradation. Therefore, PIN1 deficient cells
28
have high CtIP levels and hyper-resect DSB ends which leads to more mutagenic repair by SSA. Another impediment to end resection is the presence of nucleosomes (Adkins et al., 2013). Not surprisingly, nucleosomes antagonize long-range resection by Exo1 and Dna2 more than short- range resection by MRX and Sae2 (CtIP). Two members of the SNF2 ATPase family of chromatin remodeling enzymes, Fun30 (SMARCAD1) and SRCAP, appear to be the major remodeling enzymes that are recruited to DSB sites to facilitate long-range end resection (Costelloe et al., 2012; Dong et al., 2014).
1.5.3.2 DNA end resection versus end protection: the cross-roads of DSB repair pathway choice
NHEJ and HR are the principal pathways for DSB repair and the choice between them depends on the species, cell type, cell cycle stage, and type of DNA damage. The greatest determinant of DSB repair pathway choice is position within the cell cycle. NHEJ is active in the G1-, S-, and G2-phases of the cell cycle, whereas HR is active after replication in S and G2 cells, where a suitable homologous template is available. Cell cycle-regulated choice between DSB repair pathways is critical for the promotion of genome integrity. For instance, HR is required in S- phase to re-establish collapsed replication forks by repairing one-ended DSBs. Inappropriate NHEJ of these breaks is associated with genome rearrangements and cell death (Bouwman et al., 2010; Bunting et al., 2010; Saberi et al., 2007). In the same vein, inappropriate HR reactions in G1-phase can result in strand invasion and homology-driven copying of a homologous chromosome rather than a sister chromatid, potentially resulting in loss of heterozygosity (LOH) (Little and Benjamin, 1991).
The major control step in the choice between NHEJ and HR is the generation of ssDNA by end resection; once a DSB end is resected NHEJ cannot be performed. However, Ku70/80 binds DSB ends rapidly, before end resection has initiated and can protect DSB ends from the long-range end resection machinery (Fig. 1.6A) (Shao et al., 2012; Sun et al., 2012). In budding yeast, the inhibitory action of Ku70/80 on end resection in S and G2 cells is alleviated by the CDK-dependent activity of the MRX complex and Sae2 (CtIP) (Fig. 1.6A) (Mimitou and Symington, 2010; Shim et al., 2010). One model is that the MRX complex and Sae2 (CtIP) directly removes Ku70/80 from DSB ends. After end resection, Ku70/80 is unable to bind DSB ends and the break is now committed to being repaired by HR, SSA, or MMEJ (Mimori and Hardin, 1986). Moreover, end resection is suppressed in G1 cells through CtIP ubiquitylation and
29
degradation by the proteasome (Lafranchi et al., 2014). CtIP ubiquitylation is conducted by the late mitosis/G1-specific E3 ubiquitin ligase APC/C (CDH1). In addition to KU70/80, 53BP1 and its effector proteins RIF1, PTIP, and MAD2L2 play an important role in protecting DSB ends from end resection in mammalian cells (Fig. 1.6B) (Boersma et al., 2015; Bunting et al., 2010; Chapman et al., 2013; Di Virgilio et al., 2013; Escribano-Diaz et al., 2013; Xu et al., 2015; Zimmermann et al., 2013). RIF1 accumulation at DSBs is strongly inhibited by BRCA1-CtIP in the S- and G2-phases of the cell cycle, promoting end resection and HR (Escribano-Diaz et al., 2013). In contrast, 53BP1 and RIF1 antagonize the accumulation of BRCA1 at DSB sites during the G1-phase, promoting NHEJ.
1.5.4 Cell cycle-regulated suppression of homologous recombination in G1 cells downstream of end resection
In addition to the antagonism of end resection in G1 by KU70/80, 53BP1, RIF1, PTIP, and MAD2L2, HR is also inhibited in G1 cells downstream of end resection through the ubiquitin- dependent regulation of the BRCA1-PALB2-BRCA2 mediator complex (Orthwein et al., 2015). In S and G2 cells, BRCA1 directly interacts with PALB2 and promotes the recruitment of the HR factors BRCA2 and RAD51 to DSB sites. In G1 cells, the BRCA1-PALB2 interaction is ablated by PALB2 ubiquitylation which is carried out by an E3 ubiquitin ligase composed of KEAP1, CUL3, and RBX1. Cells deficient in KEAP1 can assemble a stable BRCA1-PALB2- BRCA2 complex in all cell cycle phases. In S and G2 cells, a deubiquitylase, USP11, is responsible for keeping PALB2 in a hypo-ubiquitylated state, promoting the assembly of the BRCA1-PALB2-BRCA2 complex and subsequent loading of RAD51 (Orthwein et al., 2015; Schoenfeld et al., 2004; Wiltshire et al., 2010). Remarkably, it has been demonstrated that HR can be activated in G1 cells by simply overriding the triple block to resection by expressing a hyper-active phospho-mimetic CtIP mutant (T847E) in a 53BP1 and KEAP1 deficient background (Huertas and Jackson, 2009; Orthwein et al., 2015). In addition, HR is also inhibited in G1 cells by the action of the microRNAs miR-1255b, miR-148b*, and miR-193b* (Choi et al., 2014). These microRNAs target the transcripts of several HR factors including BRCA1, BRCA2, and RAD51, which decreases their expression in G1 cells.
30
Figure 1.6. The regulation of DNA double-strand break repair pathway choice.
(A) DSB ends can be simultaneously recognized by the KU70/80 and MRN complexes. In G1 cells, CtIP is in an inactive state and cannot promote MRE11-dependent end resection. Therefore, KU70/80 can function in stimulating NHEJ. In S and G2 cells, CtIP is activated by CDK-dependent phosphorylation which stimulates the function of MRE11 in initiating end resection. MRE11/CtIP-dependent resection antagonizes the function of KU70/80 possibly by directly removing it from DSB ends. Resected DSBs are committed to repair by HR. (B) The ubiquitin-dependent recruitment of 53BP1 to DSB sites is also important for regulating the choice between NHEJ and HR. In G1 cells, 53BP1 recruits its effector proteins RIF1 and MAD2L2 which inhibit end resection and promote NHEJ. In S and G2 cells, BRCA1 and CtIP inhibit recruitment of RIF1/MAD2L2 to DSB sites, stimulating end resection and HR.
31
1.6 Post-translational modifications and the regulation of DNA double-strand break repair
1.6.1 Ubiquitin-dependent signaling at DNA double-strand breaks
PIKK activation also sets into motion an ubiquitylation-based signaling cascade on chromatin flanking DSB sites that is required for recruiting checkpoint and repair factors. In response to DSBs, MDC1 can bind to γH2AX where it is also phosphorylated by ATM (Stucki et al., 2005). Phosphorylated MDC1 in turn is recognized by the FHA domain of the RING-type E3 ubiquitin ligase RNF8 which catalyzes the ubiquitylation of proteins at DSB sites (Huen et al., 2007; Kolas et al., 2007; Mailand et al., 2007). RNF8-dependent ubiquitylation events are recognized by the ubiquitin binding domains of RNF168, another E3 ubiquitin ligase (Doil et al., 2009; Stewart et al., 2009). RNF8/RNF168 promote the conjugation of lysine 63 (K63)-linked ubiquitin to their substrates which include the histones H1 and H2A (Doil et al., 2009; Mailand et al., 2007; Mattiroli et al., 2012; Stewart et al., 2009; Thorslund et al., 2015). The critical outcome of RNF8/RNF168-dependent ubiquitylation is the recruitment of DSB repair and signaling proteins to chromatin surrounding the break site. Proteins that localize to DSBs in a RNF8-dependent manner include 53BP1, RAD18, BRCA1, RAP80, and HERC2 (Bekker-Jensen et al., 2010; Huang et al., 2009; Huen et al., 2007; Kolas et al., 2007; Mailand et al., 2007; Silverman et al., 2004; Wang and Elledge, 2007). The RNF8 pathway can promote NHEJ, especially in developing lymphocytes during class-switch recombination (Ramachandran et al., 2010). Whether or not the RNF8 pathway has a functional role in directly promoting HR is less clear. Importantly, by promoting the recruitment of 53BP1, RIF1, PTIP, and MAD2L2, the RNF8 pathway plays a critical function in regulating the choice between NHEJ and HR.
In addition to RNF8/RNF168 many other E3 ubiquitin ligases have been implicated in controlled the cellular response to DSBs. The ubiquitin ligase RNF138, in collaboration with its E2 conjugating enzyme UBE2D, is also recruited to DSB sites and promotes end resection and HR (Ismail et al., 2015; Schmidt et al., 2015). Following DSB formation, KU70/80 and the MRN complex rapidly and independently bind to DSB sites (Britton et al., 2013). The zinc finger domains of RNF138 are thought to recognize short tracks of ssDNA generated by MRN/CtIP, whereas its ubiquitin-binding domains likely stabilize it at DSB sites (Ismail et al., 2015; Schmidt et al., 2015). RNF138 can ubiquitylate KU70/80 which removes it from DSB ends and
32
promotes end resection (Ismail et al., 2015). RNF138 can also mediate ubiquitylation of CtIP to enhance its recruitment to DSB sites and stimulate end resection (Schmidt et al., 2015).
1.6.2 SUMOylation at DNA double-strand breaks
Many components of the SUMOylation pathway are recruited to DSB sites and promote both NHEJ and HR (Galanty et al., 2009; Morris et al., 2009). The SUMO-specific E3 ligase PIAS1 promotes the recruitment of BRCA1 and RAP80 to DSB sites whereas PIAS4 enhances the localization of RNF168 and subsequent conjugation of K63-linked ubiquitin chains. Only a handful of SUMOylated substrates at DSB sites have been identified including MDC1, RNF168, HERC2, 53BP1, BRCA1, RPA70, and BLM (Danielsen et al., 2012; Dou et al., 2010; Galanty et al., 2009; Morris et al., 2009; Ouyang et al., 2009). SUMOylation at DSB sites has been directly implicated in the regulation of HR. First, RPA70 is able to physically interact with the SUMO protease SENP6 in S-phase which keeps RPA70 in a hypo-SUMOylated state (Dou et al., 2010). In response to DNA damage, SENP6 dissociates from RPA70 allowing for the conjugation of SUMO2/3. SUMOylated RPA70 promotes the recruitment of RAD51 and thus stimulates HR. Next, the RecQ helicase BLM, implicated in both end resection and dHJ dissolution, is also SUMOylated (Ouyang et al., 2009). The expression of BLM mutants that cannot be SUMOylated results in DNA damage sensitivity and impaired RAD51 recruitment. Interplay between ubiquitylation and SUMOylation has also be demonstrated to promote HR through the action of the SUMO-targeted ubiquitin ligase RNF4 (Galanty et al., 2012; Yin et al., 2012). RNF4 is recruited to DSB sites through its N-terminal SUMO-interacting motifs which bind to SUMOylated proteins such as 53BP1, MDC1, and RPA. RNF4-mediated ubiquitylation regulates the rate of turnover of its substrates by targeting them for proteasomal degradation. Remarkably, RNF4 also promotes the recruitment of the proteasome to DSB sites (Galanty et al., 2012). Importantly, RNF4-mediated ubiquitylation and subsequent proteasomal degradation of RPA70 at DSB sites promotes HR by enhancing the exchange of RPA for RAD51 (Galanty et al., 2012; Yin et al., 2012).
1.6.3 Protein acetylation at DNA double-strand breaks
Several studies have outlined the importance of acetylation for regulating protein localization and function at DSBs. For example, both NBS1 and RAD51 are kept in a hypo-acetylated state by SIRT1 which promotes their recruitment to DSBs (Oberdoerffer et al., 2008). Furthermore,
33
CtIP deacetylation by SIRT6 is important for stimulating its activity in end resection (Kaidi et al., 2010). Protein deacetylation is also important for DSB repair by NHEJ (Miller et al., 2010). Deacetylation of H3K56 by HDAC1 and HDAC2 is important for the recruitment of KU70/80 and Artemis to DSBs. Furthermore, TIP60 forms a stable complex with ATM and promotes its acetylation in response to DNA damage (Sun et al., 2005). Suppression of TIP60 blocks ATM activation and prevents the ATM-dependent phosphorylation of both p53 and CHK2. In response to DNA damage, TIP60 is activated by c-Abl-dependent tyrosine phosphorylation (Kaidi and Jackson, 2013). The mechanism for c-Abl activation is less clear but may involve DNA damage- induced chromatin reorganization.
1.6.4 PARylation is one of the earliest protein modifications detected at DNA double-strand breaks
PARPs are enzymes that catalyze the addition of poly(ADP-ribose) (PAR) chains on protein substrates (Stone and Shall, 1973). PARP enzymes have been implicated in PARylating proteins present at both SSBs and DSBs (Durkacz et al., 1980; Ikejima et al., 1990; Rulten et al., 2011). Importantly, PAR chains are transient protein modifications that are removed quickly at DNA breaks by the action of PAR glycohydrolase (PARG) (Schreiber et al., 2006). PARP-1 physically associates with XRCC1 and is required for sensing SSBs and promoting the recruitment of downstream repair enzymes to re-join the broken ends (Caldecott et al., 1996). PARP-1 also enhances DSB repair by promoting the recruitment of several repair enzymes to DNA damage sites including MRE11, NBS1, APLF, and TIMELESS (Ahel et al., 2008; Bryant et al., 2009; Haince et al., 2008). PARP-1 is required for the repair of DSBs by the alternative KU70/80- indepentent MMEJ pathway (Audebert et al., 2004). Moreover, PARylation is important for the localization of chromatin remodeling factors to DSB sites including ALC1, CHD2, CHD4, EZH2, PCGF2, and BMI (Luijsterburg et al., 2016; Seeber et al., 2013). PARP-dependent chromatin decompaction at DNA damage sites likely promotes accessibly to DNA repair enzymes.
1.7 Relevance of DSB repair to human physiology
Proper repair of DSBs is critical for the maintenance of genome integrity and cellular fitness. It is not surprising that germline mutations in DSB repair genes give rise to a set of human maladies, termed genome instability syndromes. One of the first characterized genome instability
34
syndromes is Ataxia-Telangiectasia (A-T), which is caused by autosomal-recessive mutations in the ATM gene (Savitsky et al., 1995). Cells from A-T patients are defective in checkpoint activation and DSB signaling and repair. As a consequence, clinical features include immunodeficiency (due to impaired V(D)J and class-switch recombination), neurodegeneration, and a predisposition to the development of cancer. A list of other genome instability disorders is summarized in Table 1.1.
1.7.1 DSB repair promotes tumour suppression
A fundamental hallmark of cancer is genome instability (Hanahan and Weinberg, 2000). DSB repair is critical for maintaining genome stability and thus for inhibiting tumourigenesis. This is illustrated by the cancer predisposition observed in some genome instability syndromes. Defective DSB repair has the unique ability to elicit chromosome rearrangements that can drive cancer development. For example, aberrant DSB repair in V(D)J or class-switch recombination can fuse proto-oncogenes to antigen receptor loci resulting in the development of lymphoid tumours (Schlissel et al., 2006).
1.7.1.1 Defective HR is associated with a predisposition to breast and ovarian cancers
Mutations in genes that promote DSB repair by HR have been associated with the development of breast and ovarian cancers (Kato et al., 2000; Miki et al., 1994; Narod et al., 1993; Soria- Bretones et al., 2013; Wooster et al., 1994). The greatest risk factor for breast and ovarian cancer are germline mutations in one of the breast cancer susceptibility genes, BRCA1, BRCA2, or PALB2, which are required for HR. Furthermore, mutations in the RAD51 paralogs RAD51C and RAD51D can predispose individuals to ovarian cancer, whereas mutations in RAD51B can lead to breast cancer susceptibility (Golmard et al., 2013; Loveday et al., 2011; Meindl et al., 2010). Therefore, many HR genes can also be considered tumour suppressor genes.
1.7.2 DSB repair is important during cerebral cortical development
The analysis of genome instability syndromes has allowed researchers to gain perspectives on the relative importance of DSB repair for the development of different tissues. One common feature of these syndromes is neonatal microcephaly, a clinical term used to describe a reduced head circumference in newborns that is greater than three standard deviations below the mean
35
Syndrome Gene Radio- Immuno- Neonatal Cancer Reference sensitivity deficiency Microcephaly Predisposition HR: Ataxia ATM + + ND + (Savitsky et telangiectasia al., 1995)
Nijmegen NBS1 + + + + (Varon et breakage al., 1998)
Ataxia MRE11 + ND ND ND (Stewart et telangiectasia- al., 1999) like
Nijmegen RAD50 + ND + ND (Waltes et breakage-like al., 2009)
Seckel ATR + ND + + (O'Driscoll ATRIP ND ND + ND et al., 2003; Ogi et al., CtIP + ND + ND 2012; Qvist DNA2 ND ND + ND et al., 2011; Shaheen et al., 2014)
Jawad CtIP ND ND + ND (Qvist et al., 2011)
Bloom BLM + + ND + (Ellis et al., 1995)
NHEJ: LIG4 LIG4 + + + + (O'Driscoll deficiency et al., 2001)
ART-SCID Artemis + + ND + (Moshous et al., 2001)
XLF-SCID XLF + + + ND (Buck et al., 2006)
Table 1.1. Clinical features of some genome instability syndromes associated with defective DSB repair.
List of genome instability syndromes (not exhaustive) outlined the causative gene and common clinical features. ‘+’ indicates that the clinical feature has been described in the literature for the disorder. ‘ND’ indicates that the feature has not been described for the syndrome.
36
(Alcantara and O'Driscoll, 2014). Microcephaly arises as a consequence of defective proliferation of neuroprogenitor cells within the developing cerebral cortex (Bond et al., 2002). Neuroprogenitor proliferation defects have been attributed to both increased apoptosis and cell cycle arrest (Chen et al., 2009; Li et al., 2012). The fact that microcephaly is a common symptom of genome instability disorders suggests that the normal re-joining of DSBs is critical to the developing cerebral cortex. It currently remains unclear why DSB repair appears to be more important in the developing brain compared to other tissues. One hypothesis is that neuroprogenitor cells may have a lower threshold for apoptosis making it easier to activate programmed cell death in the presence of unrepaired DSBs. Indeed, apoptosis is a fundamental component of nervous system development and is required for regulating neural cell numbers, tissue remodeling, and eliminating mis-specified cells (Yamaguchi and Miura, 2015). In addition, it is also possible that neuroprogenitor cells have a lower threshold for activating cell cycle checkpoints in response to DSBs. It is interesting to note that patients with A-T do not display microcephaly (Savitsky et al., 1995). One explanation is that apoptosis or cell cycle checkpoint activation could be ATM-dependent in neuroprogenitor cells. Therefore, unrepaired DSBs would be allowed to persist and the damaged cells could accumulate in the developing brain, potentially contributing to the observed neurodegenerative phenotypes described in A-T patients.
1.7.2.1 Seckel syndrome and DNA end resection
Seckel syndrome is a genome instability disorder with clinical features that include intrauterine growth retardation, dwarfism, a ‘bird-like’ facial appearance, and cognitive delay (Shanske et al., 1997). The defining feature of Seckel syndrome is severe microcephaly. A mutation in the ATR gene was first determined to be a cause for Seckel syndrome (O'Driscoll et al., 2003). Later, another Seckel syndrome patient was found to have a mutation in the ATR interacting protein ATRIP, further suggesting that ATR signaling is important for the development of the cerebral cortex and for the prevention of microcephaly (Ogi et al., 2012). As previously discussed, ATR is activated by genomic regions containing ssDNA bound by RPA. Regions of ssDNA-RPA can arise as a consequence of replication fork stalling and DNA end resection during HR. Remarkably, the core end resection factors CtIP and DNA2 were also found to be mutated in two separate cases of Seckel syndrome, suggesting that ATR activation at resected DSBs may be critical for cerebral cortical development (Qvist et al., 2011; Shaheen et al., 2014). It cannot be
37
excluded that the role of CtIP and DNA2 in HR, rather than ATR activation, contributes to the promotion of neuroprogenitor cell proliferation in the developing cerebral cortex. However, the observation that genes promoting HR downstream of end resection have not been implicated in microcephaly suggests that the function of CtIP and DNA2 in ATR activation, rather than HR, is important for cerebral cortical development. Interestingly, Nijmegen breakage syndrome (NBS) and Nijmegen breakage syndrome-like (NBS-like), where components of the resection- promoting MRN complex are mutated, also display neonatal microcephaly (Varon et al., 1998; Waltes et al., 2009).
1.8 Rationale and research objective
DSB repair by HR is a critical pathway for maintaining genome stability and preventing tumourigenesis. Therefore, at the onset of my doctoral research I sought out to uncover new factors that regulate HR using a functional genomics approach. I decided to focus my attention on the rate-limiting step of HR, DNA end resection. Several core resection factors are known in human cells, including the MRN complex, CtIP, EXO1, DNA2, and BLM. However, these factors were first identified in budding yeast which begs the question if additional factors are required for end resection in human cells. Furthermore, the mechanistic detail regarding the regulation of the known core resection activators is not clear. It is also an open question how resection length is regulated in human cells. Lastly, an important outstanding question is how resection is regulated in the context of chromatin structure. My research objective is to conduct an RNAi screen in human cells to mine for new regulators of end resection in an attempt to characterize new factors and to potentially shed light on these unresolved questions.
1.9 High-throughput functional discovery utilizing RNA interference screens
First discovered in Caenorhabditis elegans and conserved in most eukaryotic species, RNAi is a post-transcriptional gene silencing process that is mediated by double-stranded RNA (Fire et al., 1998). Long precursor RNAs are processed by the ribonuclease DICER into the effectors of RNAi called small interfering RNAs (siRNAs). To elicit target messenger RNA depletion in human cells, siRNAs must not be larger than 30 base pairs as this leads to the activation of the anti-viral interferon response (Stark et al., 1998). RNAi can be used to perturb the expression of specific genes of interest. Genetic perturbation is accomplished by cellular introduction of RNAi
38
reagents such as synthetic siRNAs, endoribonuclease-prepared siRNAs (esiRNAs), or siRNA precursors such as short hairpin RNAs (shRNAs) that are designed to target a specific transcript (Moffat and Sabatini, 2006).
RNAi reagents can be employed for systematic high-throughput screens that assay specific cellular phenotypes. To facilitate large-scale screens, a number of genome-scale RNAi libraries have been developed by academic and commercial entities and can be delivered to cultured cells in one of two screening formats: arrayed or pooled (Moffat and Sabatini, 2006). For arrayed screening, each well of a multi-well plate can contain an RNAi reagent targeting a specific gene. To ensure gene silencing, some libraries employ several RNAi reagents that hybridize to different locations on a particular transcript. Moreover, reagents that target the same transcript can be pooled into one well to minimize the number of samples in a screen. One advantage of the arrayed format is that the gene target for each well can be easily identified during downstream analysis. In addition, more complex cellular phenotypes can be quantified for each sample using a method termed high-content screening where automated microscopes take images of RNAi-treated cells in each well. Sophisticated image analysis software programs are then employed to measure a wide range of subtle cellular phenotypes. For example, our laboratory has conducted high-content siRNA arrayed screens that monitor the localization of proteins to DSB sites which can be visualized cytologically as subnuclear foci (Kolas et al., 2007; O'Donnell et al., 2010; Stewart et al., 2009). A plethora of information about these foci can be measured by high-content analysis including number per nuclei, size, shape, texture, and intensity. RNAi libraries can also be screened in a pooled format. Genome-wide shRNA plasmid libraries can be packaged into viral particles and pooled. The pool of virus is infected into a population of cells and after the duration of the screen the unique shRNA sequences that were integrated into the genome can be PCR amplified using vector-derived primers. The representation of each shRNA sequence (or molecular barcode) in the population can be identified by next-generation sequencing or hybridization to a library-specific microarray (Berns et al., 2004; Paddison et al., 2004; Vizeacoumar et al., 2013). Pooled RNAi screens are powerful for determining the effect of gene perturbation between two cell populations. For instance, populations could have different genetic backgrounds or could have been treated with different stressors. If the depletion of a specific gene sensitizes cells in a particular condition, the shRNA sequence will drop out of the population. In contrast, if depletion of a gene promotes survival the
39
shRNA sequence will be enriched in the population. One disadvantage of pooled screening is that it relies on the quantification of cellular fitness and cannot be used for the analysis of more subtle subcellular phenotypes. The following chapter will describe an arrayed genome-scale RNAi screen that I performed utilizing an siRNA library that targets 18,452 human genes. The quantification of DNA end resection was conducted in cells treated with each library reagent using high-content analysis.
40
Chapter II
Genome-scale siRNA screen for regulators of DNA end resection
41
2.1 Statement of contributions, rights, and permissions
Thomas Sun and Alessandro Datti (LTRI robotics facility) developed liquid-handling robotic procedures to preform automated siRNA transfections and immunofluorescence. In addition, they helped in all robotic troubleshooting.
Mikhail Bashkurov (LTRI high-content screening facility) helped in all aspects of automated confocal microscopy and downstream image analysis.
42
2.2 Summary
DNA end resection is the rate-limiting step of HR. Efficient end resection promotes the assembly of a long RAD51 nucleoprotein filament which will enhance the process of sister chromatid strand invasion and homology search. End resection is also at the cross-roads of DSB repair pathway choice. Once end resection has commenced, DSB ends are no longer compatible for KU70/80 binding and re-joining by NHEJ. Therefore, the regulation of end resection initiation at DSB sites is an important mechanism to control the choice between HR and NHEJ throughout the cell cycle. The molecular details of end resection have been mostly dissected in budding yeast, leading one to speculate if additional factors are necessary in human cells. Here, I report the establishment of a cell-based assay that measures end resection in an automated and quantitative manner in human cells. I employ this assay to screen a genome-scale siRNA library that targets 18,452 genes and identify candidate end resection activators and inhibitors. The success of the screen is demonstrated by the identification of known end resection factors including all three subunits of the MRN complex and CtIP. I show that the completion of several confirmation screens allows the categorization of real end resection regulators from false- positives.
43
2.3 Introduction
My goal when I started my doctoral research was to uncover novel genes that function in DSB repair by HR, specifically at the step of DNA end resection. To achieve this goal, I devised an RNAi screen in human cells. Three cell-based assays are routinely used in the DSB repair field to measure end resection and each relies on the quantification of nuclear immunofluorescence intensity which makes them strong candidates for an arrayed high-content RNAi screen. Two of the three assays exploit the function of the RPA complex which rapidly coats ssDNA formed as a consequence of end resection at DSB sites. The first assay monitors the formation of subnuclear RPA foci by conducting RPA32 immunofluorescence (Fig. 2.1A). The second assay monitors RPA focus formation by utilizing an antibody that detects phosphorylated RPA32 (Fig. 2.2A). In response to DSBs, RPA32 becomes hyper-phosphorylated on multiple serine and threonine residues by the DNA damage responsive kinases (Brush et al., 1994; Liu and Weaver, 1993; Shao et al., 1999). RPA32 hyper-phosphorylation prevents the complex from associating with ssDNA at DNA replication forks, which may increase the free pool of RPA that is available for binding resected DSB ends (Vassin et al., 2004). The mechanistic detail of how the recruitment of hyper-phosphorylated RPA32 is inhibited at replication centers in unknown. The kinetics of RPA32 phosphorylation and dephosphorylation are also important for HR. The PP4 phosphatase complex interacts with RPA32 and can dephosphorylate it (Lee et al., 2010). PP4-depleted cells have an increase in hyper-phosphorylated RPA32 that is not bound to chromatin. The free pool of phosphorylated RPA32 sequesters RAD51 away from DSB sites, thereby inhibiting HR. The last assay measures the relative amount of ssDNA generated by end resection through native immunostaining of the thymidine analog, bromodeoxyuridine (BrdU). BrdU is added to the medium and is incorporated into the genome during DNA replication. After long term incubation of cells with BrdU, cells are treated with a DSB-inducing agent and processed for BrdU immunofluorescence. The antibody only detects BrdU in the context of ssDNA and can be used as a more direct readout of the amount of ssDNA formed by end resection (Fig. 2.3A). My first course of action was to establish these cell-based assays so that I could select the most appropriate method for a high-content RNAi screen.
44
2.4 Results
2.4.1 Establishment of immunofluorescence-based assays to monitor DNA end resection In order to conduct an RNAi screen for regulators of end resection, I first needed to establish a cell-based assay to monitor resection in human cultured cells. As discussed above, three immunofluorescence-based assays are routinely utilized in the DSB repair field to monitor resection by microscopy, including the quantification of RPA32, phospho-RPA32 (S4/S8), and BrdU focus formation. Several DSB-inducing drugs are frequently used to induce detectable amounts of end resection in human cells, including the topoisomerase I poison CPT and the radiomimetic drug NCS. CPT induces DSBs only in the S phase of the cell cycle and I surmised that it would be important in a screen to monitor end resection occurring in other cell cycle phases. In contrast, NCS indiscriminately induces DSBs in all cell cycle phases and was selected for the rest of the study. Next, to test immunostaining and NCS treatment conditions I manually seeded and reverse-transfected human bone osteosarcoma U-2 OS (U2OS) cells with control scrambled or CtIP siRNA (Fig. 2.1B, 2.2B, and 2.3B). U2OS cells were used in this study because they possess a flat morphology and are highly adherent to glass, making them an ideal model cell line for conducting automated high-throughput microscopy-based screens (Ponten and Saksela, 1967). In addition, U2OS cells are readily transfected with siRNAs and they are routinely utilized as a model cell line to study DSB repair. After 48 hours-post siRNA transfection (to allow time for target mRNA depletion), cells were treated with various concentrations of NCS for different time periods and were processed for either RPA32, phospho- RPA32 (S4/S8), or BrdU immunostaining. Next, I visualized cells on a LSM780 Zeiss confocal microscope utilizing a 60X oil immersion objective to observe NCS-induced sub-nuclear focus formation. To circumvent high levels of pan nuclear background staining with the RPA32 and BrdU antibodies, cells were pre-extracted with nuclear extraction buffer before fixation to remove any proteins that were not bound to chromatin. Decreased background staining allowed for the visualization of chromatin-associated RPA32 and BrdU foci. I then manually quantified the percentage of cells in each treatment condition that contained more than 15 RPA32, pRPA32 (S4/S8), or BrdU foci (Figure 2.1C, 2.2C, 2.3C, respectively). I selected a threshold of 15 because some treatments contained upwards of 100 foci per nucleus making it difficult to count individual foci. Furthermore, approximately 5-10 foci were present in cells not treated with NCS and in cells depleted of the core resection regulator, CtIP. Therefore, background foci are present
45
Figure 2.1. DNA end resection assay monitoring RPA32 focus formation. (A) The RPA heterotrimeric complex rapidly binds to ssDNA generated by resection at DSB ends. (B) The accumulation of RPA at DSB sites can be observed cytologically as subnuclear foci after RPA32 immunostaining. U2OS cells transfected with scrambled control or CtIP siRNAs were either mock treated (-NCS) or treated with 50 ng/ml NCS for 3 hours. Incubation with NCS was followed by pre-extraction, fixation, and RPA32 immunofluorescence. DNA was counterstained with DAPI to visualize nuclei. Scale bar represents 5 µm. (C) Manual quantification of the percentage of cells with greater than 15 RPA32 nuclear foci after incubation with various concentrations of NCS and at different time points (Mean ± Standard error of the mean (SEM); N ≥ 3).
46
Figure 2.2. DNA end resection assay monitoring pRPA32 (S4/S8) focus formation. (A) The RPA heterotrimeric complex rapidly binds to ssDNA generated by resection at DSB ends. RPA32 become hyper-phosphorylated on multiple serine and threonine residues by PIKKs. Hyper-phosphorylated RPA32 can be detected by employing a phospho-specific antibody that recognizes phosphorylated serines 4 and 8. (B) U2OS cells transfected with scrambled control or CtIP siRNAs were either mock treated (-NCS) or treated with 100 ng/ml NCS for 3 hours. Incubation with NCS was followed by fixation and pRPA32 (S4/S8) immunofluorescence. DNA was counterstained with DAPI to visualize nuclei. Scale bar represents 5 µm. (C) Manual quantification of the percentage of cells with greater than 15 pRPA32 (S4/S8) nuclear foci after incubation with various concentrations of NCS and at different time points (Mean ± SEM; N ≥ 3).
47
Figure 2.3. DNA end resection assay monitoring BrdU focus formation. (A) The generation of ssDNA at DSB sites can be directly measured by incubating cells with the thymidine analog BrdU and conducting BrdU immunostaining. The antibody can only detect BrdU in the context of ssDNA and thus is a valuable tool for monitoring end resection. (B) U2OS cells incubated with BrdU and transfected with scrambled control or CtIP siRNAs were either mock treated (-NCS) or treated with 50 ng/ml NCS for 3 hours. Incubation with NCS was followed by pre-extraction, fixation, and BrdU immunofluorescence. DNA was counterstained with DAPI to visualize nuclei. Scale bar represents 5 µm. (C) Manual quantification of the percentage of cells with greater than 15 BrdU nuclear foci after incubation with various concentrations of NCS and at different time points (Mean ± SEM; N ≥ 3).
48 in U2OS cells that are not a result of DNA end resection at DSB sites. RPA-bound ssDNA can also occur at replication forks and active sites of transcription and these background foci may be a result of these genomic processes. As expected, I observed an increase in the number of RPA32, pRPA32 (S4/S8), and BrdU foci in U2OS cells treated with NCS. These foci were largely dependent on the core resection activator CtIP and increased in number with increasing concentrations of NCS. Furthermore, the foci also increased in number with time, with maximal levels observed from 3 to 5 hours after the addition of NCS. A large population of cells in each condition were devoid of foci and this observation is consistent with the fact that end resection is actively inhibited during the G1 phase of the cell cycle (Bunting et al., 2010; Huertas and Jackson, 2009). Representative micrographs shown in Figure 2.1, 2.2, and 2.3 are of U2OS cells 3 hours post-NCS addition.
2.4.2 Quantitative image-based cytometry to monitor DNA end resection
Genome-scale RNAi screens that quantified the number and intensity of nuclear DNA damage foci required high magnification micrographs (40-60X) and many fields needed to be imaged per sample in order to acquire data on a sufficient number of cells (Kolas et al., 2007; O'Donnell et al., 2010; Stewart et al., 2009). The requirement of more fields resulted in long imaging times per plate, creating a bottle-neck in the screening pipeline. Quantitative image-based cytometry (QIBC) is a plate-based method to rapidly scan multiwell plates using low magnification (4-10X) objective lens (Toledo et al., 2013). QIBC enables users to image the entire well and thus acquire data on thousands of cells for every sample (akin to flow cytometry). Importantly, the analysis of more cells can increase the statistical quality of an RNAi screen (Birmingham et al., 2009). First, I tested whether an increase in the total nuclear RPA32, pRPA32 (S4/S8), or BrdU intensity could be detected in response to NCS using the Celigo plate cytometer (Brooks Automation), which rapidly scans plates with a 4X objective (Fig. 2.6C). U2OS cells were seeded and reverse- transfected with control or CtIP siRNAs in a 96-well plate. After 48 hours, cells were treated with NCS and processed for RPA32, pRPA32 (S4/S8), or BrdU immunofluorescence. Using pRPA32 (S4/S8) stained cells as an example, representative cytometry images are depicted in Figure 2.4A with the corresponding cell-by-cell analysis outlined in Figure 2.4B. Image analysis was conducted using software packaged with the Celigo plate cytometer. In brief, DAPI stained nuclei were segmented and the intensity of either RPA32, pRPA32 (S4/S8), or BrdU immunostaining was measured under a nuclear mask. Cell-by-cell intensities were plotted in
49
frequency distributions and in response to NCS bi-modal distributions were observed. Bi-modal distributions were expected as end resection is a cell cycle-regulated process occurring primarily in the S and G2 phases. Next, an arbitrary intensity threshold was established to determine the percentage of cells that were positive (Fig. 2.4C). The percentage of NCS-induced RPA-, pRPA32 (S4/S8)-, or BrdU-positive cells decreased substantially in cells depleted of CtIP. Furthermore, the dynamic range between the control and CtIP siRNAs was similar to that observed when manually quantifying focus formation using a 60X objective (as outlined in Fig. 2.1, 2.2, and 2.3). Therefore, QIBC is a robust method that can be utilized to more rapidly quantify end resection in U2OS cells.
As end resection assays display non-Gaussian bi-modal distributions, I decided to employ a non-parametric statistical analysis called the two-sample Kolmogorov-Smirnov (KS) test. For this test, cell-by-cell intensity data was plotted in a cumulative frequency distribution for a reference sample (i.e. siCTRL) and a test sample (i.e. siCtIP). As an example, cumulative frequency distributions of pRPA32 (S4/S8) nuclear intensities are shown in Figure 2.5A. The KS score is the maximum vertical distance between the reference and test distributions. The KS test was conducted on RPA32, pRPA32 (S4/S8), and BrdU nuclear intensity distributions from cells depleted of several known resection factors including CtIP, MRE11, RAD50, and NBS1 (Fig. 2.5B). Cumulative frequency distributions and KS tests were conducted using MATLAB (The MathWorks Inc.) in collaboration with Mikhail Bashkurov in the Lunenfeld-Tanenbaum Research Institute’s (LTRI) high-content screening facility.
2.4.3 Automating DNA end resection assays using liquid-handling robotics
With the establishment of cell-based end resection assays, the next step towards the commencement of an RNAi screen was to test whether siRNA transfections and immunostaining procedures could be carried out in an automated manner using liquid-handling robotics. For this I initiated a collaboration with Alessandro Datti and Thomas Sun in the LTRI’s robotics facility. We developed a protocol on the 96 tip Biomek FX (Beckman-Coulter; Fig. 2.6A) liquid handler to seed and transfect U2OS cells with siRNAs in 384-well plates. In addition, we programmed a procedure to conduct immunostaining on the Dimension 4 robotic platform (Fig. 2.6B) utilizing various peripheral instruments including a Biomek FX liquid handler (Beckman-Coulter) and an Embla plate washer (Molecular Devices). To test our protocol, we seeded and transfected U2OS
50
Figure 2.4. Measuring end resection by quantitative image-based cytometry. (A) U2OS cells seeded in a 96-well plate and incubated with scrambled control or CtIP siRNA were treated with 100 ng/ml NCS for 3 hours before pRPA32 (S4/S8) immunofluorescence. DNA was counterstained with DAPI to visualize nuclei. QIBC using the Celigo plate cytometer and image analysis software was conducted to segment DAPI-stained nuclei and measure the mean pRPA32 (S4/S8) nuclear intensity. Scale bar represents 200 µm. (B) Cell-by-cell pRPA32 (S4/S8) nuclear intensities were plotted in frequency distributions. (C) The percentage of NCS-induced RPA32-, pRPA32 (S4/S8)-, and BrdU-positive nuclei was determined by setting an arbitrary intensity threshold (Mean ± SEM; N ≥ 3).
51
Figure 2.5. Application of the Kolmogorov-Smirnov test to analyze end resection. (A) Cell-by-cell pRPA32 (S4/S8) mean nuclear intensity were plotted in a cumulative frequency distribution. The Kolmogorov-Smirnov (KS) score was determined by measuring the maximal vertical distance between the reference (siCTRL) and test distributions (siCtIP). (B) KS scores were calculated for RPA32, pRPA32 (S4/S8), and BrdU immunostaining and multiple control siRNAs (Mean ± SEM; N ≥ 3).
52
cells with control, CtIP, MRE11, or NBS1 siRNA in 384-well plates using the Biomek FX. After 48 hours to allow for target depletion, the plates were loaded onto the Dimension 4 platform for the addition of NCS and subsequent RPA32, pRPA32 (S4/S8), or BrdU immunostaining. The cells were then imaged on the Celigo for QIBC (Fig. 2.6C). The RPA32 and BrdU immunostaining procedures required a pre-extraction with mild detergent before fixation. Unfortunately, after testing various washing pressures with the Embla plate washer, we were unable to find a condition that did not remove pre-extracted cells from the 384-well plates. In contrast, pRPA32 (S4/S8) immunofluorescence did not require a pre-extraction and these cells adhered very well to the plates. A heat map of pRPA32 (S4/S8) nuclear intensity KS scores for the test plate are depicted in Figure 2.6D. KS scores were similar to those observed when I conducted the transfection and immunostaining procedures manually, demonstrating that the assay can be effectively automated using liquid handling robotics. Furthermore, I also observed an excellent dynamic range between the negative and positive control siRNAs. At this stage of the project, I decided to commence a genome-scale siRNA screen that monitors pRPA32 (S4/S8) nuclear intensity in response to NCS treatment.
2.4.4 Genome-scale siRNA screen utilizing a pooled siRNA library
Using the automated end resection assay outlined above, I screened the genome-scale SMARTpool siRNA library (Dharmacon/GE Healthcare) that targets 18,452 genes (Fig. 2.7A). For each gene, the library contains a pool of four distinct siRNAs that hybridize to different regions of the target transcript. Each 384-well screening plate contained both negative (scrambled CTRL siRNA) and positive (CtIP siRNA) controls within the outside columns. After imaging and raw cell-by-cell nuclear pRPA32 (S4/S8) intensities were determined, Mikhail Bashkurov calculated the KS score for each library siRNA by comparing it to a scrambled control siRNA sample on the same plate. I set an arbitrary cut-off for hit identification at a KS score of -19 for resection activators because it included many of the known activators as hits and limited the list to a manageable ~2.5% of the data set (451/18,452). Known end resection activators, including all three subunits of the MRN complex and CtIP were identified as hits in the screen (Fig. 2.7B). In addition, several kinases responsible for RPA32 hyper- phosphorylation, DNA-PKcs and ATR, were also identified as hits. EXO1 and DNA2 are both 5’ to 3’ exonucleases known to carry out long-range end resection but were not identified as hits. I hypothesized that these two nucleases may be able to functionally compensate in long-range
53
Figure 2.6. Automating DNA end resection assays using liquid-handling robotics. (A) BioMek FX (Beckman-Coulter) is a 96 tip liquid dispensing robot and was used to deliver siRNA complexes, NCS, and immunostaining reagents to 384-well screening plates. (B) The Dimension 4 robotic platform facilitated the systematic use of multiple peripheral instruments including a temperature- and carbon dioxide-controlled incubator, a plate washer, and a BioMek FX liquid handler. The platform enabled immunostaining procedures to run on multiple 384-well screening plates at the same time. (C) Celigo plate cytometer (Brooks Automation) which rapidly scans multiwell plates using a 4X objective lens. (D) Heat-map of a 384-well plate illustrating the pRPA32 (S4/S8) nuclear intensity KS score for U2OS cells incubated with various control siRNAs. U2OS cells were seeded and transfected with siRNAs using the BioMek FX. Cells were treated with NCS followed by pRPA32 (S4/S8) immunofluorescence using the Dimension 4 platform. QIBC was conducted using the Celigo plate cytometer and image analysis software. KS scores were calculated using a script generated in MATLAB (Mean ± SEM; N = 2).
54
Figure 2.7. Genome-scale siRNA screen for regulators of DNA end resection. (A) Scatter plot depicting the NCS-induced pRPA32 (S4/S8) nuclear intensity KS scores for 18,452 siRNA pools. (B) KS scores from the RNAi screen for known end resection activators. (C) U2OS cells seeded in a 96-well plate and incubated with various combinations of scrambled control, EXO1, or DNA2 siRNA. Cells were treated with 100 ng/ml NCS for 3 hours before pRPA32 (S4/S8) immunofluorescence. DNA was counterstained with DAPI to visualize nuclei. QIBC using the Celigo plate cytometer and image analysis software was conducted to segment DAPI-stained nuclei and measure the pRPA32 (S4/S8) nuclear intensity. KS scores were calculated using a script generated in MATLAB. (N = 3) (D) KS scores from the RNAi screen for known end resection inhibitors.
55
resection when the other is lost. To test this, I manually co-depleted both EXO1 and DNA2 with siRNAs and observed a dramatic decrease in end resection compared to cells treated with either EXO1 or DNA2 siRNA alone (Fig. 2.7C). Next, I set the cut-off for putative resection inhibitors at a KS score of +13 which also corresponded to ~2.5% of the data set (494/18,452). In contrast to resection activators, the screen was not as successful at identifying known resection inhibitors (Fig. 2.7D). The helicase HELQ was identified as an end resection inhibitor. However, HELQ is known to promote repair of replication-associated DSBs by HR (Adelman et al., 2013; Takata et al., 2013). HELQ functions at collapsed replication forks to promote HR downstream of RAD51 loading onto resected ssDNA. Cells deficient in HELQ have an increased accumulation of γH2AX, RPA32, and RAD51 foci in response to agents that induce replication stress. These foci persist for longer time periods in cells lacking HELQ, demonstrating a defect in the repair of replication-associated DSBs. The increase in NCS-induced hyper-phosphorylation of RPA32 observed in the RNAi screen for HELQ depleted U2OS cells is consistent with these data.
Next, I conducted pathway enrichment analysis for the identified candidate resection activators utilizing software available through Qiagen called Ingenuity Pathway Analysis (IPA). IPA is a rigorously updated database (called the Ingenuity Knowledge Base) that catalogs human genes into functional cellular processes. The list of 451 candidate resection activators was imported to IPA and pathway enrichment P values were determined using the Fisher’s exact test (Fig. 2.8). As expected, genes functionally implicated in DSB repair by HR were enriched in the data set. As end resection is a cell cycle-regulated process, it was not surprising that genes involved in cell cycle control were also enriched.
2.4.5 Secondary confirmation screen utilizing cherry-picked siRNA pools
RNAi screens are known to elicit a high number of false-positive hits (Echeverri et al., 2006). One potential source of false-positives in my screen were siRNAs that could alter the normal cell cycle profile of U2OS cells. Changes in cell cycle progression would impact the quantification of end resection, as it is a cell cycle-regulated process, occurring in only S and G2 cells. Therefore, to normalize any differences in cell cycle progression, I established a secondary confirmation assay employing the fluorescent ubiquitylation-based cell cycle indicator (FUCCI) system to measure end resection in only S and G2 cells (Sakaue-Sawano et al., 2008). Stable FUCCI cell lines express fragments of the cell cycle-regulated proteins Geminin (S/G2/M phases) and CDT1
56
Figure 2.8. Pathway enrichment analysis for candidate resection activators. Pathway enrichment analysis was conducted for the identified candidate resection activators using Ingenuity Pathway Analysis (IPA). IPA is a rigorously updated database that categorizes human genes into functional cellular processes. The list of 451 candidate resection activators was imported to IPA and pathway enrichment P values were determined using the Fisher’s exact test. The percentage of candidate resection activator genes that overlap with the list of genes for each functional process is depicted by the bar graph.
57
(G1 phase) that are fused to a green fluorescent protein (mAG) and a red fluorescent protein (mKO), respectively. Cells that express both reporters (and are yellow) are considered to be at the G1/S transition (Fig. 2.9A).
As a first line of attack, I focused my attention on confirming candidate resection activators. We cherry-picked 360 of the top activators for a confirmation screen but excluded annotated ribosomal, proteasomal, or solute carrier proteins (-40 candidate activators). These genes were excluded because I wanted to focus my attention on candidates that may have a more direct function in promoting end resection. Briefly, U2OS FUCCI cells were seeded into 384- well plates and transfected with the 360 cherry-picked siRNAs from the SMARTpool library (Dharmacon/GE Healthcare). Two days post-transfection the cells were treated with NCS for 3 hours and then immunostained for pRPA32 (S4/S8) using a secondary antibody conjugated to the far red fluorophore, Alexa647. DNA was counterstained with DAPI followed by 4-channel QIBC using the 10X objective of the InCell 6000 automated confocal microscope (GE Healthcare). Images were analyzed by segmenting DAPI-stained nuclei and identifying S and G2 phase cells (Geminin-mAG-positive cells) using an intensity threshold. As expected, there was an enrichment (141/360) of candidate resection activator siRNA pools that decreased the proportion of cells in S and G2 phases of the cell cycle (Fig. 2.9A). Of these 141 siRNA pools, 82 did not elicit any detectable end resection defect. Therefore, cell cycle position was a large source of false-positive hits in the primary screen. In addition, the percentage of pRPA32 (S4/S8)-positive S and G2 cells was determined by setting an intensity threshold. Importantly, there were still many candidate resection activators, that when depleted, decreased pRPA32 (S4/S8) nuclear intensity in only S and G2 phase cells. This list of activators included CtIP and all three subunits of the MRN complex. A cut-off of 0.8 for the relative percentage of pRPA32- positive cells was set to identify hits. I chose this cut-off because it included many of the known resection activators and resulted in a manageable list of 154 confirmed hits.
2.4.6 Re-screening deconvolved siRNAs for the top resection activator candidates
It has been well documented that siRNAs have the propensity to bind off target mRNAs and therefore elicit the down-regulation of multiple genes (Sigoillot et al., 2012). It is likely that some of the 154 candidate resection activators are false positives due to off target mRNA down- regulation. Therefore, I ordered deconvolved siRNAs for the top 70 pools (of 154) identified in
58
Figure 2.9. Secondary confirmation screen utilizing the fluorescent ubiquitylation-based cell cycle indicator (FUCCI) system. (A) Model of the FUCCI system and images of a QIBC end resection assay in U2OS FUCCI cells treated with 100 ng/ml NCS for 3 hours. Scale bar represents 200 µm. (B) Secondary confirmation screen for cherry-picked siRNA pools targeting the top 360 candidate resection activators identified in the primary screen.
59
the secondary screen and re-screened these duplexes individually. I ranked the 70 candidate resection activators based on the number of duplexes that caused a decrease in the percentage of pRPA32 (S4/S8)-positive S and G2 cells by more than 20% relative to the control siRNA (Fig.2.10A). Candidate activators where four out of four duplexes resulted in a pRPA32 (S4/S8) reduction were ranked highest. Within the list of highest ranking candidates was CtIP where all four duplexes decreased pRPA32 (S4/S8) nuclear intensity in S and G2 cells. All three subunits of the MRN complex had at least three of the four duplexes that decreased end resection. The incidence of siRNA off target effects in the screen was likely high as 34.3% of the top 70 candidate activators only had one siRNA duplex that decreased end resection (Fig. 2.10B). It is also possible, albeit unlikely, that only one of the four siRNA duplexes resulted in the appropriate level of knockdown to elicit a detectable resection defect. The 70 candidate resection activators and how many of the four siRNAs caused a decrease in end resection below the 0.8 cut-off are listed in Table 2.1.
2.5 Discussion
In this chapter, I described the establishment of an automated DNA end resection assay in human cells and the utilization of this assay to screen a genome-scale siRNA library. Overall, the screen was successful as evidenced by the identification of known resection activators including CtIP, MRE11, RAD50, NBS1, RPA32, RPA70, WRN, POLE3 (CHRAC17), and SRCAP (Dolganov et al., 1996; Dong et al., 2014; Featherstone and Jackson, 1998; Lan et al., 2010; Petrini et al., 1995; Sartori et al., 2007; Sturzenegger et al., 2014). In contrast, the identification of known end resection inhibitors including 53BP1, PTIP, RIF1, MAD2L2, HELB, and PIN1 was not successful (Boersma et al., 2015; Bunting et al., 2010; Chapman et al., 2013; Di Virgilio et al., 2013; Escribano-Diaz et al., 2013; Steger et al., 2013; Tkac et al., 2016; Xu et al., 2015; Zimmermann et al., 2013). One possible explanation for this discrepancy was that the dose of NCS used in the screen was too high and resulted in a level of end resection that was at the maximum detectable limit. Being at or close to the detection limit would make it difficult to observe increases in pRPA32 (S4/S8) nuclear intensity after siRNA-mediated knockdown. Therefore, I decided to focus my attention on the identified candidate resection activators.
Several known end resection activators were not identified in the primary screen including EXO1, DNA2, BLM, EXD2, SMARCAD1, SIRT6, RNF4, and RNF138 (Costelloe et
60
Figure 2.10. Re-screening deconvolved siRNA pools for top candidate resection activators. (A) The siRNA pools for the top 70 candidate resection activators identified in the secondary screen were deconvolved and the four distinct siRNA duplexes were transfected individually. U2OS FUCCI cells incubated with the deconvolved siRNAs were treated with 100 ng/ml NCS for 3 hours followed by pRPA32 (S4/S8) immunofluorescence and QIBC. The results are displayed as a heat-map where the rows represent each candidate resection activator and the columns represent the four individual siRNA duplexes. Candidate resection activators that had a greater number of siRNA duplexes that decreased the percentage of pRPA32 (S4/S8)-positive S/G2 cells were ranked highest. (B) Breakdown of how many siRNA duplexes for each candidate had at least a 20% decrease in the percentage of pRPA32 (S4/S8)-positive S/G2 cells compared to the control siRNA.
61
4 of 4 3 of 4 2 of 4 1 of 4 0 of 4 IK USPL1 SESN2 ACSM1 CAPN15 ZNF335 SF3B3 CCNC CCNI MX2 SLU7 INSM1 CREB3L4 TRIM64C RPAP2 SF3B2 DNA-PKcs POM121L2 C2ORF69 LEO1 CtIP TP53I13 TNKS1BP1 MEF2D NAA10 RAD50 MS4A7 IL27RA CDC40 WRN C11ORF35 AKNAD1 NHP2L1 MRE11 GOSR1 HIVEP1 ZNF771 DDX25 C9ORF152 NME6 ATRIP KLF17 LRRC61 ZNF32 EIF4A3 NBS1 RNF169 FAM216A C9ORF106 PHF5A POLE3 NDN RPA32 ZCCHC10 BCAS2 SRCAP SMARCE1 PSMD14 MAP1S ZNF821 C9ORF156 TRIM11 RBBP9 ASCC1 ANKRD16 FKBP10 FAM107B LEPRE1 RAB6B ZNF512B
Table 2.1. Number of candidate resection activator deconvolved siRNAs that decreased DNA end resection in S and G2 phase U2OS cells. Genes in green font are known regulators of DSB repair. Genes in orange font are characterized messenger RNA processing factors.
62
al., 2012; Galanty et al., 2012; Gravel et al., 2008; Ismail et al., 2015; Kaidi et al., 2010; Nimonkar et al., 2011; Nimonkar et al., 2008; Schmidt et al., 2015; Broderick et al., 2016). My results suggest that EXO1 and DNA2 were not identified as hits because the nucleases act redundantly and each can functionally compensate for the loss of the other. It is possible that the other known activators were not identified because of poor knockdown efficiency or they elicited moderate resection defects that did not make the hit cut-off. To confirm the resection activators identified in the screen, I cherry-picked siRNA pools for the top hits and re-screened them in a cell cycle phase-specific resection assay utilizing the FUCCI system. A large proportion of the cherry-picked siRNA pools elicited an accumulation of cells in G1 and did not cause a detectable defect in end resection. The high occurrence of false-positive hits due to the cell cycle may have led to real resection activators not being identified. For example, known end resection activators like SMARCAD1 and RNF138 had KS scores of -15.9 and -8.2, respectively, and may have been identified as hits if a lower false-positive rate was achieved. In the future, more candidate activators outside of the -19 KS score cut-off could be re-screened utilizing the FUCCI system.
I also screened deconvolved siRNAs for the top candidate resection activators confirmed in the secondary screen. There was likely a high degree of siRNA off-target effects in the primary screen as ~35% of the candidate resection activators demonstrated end resection defects with only one out of the four deconvolved siRNAs. Several groups have documented the high degree of off-target transcript binding that can occur in RNAi screens (Paulsen et al., 2009; Sigoillot et al., 2012; Sudbery et al., 2010). A bioinformatics study by Sigoillot et al. (2012) described a method to analyze the seven base pair seed sequences of the top scoring siRNAs in a screen to determine if there is enrichment in the binding to any specific transcript. The method was called genome-wide enrichment of seed sequence matches (GESS) and MAD2L1 was identified as a prominent off-target transcript in an RNAi screen for genes required for the spindle assembly checkpoint (Sigoillot et al., 2012). GESS analysis was also conducted for an RNAi screen searching for new regulators of DSB repair and identified an enrichment of siRNA seed sequences that corresponded to the RAD51 transcript (Adamson et al., 2012). Future work should focus on conducting the GESS analysis on the resection activators identified in my RNAi primary screen. It will be interesting to determine if there is an enrichment of siRNA seed sequences that can target known resection activators.
63
The deconvolved screen identified 42 high-confidence candidate resection activators where at least two of the four siRNA duplexes resulted in an end resection defect (Table 2.1). Remarkably, 11 of these resection activators are already known to function in DSB repair. Other interesting functional categories in this list included RNA splicing proteins and long non-coding RNAs (lncRNAs). Proteins containing zinc finger domains were also evident among the candidate resection activators and included ZNF335, ZNF771, INSM1, KLF17, ZCCHC10, and ZNF821. Zinc finger domains can bind to DNA and numerous characterized DSB repair proteins are known to harbour this domain (Table 3.1). ZNF335 immediately caught my attention as it was found to be mutated in a rare genetic syndrome that displays neonatal microcephaly (Yang et al., 2012). Microcephaly is a common clinical feature of patients harbouring mutations in DSB repair genes. The next chapter will focus on the functional characterization of ZNF335 in the promotion of end resection and HR.
64
Chapter III
The zinc finger protein, ZNF335, promotes DNA end resection
65
3.1 Statement of contributions, rights, and permissions
Zhen-Yuan Lin in Anne-Claude Gingras’ laboratory (LTRI) conducted all immunoprecipitation coupled to mass spectrometry experiments.
66
3.2 Summary
Numerous genome instability syndromes have been documented to display neonatal microcephaly as a clinical feature. The ZNF335 gene was recently found to be mutated in a syndrome that causes some of the worst cases of microcephaly ever documented. In addition, ZNF335 was consistently one of the highest scoring candidate resection activators identified in the RNAi screen outlined in Chapter II. Here, I validate the results of the screen and demonstrate that ZNF335 promotes DNA end resection. I uncover that ZNF335-deficient cells are sensitive to agents that induce DSBs and that ZNF335 can promote DSB repair by HR. I show that the four C-terminal zinc finger domains of ZNF335 are required for its function in end resection. In addition, I show that ZNF335 is recruited to sites of DNA damage generated by laser microirradiation and this accumulation is short-lived and dependent on the activity of PARP. However, I provide evidence that PARP activity is not required for end resection. This data suggests that ZNF335 recruitment to sites of laser microirradiation is not necessary for its function in promoting end resection. Lastly, I show that ZNF335 does not regulate the expression or protein stability of the core end resection factors. I conclude that ZNF335 is a new factor that can promote end resection and HR.
67
3.3 Introduction
Zinc finger protein domains are found throughout evolution, from bacteria to humans. Approximately 3% of the human genome encodes proteins that contain zinc finger domains and most of these proteins are completely uncharacterized. Zinc fingers are small self-contained domains that are stabilized by one or more zinc ions and have the capacity to bind both nucleic acids (RNA and DNA) and proteins (Laity et al., 2001). For nucleic acid binding, zinc fingers are usually present as repetitive nucleotide-binding modules giving them the unique ability to specifically bind longer stretches of nucleotides. Numerous zinc finger domain types have been discovered but here I will focus on the classical 22-25 amino acid C2H2 finger which is stabilized by a single zinc ion bound to a pair of cysteines and a pair of histidines. The two cysteines and two histidines are fundamental for zinc binding and proper folding of the zinc finger (Lee et al., 1989). In addition, C2H2 zinc fingers contain three other conserved amino acids in the region between the last cysteine and first histidine. These three amino acids (tyrosine, phenylalanine, and leucine) form a hydrophobic structural core which is also crucial for the folding of the zinc finger module (Lee et al., 1989). The structure of a single zinc finger consists of an antiparallel β-sheet with a loop formed by the two cysteines and an α-helix with a loop formed by the two histidines (Pavletich and Pabo, 1991). These two structural units are held together by the zinc ion. The zinc finger binds DNA through its α-helix where it forms hydrogen bonds at helical positions -1, 3, and 6 to a triplet sequence of nucleotides on one strand of DNA (Pavletich and Pabo, 1991). Later it was found that helical position 2 can also interact with a nucleotide on the opposite strand of DNA (Fairall et al., 1993). Collections of zinc finger mutants have been generated and aided in the formulation of rules that related particular amino acids (at helical positions -1, 2, 3, and 6) to four corresponding nucleotides (Klug, 2010). However, the rules relied on DNA being in the canonical B form and strict adherence to the rules did not always correlate with successful binding to other DNA sequences. A more powerful method to engineer zinc finger domains that can bind specific DNA sequences is to utilize affinity selection from libraries of zinc finger mutants by phage display (Choo and Klug, 1994).
Zinc fingers are the most common domain in the entire human proteome and thus have been implicated in a diverse set of cellular processes including transcription, mRNA processing, translation, protein-protein interactions, and post-translational modifications (Laity et al., 2001). It is not surprising that numerous human developmental disorders and disease states have been
68
linked to zinc finger protein dysfunction. Importantly, many DSB repair factors employ zinc finger domains to carry out their function at sites of DNA damage. These domains facilitate several functional properties to promote DNA repair including DNA binding, poly-ADP ribose (PAR) binding, protein-protein interactions, and substrate SUMOylation and ubiquitylation. An outline of DSB repair factors that possess zinc finger domains can be found in Table 3.1.
Several candidate resection activators identified in the RNAi screen (outlined in Chapter II) harbour zinc finger domains, including ZNF335. The human ZNF335 gene is located on chromosome 20 and contains 28 exons that encode a 1,342 amino acid protein. ZNF335 appears to be vertebrate-specific and contains 13 C2H2-type zinc finger domains that are dispersed throughout the protein (see protein schematic in Figure 3.7A). ZNF335 was first identified as a coactivator of nuclear hormone receptor signaling through an interaction with nuclear receptor coregulatory (Mahajan et al., 2002b). Nuclear hormone receptors are ligand-dependent transcription factors that control gene expression programs for numerous physiological, developmental, and metabolic processes (Aranda and Pascual, 2001). More recently, the ZNF335 gene was found to be mutated in a rare genetic syndrome that causes one of the most severe cases of microcephaly ever documented (Yang et al., 2012). The syndrome was reported in a large consanguineous Arab Israeli pedigree where seven individuals displayed neonatal microcephaly with head circumferences as small as nine standard deviations below the mean. Mapping using single-nucleotide polymorphism arrays identified a single 2 megabase region that was homozygous in all affected pedigree members. Sequencing of this region determined the presence of 40 genes but only one homozygous nonsynonymous change – a G to A transition at nucleotide position 3332 in the coding sequence of the ZNF335 gene. The c.3332g>a mutation resulted in an amino acid change from an arginine to a histidine at position 1111 which is located in the final (thirteenth) zinc finger domain. The mutation is also located at the final position of the splice donor site for exon 21 and resulted in the accumulation of a larger transcript due to intron retention. RNA-sequencing experiments determined that the large transcript retains the two introns flanking exon 21. Patient cells also expressed messenger RNA of the expected size which suggested that some normal splicing did occur. Immunoblotting of whole cell extracts from patient lymphoblast cells showed severely reduced ZNF335 protein at the expected size which likely resulted from translation of normally spliced messenger RNA. The antibody used by Yang et al. (2012) was produced by injecting rabbits with a peptide
69
DSB repair factor Zinc finger DSB Function Reference type repair pathway PARP-1 PARP-type HR, NHEJ Early recruitment of several DSB repair (Beck et al., factors (i.e. APLF) 2014) APLF PBZ-type NHEJ Nuclease component of DNA ligase IV (Iles et al., complex 2007) KAT5 (TIP60) C2HC-type HR, NHEJ Acetylates ATM after DNA damage to (Ikura et al., promote repair and checkpoint 2000) activation TOP3A GRF-type HR Component of BLM complex that (Wu and promotes Holliday junction dissolution Hickson, 2003) INO80B HIT-type HR, NHEJ Component of chromatin remodeling (Morrison et al., complex that relaxes chromatin 2004) structure at DSB sites ZNHIT1 HIT-type HR Component of SRCAP chromatin (Dong et al., remodeling complex that promotes end 2014) resection PIAS1 MIZ-type HR, NHEJ E3 SUMO ligase that promotes (Galanty et al., RNF8/RNF168-dependent 2009) ubiquitylation at DSB sites PIAS4 MIZ-type HR, NHEJ E3 SUMO ligase that promotes (Galanty et al., RNF8/RNF168-dependent 2009) ubiquitylation at DSB sites HERC2 ZZ-type HR, NHEJ HECT E3 ubiquitin ligase that (Bekker-Jensen promotes RNF8/RNF168-dependent et al., 2010) ubiquitylation at DSBs TRIM28 (KAP-1) PHD-type HR, NHEJ Important for ATM-dependent DSB (Ziv et al., repair in heterochromatin 2006) BRCA1 RING-type HR Critical factor involved in resection and (Huen et al., RAD51 loading 2010) RNF4 RING-type HR (SUMO)-targeted E3 ubiquitin ligase (Galanty et al., that regulates RPA turnover at resected 2012; Yin et al., DSBs 2012) RNF8 RING-type HR, NHEJ E3 ubiquitin ligase that promotes the (Huen et al., recruitment of critical repair factors to 2007; Kolas et DSB sites al., 2007; Mailand et al., 2007) RNF168 RING-type HR, NHEJ E3 ubiquitin ligase that promotes the (Doil et al., recruitment of critical repair factors to 2009; Stewart et DSB sites al., 2009) RNF138 RING-type HR E3 ligase that ubiquitylates CtIP and (Ismail et al., promotes its recruitment to DSB sites 2015; Schmidt et al., 2015)
Table 3.1. DSB repair proteins harbouring zinc finger domains.
70
corresponding to the final 42 C-terminal amino acids of ZNF335. Intron retention before and after exon 21 would result in a premature stop codon and a protein product that does not contain the C-terminal region that the antibody was raised against. Therefore, it is possible that this truncated ZNF335 protein accumulated in patient cells but was simply not detected by the authors.
After characterizing the mutation in human ZNF335, Yang et al. (2012) engineered null Znf335 mutations in mice and determined that homozygous loss of Znf335 led to early embryonic lethality at day 7.5 (E7.5). The essential requirement of Znf335 in mice pointed to the hypothesis that the human c.3332g>a mutation may be hypomorphic as affected patients were born to term, albeit with severe clinical symptoms including microcephaly and small birth weight and length. The potential hypomorphic nature of this mutation could be explained by the presence of a small amount of normally sized R1111H-mutated ZNF335 protein that was detected in patient cells. Conditional knockdown of Znf335 in the developing mouse brain caused defects in neural progenitor proliferation (Yang et al., 2012). Furthermore, the low number of Znf335-depleted neurons that did form in the cerebral cortex had abnormal neuronal morphology including small cell bodies and a lack of vertical apical dendritic processes. Importantly, these neuronal phenotypes were also observed post-mortem in affected human patients. Immunoprecipitation coupled to mass spectrometry (IP-MS) uncovered that human ZNF335 interacted with components of the trithorax group of proteins including MLL, SETD1A, ASH2L, RBBP5, and WDR5. The trithorax complex functions in histone methylation and transcriptional activation (Schuettengruber et al., 2011). Chromatin immunoprecipitation followed by next-generation sequencing in early mouse embryos determined that ZNF335 bound to many gene promoters and promoted H3K4 methylation (Yang et al., 2012). ZNF335 was associated with the promoter of the repressor element 1 (RE1)-silencing transcription factor (REST) gene which is a critical epigenetic regulator of neurogenesis. REST acts as a transcriptional repressor by recruiting histone deacetylases (HDACs) and is expressed in neural stem cells to maintain progenitor cell fate and inhibit differentiation into neurons (Ballas et al., 2005). The expansion and maintenance of neural progenitor cells is critical for later neurogenesis. Yang et al. (2012) surmised that ZNF335 could function upstream of REST by promoting its expression. Importantly, it was not ruled out that neural progenitor cells lacking ZNF335 could have cell proliferation defects due to other factors including enhanced apoptosis
71 or activation of cell cycle checkpoints. Proliferation defects due to aberrant mitosis or excessive DNA damage are of particular interest because most genetic syndromes that display neonatal microcephaly are caused by mutations in genes regulating centrosome and DNA repair biology (Thornton and Woods, 2009). In the following study, I demonstrate that ZNF335 promotes DNA end resection and DSB repair by HR. I argue that the function of ZNF335 in DSB repair may contribute to the severe microcephaly observed in patients with the c.3332g>a mutation.
3.4 Results
3.4.1 Analysis of four siRNA duplexes targeting the ZNF335 messenger RNA
The four ZNF335 siRNAs used in this study have identical sequences to those used in the deconvolved confirmation screen. I first determined the efficiency of each siRNA duplex in depleting the ZNF335 transcript and protein in U2OS cells. I conducted reverse transcription and subsequent quantitative PCR using RNA extracted from U2OS cells incubated with ZNF335 siRNAs. Quantitative PCR was conducted with a primer and probe set specific for ZNF335 (Solaris/GE Healthcare). The relative abundance of ZNF335 transcript in each sample was determined by normalizing to the abundance of the reference gene GAPDH. Compared to cells treated with the scrambled control siRNA, the abundance of ZNF335 messenger RNA was reduced by at least 2-fold with 3 out of the 4 siRNAs (Fig. 3.1A). Knockdown efficiency was confirmed at the protein level by western blot analysis utilizing a commercial ZNF335 antibody that targets the C-terminus (Fig. 3.1B). I then conducted a growth rate analysis for U2OS cells treated with the four ZNF335 siRNA duplexes. U2OS cells were reverse-transfected in 6-well plates and the number of cells was manually quantified at various time points (Fig. 3.1C). The growth of U2OS cells incubated with ZNF335 siRNAs correlated with the observed knockdown efficiency. Next, I determined the cell cycle profile of U2OS cells treated with the siRNAs utilizing propidium iodide (PI) staining and flow cytometry (Fig. 3.1D). The cell cycle profile of cells treated with the control siRNA and those treated with ZNF335 siRNA #1 and 3 were similar. Cells incubated with siRNA #2 had a small decrease in the proportion of cells in G1 whereas siRNA #4 had a small increase. Overall, these small differences in cell cycle position cannot account for the end resection defect observed for ZNF335 knockdown in the RNAi screen.
72
Figure 3.1. Knockdown efficiency, growth, and cell cycle position analysis for siRNA duplexes targeting ZNF335 messenger RNA. (A) U2OS cells were transfected with the indicated siRNAs and incubated for 48 hours. Total RNA was extracted from cells and cDNA was synthesized by reverse transcription. Quantitative real-time PCR was conducted using Solaris primers and probe (GE Healthcare) specific to ZNF335 and the reference gene GAPDH (Mean ± SEM; N ≥ 3). (B) U2OS cells incubated with siRNAs targeting ZNF335 were processed for immunoblotting using a ZNF335 specific antibody. (C) U2OS cells incubated with ZNF335 siRNAs were counted at various time points post- transfection to assess their growth rates. (D) Cell cycle profiles for U2OS cells incubated with each ZNF335 siRNA duplex. Cell cycle position was determined by measuring DNA content by staining nuclei with propidium iodide before flow cytometry analysis.
73
3.4.2 ZNF335 depleted cells have defective pRPA32 (S4/S8), RPA32, and BrdU focus formation
I confirmed the results obtained from the screen by monitoring pRPA32 (S4/S8) focus formation in U2OS FUCCI cells depleted of ZNF335 (Fig. 3.2A). U2OS FUCCI cells were seeded and reverse-transfected with control or ZNF335 siRNA. After 48 hours to allow time for knockdown, cells were treated with 100 ng/ml NCS for 3 hours and then processed for pRPA32 (S4/S8) immunofluorescence. Compared to the control, knockdown of ZNF335 resulted in less pRPA32 (S4/S8) foci per nucleus in S and G2 phase cells. The defect in RPA32 phosphorylation could also be detected by western blot analysis (Fig. 3.2D). To demonstrate that ZNF335 knockdown specifically decreased end resection and not just RPA32 phosphorylation I also processed cells for RPA32 and BrdU immunofluorescence (Fig. 3.2B,C). The depletion of ZNF335 decreased both RPA32 and BrdU focus formation compared to the control siRNA.
3.4.3 U2OS cells depleted of ZNF335 are sensitive to DSB-inducing agents
The observation that ZNF335 is important for end resection (and possibly DSB repair) led me to hypothesize that cells lacking ZNF335 may be more sensitive to incubation with agents that induce DSBs. To test this, I conducted clonogenic survival assays (Franken et al., 2006) on U2OS cells treated with control, CtIP, or ZNF335 siRNAs. After 48 hours of siRNA incubation, I sparsely seeded multiple densities of the siRNA-treated cells into 6-well plates and then exposed the cells (for 1 hour) to various drugs that cause DSBs, including NCS, ETOP, and CPT. I allowed individual colonies to grow for two weeks and then fixed and stained them with methanol and crystal violet, respectively. First, I calculated the plating efficiency of cells treated with each of the siRNAs (and no DSB-inducing drugs) using the following formula: