Fine-Tuning the Chromosome Ends the Last Base of Human Telomeres
Total Page:16
File Type:pdf, Size:1020Kb
[Cell Cycle 4:11, 1467-1470, November 2005]; ©2005 Landes Bioscience Extra View Fine-Tuning the Chromosome Ends The Last Base of Human Telomeres Agnel J. Sfeir ABSTRACT Jerry W. Shay Telomeres protect chromosomes from degradation and loss of vital sequence, block end-end fusion, and allow the cell to distinguish between broken ends and chromosome Woodring E. Wright* ends. Mammalian telomeres end in single-stranded (TTAGGG)-rich 3'-overhangs. that are tucked back into the preceding double stranded region to form a T-loop. The end structure Department of Cell Biology; University of Texas Southwestern Medical Center; Dallas, Texas USA of mammalian telomeres has just started to be elucidated and through this extra views we highlight one aspect of that structure. We have recently identified the terminal nucleotides *Correspondence to: Woodring E. Wright; Department of Cell Biology; UT of both the C-rich and G-rich telomere strands in human cells and showed that ~80% of Southwestern Medical Center; 5323 Harry Hines Boulevard; Dallas, Texas 75390- 9039 USA; Tel.: 214.648.2933; Email: [email protected] the C-rich strands terminate precisely in ATC-5', while the last base of the G-strand is less precise. This finding has important implications for the processing events that act on the Received 09/07/05; Accepted 09/08/05 telomere ends post-replication. While the mechanism behind this phenotype is yet to be Previously published online as a Cell Cycle E-publication: unraveled, we discuss potential models that could explain the last base specificity. http://www.landesbioscience.com/journals/cc/abstract.php?id=2161 KEY WORDS Mammalian telomeres consist of many kilobases of 5'-TTAGGG/3'-AATCCC DNA repeats that are bound by specialized proteins forming a unique end-structure.1 The G-rich telomere, end-replication problem, STELA, pot1, T DISTRIBUTE strand (TTAGGG) extends beyond the double stranded region to form a single stranded terminal nucleotide 3'overhang that is believed to be important for telomere structure and proper functioning.2-5 Every time the cell divides, telomeric repeats are lost due to the “end-replication problem” ACKNOWLEDGEMENTS that was first predicted by James Watson6 and Alexei Olovnikov.7 According to the semi- This work was supported by Department of conservative model of DNA replication, synthesizing the ends of the telomere generates Defense BC031037 to A.J.S. and NIH AG01228 two structurally distinct ends. Leading strand synthesis is continuous and copies the to W.E.W.. W.E.W. is an Ellison Medical C-rich strand (3'-AATCCC. -5')DO to the NO very end to initially generate a blunt ended DNA. Foundation Senior Scholar. Alternatively, the replication machinery could fall off before it reaches the end generating a 5'C-rich overhang.8 Lagging strand synthesis of the G-rich strand is discontinuous and carried out by small Okazaki fragments. A 3'overhang will result upon the removal of the RNA primer used to generate the last fragment (Fig. 1). This overhang could be exactly the size of the RNA primer if it were positioned at the end of the telomere or the over- hang could be much longer if the placement of the final Okazaki fragment primer were random. Ultrastructural studies at the EM level have shown that the 3' G-rich overhang does not always exist as a free extension in cells, but is often tucked back into the preceding double stranded region to form a lariat like structure the “t-loop”.3 Furthermore, T- loops can be present on both ends of a chromosome, suggesting that the leading DNA strands (inititially being blunt-ended or possessing a 5' C-rich extension) has been processed in order to generate the 3' G-overhang that is required for t-loop formation. Whether over- hangs of lagging strand synthesis are processed further upon the removal of the last RNA primer is yet to be determined. Overhangs constitute an important structure of the ends that are needed for proper telomere function, yet information about their generation remains scarce. Most of the information we know comes from model organisms (yeast and ciliates), in which genetic and structural studies are more easily undertaken. Ciliates have short and very abundant telomeres that facilitate analysis. Studies by Price et al have shown that the overhangs in Euplotes are generated with precise terminal nucleotides at both ends (GGTTTTGG-3' at the G-rich strand and AAAACCC-5' at the C-rich strand) and the length of the overhang ©2005 LANDES BIOSCIENCEis always 14nt.9 Extensive studies characterizing mechanisms of overhang processing have also been done in Tetrahymena. Instead of ending in GGGTTG-3' as would be expected if the terminus is generated by dissociation of telomerase during the translocation step, most Tetrahymena telomeres end in TGGGGT-3'. Tetrahymena C-rich strands end with CAACCC-5' or CCAACC-5'. This suggests that overhang generation is mediated by two separate processing steps; one cleaves the G strand and the other resects the C strand, and both steps are distinctively terminated at a specific base.10,11 www.landesbioscience.com Cell Cycle 1467 Fine-Tuning the Chromosome Ends THE LAST BASE OF HUMAN TELOMERES The considerable length (5–15kb) of human telomeres makes it hard to analyze the end structure of these repetitive sequences. Recently developed assays to measure the overhang lengths in human cells show that overhang length control is less stringent than model organisms, ranging from ~35 -400 nucleotides.5,12-15 We developed assays that enabled us to determine the end-nucleotides of the telomere. For C-strand terminal nucleotide identification, we performed six different reactions ligating oligonucleotides called C-telorettes to the last base of the C-strand. Six C-telorettes were designed, each ending in a specific permutation of the AATCCC repeat. The telomeric over- hang was used as a template that guides Figure 1. Initial products of DNA replication. As the replication machinery is progressing through the telomere, the annealing of the C-telorettes in close it will generate two structurally different ends. The products of lagging strand synthesis initially have proximity to the C-strand end such that 3'G-rich overhangs while products of leading strand synthesis are initially either blunt ended telomeres or those telorettes that anneal next to the 5' telomeres possessing 5'C-rich overhang. Processing events could then modify these structures. end in register would be ligated to the C-strand. Following ligation, DNA was diluted and PCR amplification was per- formed to determine the fraction of chromosomes that ligated to a given C-telorette, thereby determining their terminal nucleotide. A similar assay was applied for G-strand terminal nucleotide identification, except that a long C-rich template was first annealed to the telom- eric overhang to create a 5' overhang that drives the subsequent annealing and ligation of the G-telorettes to the G-strand (Fig. 2). We showed that the C-strand is very tightly regulated such that 80% of the chromosomes end in ATC-5'. The same last base preference was observed for lead- ing and lagging strands, suggesting that the overhang processing step that specifies the end nucleotides is common to both strands of replication. This is in agreement with model organisms, that show a very precise terminal nucleotide of the C-strand, Figure 2. Strategy for end-nucleotide determination. Individual C and G-rich telorettes, representing all six permutations of the telomeric repeats (AATCCC/TTAGGG), are ligated separately to the terminal and suggests the existence of regulated nucleotide of the C-rich and the G-rich telomeric strands. The fraction of the telomeres that ligate to a given processing machinery that acts on the telorette is determined by PCR amplification reactions using a forward primer that is chromosome specific telomere ends post-replication. Unlike (located within subtelomere) and a reverse primer corresponding to the unique sequence part of each the G-terminus of model organisms that telorette oligonucleotide. was specific, the last G-base of humans was less precise. Nevertheless we found a bias towards three nucleotides occur. This is further supported by the fact that in the presence of (TAG-3', TTA-3' and GTT-3'). Since the C-strand ending in 3'- telomerase a shift in terminal nucleotide identity was seen so that CCAATC-5' is the template for leading strand replication, synthe- almost 50% of human chromosomes ended in TAG-3'. This sequence sizing the G-strand all the way to the very end generates a sequence matches the last base of the RNA template region of telomerase, and ending in GGTTAG-3'. If the replication machinery fell off one or is the pause site at which telomerase is predicted to dissociate from two nucleotides before the end, the G-strands would terminate in the telomere.16 It would be the sequence present if no further GGTTA-3' and GGTT-3'. These three ends accounted for 70% of processing of the G-terminus occurred thereafter (Fig. 3). the ends, suggesting that specific cleavage of the G-strand did not 1468 Cell Cycle 2005; Vol. 4 Issue 11 Fine-Tuning the Chromosome Ends 5'-3' exonuclease starts resecting the 5' end of the C-rich strand. In parallel, telomere binding proteins (such as TRF1 and TRF2) are loaded onto the preceding double-stranded DNA. The last protein complex loaded close to the telomere end could create a barrier that blocks further nuclease resection. That protein should bind to the double stranded DNA with great specificity to dictate the identity of the last base (Fig. 4A). An alternative model predicts that single stranded DNA binding proteins such as Pot-1, RPA or hnRNPA are being loaded on a 3'-G-rich overhang that is created by partial exonuclease resection of the 5'-end of the C-rich strand.