DNA Prokaryote Transcription Steps (Updated February 2013) 5'
Total Page:16
File Type:pdf, Size:1020Kb
5' TTGACA TATAAT +1 AGGAGGT ATG TTA ATG TGA TAG 3' 3' Gene A Gene C AACTGT ATATTA TCCTCCA TAC ATT TAC ACT ATC 5' URS - 35 - 10 transcription Shine-Dalgarno translation translation sequences start rho or GC hairpin Pribnow Box start stop loop transcription σ discriminator termination sequences DNA Prokaryote Transcription Steps (updated February 2013) 5' 3' β' 1. In initiation the RNA polymerase holoenzyme with two alpha, one beta, URS ω sequences one beta prime and one omega bind the σ sigma ( ) unit which binds to the σ α Pribnow box and the TTGACA β discriminator. TTGACA TATAAT +1 AGGAGGT ATG TTA 3' Gene A AACTGT ATATTA 5' - 35 - 10 TCCTCCA TAC ATT rho or GC hairpin Pribnow Box Shine-Dalgarno translation translation σ discriminator start stop loop transcription termination sequences URS sequences 5' 2. Transcription factors bind the upstream regulatory 3' sequences and to the alpha (α) subunits of the RNA polymerase. These factors affect binding strength of the RNA polymerase and the ability of sigma (σ) to bend and open (melt) the DNA double helix around the transcription start point. 3. The RNA polymerase starts synthesis of the mRNA assembling a strand of about 11 RNA nucleotides. β' ω +1 TTGACA TATAAT AGGAGGT ATG TTA 3' σ Gene A AACTGTα ATATTA TCCTCCA TAC ATT 5' - 35 - 10 β Shine-Dalgarno translation translation rho or GC hairpin Pribnow Box transcription start stop loop transcription σ discriminator start termination sequences URS sequences 4. In promoter clearance sigma domains reposition so that the RNA 5' polymerase holoenzyme can enter the elongation stage where the rest of the 3' template DNA is transcribed increasing the length of the mRNA strand. 5. NusA and Nus G bind to the RNA polymerase to keep it on track to finish DNA transcription. The elongation complex leaves behind the transcription factors associated with the URS. In rho-independent termination, NusA facilitates GC hairpin loop formation. +1 β' ω TTGACA TATAAT 3' σ 5' AAUCGUAGGAGGUCCAGCGAUG α nusG 3' ATT 5' 5' AACTGT ATATTA β - 35 - 10 3' TCCTCCA 5' 3' TAC 5' Gene A translation rho or GC hairpin transcription Pribnow Box Shine-Dalgarno translation stop loop transcription start σ discriminator start termination sequences nusA 5a. Rho proteins bind to mRNA strand until they catch up to the beta subunit of the RNA polymerase. When both mRNA and the beta subunit are bound, the complex falls off of the DNA template strand as rho functions like a DNA-RNA helicase and unwinds the mRNA from the 5' AAUCGUAGGAGGU template DNA strand. There is a stem loop but no Uracil AUG tract. β' ω 5' 3' rho σ nusG 3' α 5' β nusA TATAAT +1 5' TTGACA AGGAGGT ATG TTA ATG TGA TAG 3' Gene A Gene C 3' 5' AACTGT ATATTA TCCTCCA TAC ATT TAC URS transcription ACT ATC - 35 - 10 Shine-Dalgarno translation translation sequences start Pribnow Box start stop σ discriminator 5' β' ω σ α rho β 3' mRNA 5b. GC-hairpin loop preceding an AAAAA-rich template DNA sequence puts drag on the RNA polymerase so that it is more easily derailed where the A-U hydrogen bonds are fewer in number and therefore weaker. 5' AAUCGUAGGAGGU AUG 5' AAAAAAAAA 3' UUUUUUUUU 3' 5' β' 5' ω α A β U UUUUUUUUU G nusA nusG 3' mRNA σ DNA Eukaryote Transcription Steps (updated February 2013) Eukaryote transcription is monocistronic meaning that only one polypeptide coding region is under control of the promoter. The promoter has several sequences that are similar to the Pribnow and TTGACA boxes in prokaryote promoters. The TATA box (TATAAA) is almost identical to the Pribnow sequence. Only about 32% of the known eukaryote core promoters have the TATA box at -26 to -31. If the iniator (Inr) core promoter element (YYA+1Nt/aYY)* is present at - 2 to + 4, it acts synergistically with the TATA box. The downstream promoter element (DPE) at +28 to +32 (a/g G a/t c/t g/a/c) in TATA-less promoters requires initiator to function. Motif ten element (MTE) at +18 to +27 (C g/a A a/g C g/c c/a/g AACG g/c) also requires initiator. It can act independently of the TATA box or the downstream promoter element, or synergistically when either are present. Proximal promoter elements include the CAAT box at -70 to -200 (CCAAT) and the GC box also at - 70 to -200 (GGGCGG) whose binding proteins act to tether long-range regulatory elements such as enhancers to the core promoter through the mediator transcription factor. Long-range regulatory elements (upstream activating sequences) include enhancers, silencers, and insulators which are found at euchromatin/heterochromatin boundaries where they prevent bleed- over by enhancers and silencers of adjacent genes. Also included are locus control regions (LCRs) which organize downstream chromatin into open configurations for RNA polymerase access for transcription. Matrix attachment regions (MARs) in interphase chromatin and scaffolding attachment regions (SARs) in metaphase chromosomes organize chromatin into loops averaging 70 kilobases (kb) between attachment points recruiting chromatin remodeling enzymes to assist tissue-specific and developmental stage specific transcription of genes. The TATA(box) Binding Protein (TBP) binds the minor groove of the DNA at the TATAAA sequence with saddle-like antiparallel β sheets which cause the DNA to bend almost 90o and melt (open up) in a fashion similar to sigma protein binding of the Pribnow, TGn and TTGACA boxes. TBP associated factors (TAFs) bind the TATA binding protein to form the TFIID complex. TAF-1 and TAF-2 bind initiator. SP1 that binds the GC box also binds TAF-4 with a glutamine-rich transactivation domain. TFIIB binds to one end of TBP and to a GC rich TFIIB recognition element (BRE) upstream to the TATA box. This gives both direction and strand specificity to the transcription pre-initiation complex because TFIIB also binds RNA polymerase II (RNA pol II is a 12 subunit holoenyzme) with a cysteine-rich zinc-binding ribbon domain to recruit RNA pol II to the transcription pre-iniation complex with DNA at base +1 above the active site center. TFIIF also assists the placement of promoter DNA in this complex strengthening binding. Next TFIIE binds and then it recruits TFIIH where H stands for helicase. With TFIIE, the helicase unwinds the DNA in the active site and TFIIF captures the nontemplate DNA strand moving it out of the active site while the template strand migrates into the active site channel. TFIIE drops off. TFIIH also has kinase activity and it phosphorylates the C-Terminal Domain (CTD) of the largest RNA pol II subunit at multiple serine- rich amino acid repeats. This initiates promoter clearance. Site-specific serine phosphorylation and dephosphorylation are involved in RNA pol II binding (pre-initiation), promoter clearance, elongation and termination in transcription. Only dephosphorylated CTD-RNA pol II can bind the promoter. In promoter clearance TFIIH helicase activity and TFIIB assist formation of the replication bubble. Once the RNA strand exceeds 10 nucleotides (bases), TFIIB drops off. Additional phosphorylation of the CTD of the RNA pol II by TFIIH pushes the polymerase into the elongation phase. TFIIH drops off. TFIID stays behind to form a new pre-initiation complex. TFIIF stays to keep nontemplate DNA sequestered. TFIIS binds to the RNA pol II complex to keep the polymerase on track much like nusA and nusG in prokaryote transcription. This gives a basal rate of transcription. Mediator-binding of RNA pol II and proximal and long-range regulatory element transcription factors can speed up processing of the pre-initiation complex and moving through promoter clearance. In eukaryotes there are three different RNA polymerases: RNA polymerase I transcribes rDNA, RNA polymerase II transcribes DNA that codes for polypeptides as hnRNA and structural genes that produce splicing snRNA, while RNA polymerase III transcribes 5S rDNA, tDNA and other snDNA genes.] Other transcription factors bind the CAAT box, GC boxes or CACCC boxes if present as well as enhancer or silencer sequences which may also be found in certain upstream regulatory sequences of a given structural gene promoter. Sometimes included in these regulatory sequences are response elements for different hormones, heat shock, light, etc and possibly homeoboxes (animals) or MADS boxes (plants) that control developmental pathways. Regulation of eukaryotic genes appears to be more complex than that of prokaryotic genes. Once all necessary factors are in place, the DNA double helix opens and now the RNA polymerase is able to directly transcribe the RNA. Termination for rRNA sequences uses a rho-like factor that binds the DNA downstream of the termination site to cause the RNA polymerase I to separate from the DNA. This is different than the prokaryote mechanism of binding the newly synthesized RNA molecule and with helicase activity unwinding it from the RNA polymerase-DNA complex to stop transcription. RNA polymerase III relies on a DNA termination sequence of many A nucleotides generating a polyU sequence in tRNA or 5S rRNA to make it easier for the polymerase and RNA to separate from the DNA template. This is similar to the rho-independent system of bacteria except that the G-C hairpin loop is not required. For RNA polymerase II transcribing polypeptide genes to ultimately make mRNA, the RNA transcribed by RNA polymerase is called hnRNA (heterogeneous nuclear RNA) or sometimes pre-mRNA (precursor-mRNA). Transcription often continues for several thousand bases past the end of the polypeptide coding region. On this 3' end of the hnRNA at about 30 to 50 nucleotides past the translation stop codon, there is polyA cleavage site AAUAAA.