A Primer on Gene Regulation
Total Page:16
File Type:pdf, Size:1020Kb
12 A Primer on Gene Regulation To understand the principles Genes are the units of nucleotide sequence in DNA that specify a protein or Goal of gene regulation. non-coding RNA. The full complement of genes in the genome ofEscherichia coli is about 5,000, and that in the human genome is about 20,000. These genes Objectives are expressed via their transcription into RNA and subsequent (in the case After this chapter, you should be able to of protein-coding genes) translation into protein. Importantly, not all genes are expressed at the same time. In bacteria, some genes are expressed at a • distinguish between negative and constant rate, but others are turned ON (transcribed) or OFF in response to positive control. cues from the environment. In multicellular organisms, such as the embryo • calculate K for repressor binding to eq of an animal, genes are turned ON in a cell- or tissue-specific manner at the DNA. right time and in the right place in response to developmental cues. Gene • explain the lac operon AND gate. regulation is a vastly complicated and fascinating subject encompassing an extraordinary range of molecular mechanisms. This chapter is intended as a primer for introducing the classic example of the lactose operon in bacteria and the concepts of negative and positive control. Genes involved in lactose metabolism are grouped in a single transcription unit, the lactose operon The subject of gene regulation derives from the seminal discoveries on the lactose (lac) operon of E. coli made by François Jacob and Jacque Monod while they were working at the Institut Pasteur in Paris and for which they shared the Nobel Prize in Physiology and Medicine in 1965. An operon is two or more genes that are co-transcribed from a common promoter as part of a single transcription unit. Thus, an operon is transcribed as a single transcript that contains the coding sequences for two or more proteins. Chapter 12 A Primer on Gene Regulation 2 OH O HO + 6 O 6 CO + 6 H O + Energy HO OH 2 2 2 OH Glucose Glucose metabolism Glucose cellular enzymes OH OH β-linkage OH OH OH OH O O β-galactosidase O O O OH + HO HO HO OH HO HO OH OH OH OH OH H2O Lactose metabolism Lactose Galactose Glucose Figure 1 β-galactosidase cleaves lactose, producing glucose that can fuel cellular metabolism (The grouping of genes into operons is common in bacteria but rare in eukaryotes.) The lactose or lac operon contains three genes, lacZ, lacY, and lacA, but we will only be concerned with the most promoter-proximal gene, lacZ, which encodes β-galactosidase. β-galactosidase is an enzyme that enables E. coli to metabolize the sugar lactose. The preferred carbon and energy source for E. coli is glucose, but E. coli will instead metabolize lactose if no glucose is present in the growth medium. Lactose is a disaccharide composed of the sugars galactose and glucose. β-galactosidase cleaves the glycosidic bond (a β-glycosidic bond that links the 1 position of galactose to the 4 position of glucose) that connects galactose and glucose, thereby releasing free glucose and free galactose, which another cellular enzyme converts into glucose (Figure 1). If E. coli is growing on its preferred carbon source, glucose, then it would be wasteful to produce β-galactosidase. On the other hand, if the growth medium contains lactose and not glucose, then production of β-galactosidase is essential for growth and viability. How does E. coli cope with these conflicting requirements? The answer is that transcription of the operon is subject to a regulatory mechanism that turns ON the operon when lactose is present. (Shortly, we will come to the interesting circumstance when glucose and lactose are present simultaneously.) The Lac repressor negatively regulates thelac operon How does lactose turn ON transcription of the lac operon? Transcription is controlled by a regulatory protein known as the lactose operon repressor or LacI. The gene for LacI is located just upstream of thelac operon and is transcribed from its own separate promoter. The repressor is a tetramer of four LacI subunits (i.e., it has quaternary structure). The LacI tetramer binds to a nucleotide sequence known as the operator, which overlaps with Chapter 12 A Primer on Gene Regulation 3 Figure 2 Expression of the lac repressor (LacI) operon is negatively regulated by LacI operator promoter lacZ lacY lacA transcription genes encoded by lac operon +1 upstream downstream the promoter for the operon; by binding, LacI blocks RNA polymerase from accessing the promoter and hence blocks transcription (Figure 2). LacI is therefore a paradigmatic example of negative regulation in which the binding of a regulatory protein to DNA represses transcription. (We will come to positive regulation presently.) How does the lac operon escape repression to turn on the synthesis of β-galactosidase when lactose is present in the growth medium instead of glucose? The answer is that lactose acts as an inducer that binds to LacI, preventing the repressor from binding to the operator (Figure 3). Because LacI forms a tetramer, the inducer has four binding sites on the repressor. The inducer turns ON (derepresses) the operon by preventing the binding of the repressor to the operator and allowing RNA polymerase to bind. (Actually, the inducer is not lactose per se but rather a slightly modified form of lactose called allolactose. When lactose enters the cell, some of it is converted to allolactose by β-galactosidase. The two disaccharides differ only in that the 1 position of galactose is linked to the 4 position of glucose in lactose and to the 6 position of glucose in allolactose. That the inducer is allolactose and not lactose is an oddity of nature that need not concern us further in what follows.) repressor in high- anity conformation repressor bound to inducer inducer operator DNA + operator DNA transcription transcription repressed not repressed Figure 3 Inducer triggers the dissociation of the repressor from the operator Notice that the repressor exists in two conformations, as indicated in the cartoon by circular and rectangular shapes. Chapter 12 A Primer on Gene Regulation 4 inducer repressor bound Keq < 1 to inducer repressor repressor high-anity low-anity conformation conformation Figure 4 Inducer shifts the equilibrium between thelac repressor’s high-affinity and low-affinity DNA binding conformations towards the low-affinity conformation How exactly does the inducer remove the repressor from the operator? The inducer’s effect is another example of Le Châtelier’s principle (Figure 4). The repressor exists in two conformations: a conformation in which it has high affinity for DNA and a conformation in which it has low affinity for DNA. The two conformations are in equilibrium, with the high-affinity conformation being strongly favored. The inducer, however, only binds to the low-affinity conformation. Therefore, when lactose is present, the inducer binds to the low-affinity conformation and removes it from the high-affinity/low- affinity equilibrium. In order for the ratio of low-affinity to high-affinity repressor to remain equal to the equilibrium constant, there must be a net conversion of high-affinity repressor to the low-affinity conformation. This depletes the amount of repressor in the high-affinity conformation. Taken as a whole, the presence of inducer perturbs the equilibrium between low- affinity and high-affinity conformations, decreasing the amount of high- affinity repressor and ultimately decreasing the amount of repressor bound to DNA. Let’s look more closely at how the repressor prevents RNA polymerase from binding to the promoter. When RNA polymerase binds to the promoter, it physically contacts a stretch of DNA that extends upstream to roughly position −40 relative to the start site of transcription (recall that the sigma factor contacts the −35 and −10 sequences) and downstream to roughly position +20. Meanwhile, the stretch of DNA contacted by the repressor, the operator, overlaps with the downstream region of the promoter, covering the transcription start site and extending past the end of the promoter (Figure 5). Thus, when the repressor binds to the operator, it physically occludes RNA polymerase. DNA covered by RNA polymerase transcription 5’ AATGTGAGTTAGCTCACTCATTAGGCACCCCAGGCTTTACATTTATGCTTCCGGCTCGTATGTTGTGTGGAATTGTGAGCGGATAACAATTTCACAC 3’ 3’ TTACACTCAATCGAGTGAGTAATCCGTGGGGTCCGAAATGTAAATACGAAGGCCGAGCATACAACACACCTTAACACTCGCCTATTGTTAAAGTGTG 5’ −35 sequence −10 sequence +1 CAP site DNA covered by repressor Figure 5 Binding of the repressor to the operator occludes RNA polymerase Shown are the DNA binding sites for RNA polymerase, the repressor, and CAP, which is introduced below. Chapter 12 A Primer on Gene Regulation 5 Figure 6 The operator is composed of two palindromic “half-sites” Shown is a surface representation of the repressor. The repressor exists in the cell as a tetramer composed of four polypeptide chains; however, only the two polypeptide repressor chains that contact the operator are shown (cyan and green). The lac operator DNA is also shown. On the bottom is a diagram of the two palindromic “half-sites” of the operator. Dashes indicate bases that are not identical between the half-sites. lac operator DNA 5’GGAATTGTGAGCGGATAACAATTTC 3’ 3’CCTTAACACTCGCCTATTGTTAAAG 5’ 5’ AATTGT-A-C 3’ 3’C-A-TGTTAA 5’ “half-site” “half-site” lac operator The sequence of bases that makes up the operator is present in two copies in an inverted repeat (or head-to-head) orientation, meaning that the operator is a palindrome (Figure 6). Because of its symmetry, the operator can be divided into two “half-sites.” Two of the four polypeptide subunits of the tetrameric repressor contact the operator, with one subunit contacting one half-site and the other subunit contacting the other half-site. Proteins that exhibit quaternary structure and bind to repeated sequences in DNA is a common theme among DNA-binding proteins both in bacteria and eukaryotes.