c03_063-094v2 8/26/05 8:57 AM Page 63
CHAPTER 3
Insulin, in its active form, is a hormone made up of two, covalently connected, peptide chains containing a total of 51 amino acids. It is secreted by the pancreas and regulates the metabolism of the sugar glucose. Impaired production or improper utilization of insulin causes the disease diabetes mellitus which is one of the most common metabolic diseases, afflicting millions of individuals in the world. Regular injection of insulin by diabetics maintains a proper concentration of blood glucose and allows them to lead a normal life. Because human insulin has been scarce and expensive, it was common for diabetics to use the peptide from other animals. Insulins from pigs and cows have structures similar to human insulin, with amino acid sequence (primary structure) differences at just one and three positions, respectively. Allergic reactions to the foreign insulins do occur but are rare. Human insulin made by recombinant DNA methods using genetically engineered bacteria is now available for clinical use. (© Julia Kamlish/Photo Researchers, Inc.) Amino Acids, Peptides, and Proteins
3.1 The Amino Acids in Proteins 3.4 Structural Properties of 3.5 Studying Protein Structure Properties of -Amino Acids Proteins and Function Classification of Amino Acids Molecular Mass Proteomics Reactivity and Analysis of Amino Acids Defining Protein Composition and Isolating and Purifying Proteins 3.2 Polypeptides and Proteins Behavior Determination of Primary Structure Four Levels of Protein Structure 3.3 Protein Function Sequencing DNA Importance of Protein Sequence Data Enzymes Structural Proteins Immune Proteins Transport and Storage Proteins Regulatory and Receptor Proteins Muscle Contraction and Mobility Proteins
63 c03_063-094v2 8/26/05 8:57 AM Page 64
64 Chapter 3 Amino Acids, Peptides, and Proteins
The term protein was first used by the Dutch chemist Gerardus Mulder in 1838 to name a specific group of substances abundant in all plants and animals. The impor- tance of proteins was correctly predicted by Mulder, who derived the name from the Greek word proteios, meaning “primary” or “first rank.” Proteins, biopolymers con- structed from amino acids, display a wide range of structures and functions.The amino acid building blocks are selected from a pool of 20 different molecules, each having a distinct chemical structure. Each type of protein has a specific structure that is defined by its genetically determined sequence of amino acids. The information necessary for the placement of amino acids in the proper proportion and order resides in the sequence of nucleotide bases in a gene (region of DNA). Now that the human genome sequence has been completed, we know to a high degree of accuracy the order of nucleotides in DNA. The challenge ahead is to understand the purpose of each and every protein product that is coded in the genome. We called this large group of pro- teins the proteome. Our scientific focus in the next few chapters will turn to proteomics—the broad study of all cellular proteins. More specifically,proteomics has the following goals: Classify and characterize all protein products coded by DNA, investigate regulation and expression levels (cellular concentrations and impact on health), determine the complete structure of each protein, study how proteins interact with one another and with other biomolecules, and identify the functional role of each. The unique amino acid composition and sequence in each type of protein allow it to fold into the precise three-dimensional arrangement necessary for carrying out its designated biological function. Proteins as a group of biomolecules display great func- tional diversity.Some proteins have roles in muscular contraction and structural rigid- ity; others serve in the transport and storage of small molecules.Antibodies (molecules for immune protection), enzymes (biological catalysts), and some hormones are pro- teins.The great variety in biological function is the direct result of the many variations in amino acid composition and sequence that are possible for a protein.
The Amino Acids in Proteins 3.1 Learning Objective Know the structural, chemical, and biological properties of the -amino acids present in proteins. α -Carbon COOH Properties of -Amino Acids H2N C H An amino acid is any organic molecule with at least one carboxyl group (organic R acid) and at least one amino group (organic base). Using this broad definition, sev- eral hundreds of different amino acids are known to be present in plant and animal cells. In this chapter, however, we focus on a select group of amino acids: those 20 that pH ' 7.4 2 are genetically coded for incorporation into proteins (Table 3.1). (See Window on Biochemistry 3-1 for a look at two additional amino acids that are incorporated into COO some microbial proteins.) This set of the 20 common amino acids shares a general structure of a carbon center (the alpha carbon) surrounded by a hydrogen, a car- H3N C H boxyl group, an amino group, and a side chain represented by R (Figure 3.1). Because R of this placement of functional groups around the -carbon center, they are often called -amino acids. The nature of the side chain, which can vary from a simple hydrogen atom to complex ring systems, determines the unique chemical and bio- logical reactivity of each individual amino acid. Figure 3.1 The general structure The amino acid structure at the top of Figure 3.1 shows the groups present, but not for the 20 amino acids in proteins. ¬ The four groups are linked covalently to in their natural ionic state. The carboxyl group ( COOH) is acidic, and hence it is ¬ the carbon. The structure on the top capable of donating a proton to solution.The amino group ( NH2) is basic and can shows the functional groups present, accept a proton. The structure at the bottom of Figure 3.1 shows the ionic structure but not in their proper ionic state. of the amino acid after these acid–base reactions and at physiological pH (7.4). The The bottom structure shows the correct R side chains may also have functional groups that are acidic or basic and thus may ionic structures at physiological pH. donate or accept protons. c03_063-094v2 8/26/05 8:57 AM Page 65
3.1 The Amino Acids in Proteins 65
Table 3.1 Table of abbreviations of the 20 amino acids found in proteins One-Letter Three-Letter Name Abbreviation Abbreviation Glycine G Gly Alanine A Ala Valine V Val Leucine L Leu Isoleucine I Ile Methionine M Met Phenylalanine F Phe Proline P Pro Serine S Ser Threonine T Thr Cysteine C Cys Asparagine N Asn Glutamine Q Gln Tyrosine Y Tyr Tryptophan W Trp Aspartate D Asp Glutamate E Glu Histidine H His Lysine K Lys Arginine R Arg
Stereochemistry The tetrahedral carbon in every amino acid, except glycine (R H), has four different atoms or groups bonded to it, hence is a chiral center. An important consequence of this arrangement is the existence of two nonsuperim- posable stereoisomers (enantiomers), called D- and L-amino acids (Figure 3.2). To show absolute configuration of the stereoisomers, the -carboxylate group is drawn up and behind the plane of the page.The R group (side chain) is drawn down and also behind the plane.The L isomer now has its amino group to the left and in front of the plane of the page. The D isomer has its amino group in front and to the right. The D and L isomers are mirror images of each other. Solutions of each pure stereoisomer show optical activity, that is, they rotate the angle of plane-polarized light. If the angle of light rotation for one isomer is to the left, then the other isomer rotates the polarized light to the right. In other words, the angle of rotation is opposite for the two solutions. Although both D and L isomers of amino acids exist in nature, only the L isomers are used as building blocks for proteins. However, discoveries of cellular D-amino acids in free form and in proteins have intrigued biochemists. The amino acids D-alanine and D-glutamate were first discovered in short pep- tides used as scaffolding in the cell walls of Gram-negative bacteria and archaea. The presence of the unusual stereoisomers makes the cell walls less vulnerable to hydrolytic attack by enzymes. D-Valine is present in peptide antibiotics including
mirror plane Figure 3.2 The D and L enantiomers for the amino acid phenylalanine. COO OOC Note that the two structures are mirror images of each other. The H3N C H H C NH3 dashed lines indicate bonds to groups behind the plane and wedges indicate CH2 H2C bonds to groups in front of the plane of the carbon.
L-Phenylalanine D-Phenylalanine c03_063-094v2 8/26/05 8:57 AM Page 66
66 Chapter 3 Amino Acids, Peptides, and Proteins
valinomycin, actinomycin D, and gramicidin A.The D-amino acids found in bacterial cell walls and antibiotics are not incorporated by the normal protein synthesis process (DNAS mRNAS protein; as described in Section 1.5). Rather the peptides are synthesized by individual reactions that use free amino acids that are not linked to transfer RNAs. Biochemists have recently found D-amino acids (especially aspartate) in proteins isolated from human teeth, eye lenses, erythrocytes, and some tumors. The proteins were those that had accumulated with age. Apparently, in aging protein, conversion of the natural L-aspartate to D-aspartate (racemization) occurs over time. The mod- ified proteins usually show less biological effectiveness than normal proteins. Perhaps the most intriguing new discovery about D-amino acids is that D-serine is synthesized in mammalian brain where it functions as a neurotransmitter. Researchers have found surprisingly high physiological concentrations of the rare enantiomer. Biologists at Johns Hopkins University School of Medicine have found an enzyme in mouse brain that catalyzes the racemization of L-serine to D-serine.