And Assignment of the Chromosomal Locus (Human Serum Proteins/Proteinase Inhibitors/Human Chromosome Mapping/Thiol Ester Bond) CHEN-CHEN KAN*, ELLEN Solomont, K
Total Page:16
File Type:pdf, Size:1020Kb
Proc. Nati. Acad. Sci. USA Vol. 82, pp. 2282-2286, April 1985 Biochemistry Nucleotide sequence of cDNA encoding human a2-macroglobulin and assignment of the chromosomal locus (human serum proteins/proteinase inhibitors/human chromosome mapping/thiol ester bond) CHEN-CHEN KAN*, ELLEN SOLOMONt, K. TERTIA BELTt, AILEEN C. CHAIN*, LYNNE R. HIORNSt, AND GEORG FEY* *Department of Immunology, Research Institute of Scripps Clinic, 10666 North Torrey Pines Road, La Jolla, CA 92037; tImperial Cancer Research Funds Laboratory, P.O. Box 123, Lincoln's Inn Fields, London WC2A 3PX, England; and tDepartment of Biochemistry, University of Oxford, South Parks Road, Oxford OX1 3QU, England Communicated by Frank J. Dixon, December 6, 1984 ABSTRACT Six a2-macroglobulin (a2M) cDNA clones ester sites are conserved. These observations led to the pro- were isolated from a human liver cDNA library by using syn- posal that the C3, C4, and a2M genes are derived from a thetic oligonucleotides as hybridization probes. One of these, common ancestral gene (16). The sequences of murine and pa2M1, carries a 4.6-kilobase-pair insert, which was se- human C3 and human C4 and partial cDNA sequences of quenced. The insert contains the coding sequences for the ma- murine C4 have recently been determined (17-21). Compari- ture a2M polypeptide (1451 amino acids) and for a 23-amino son of human a2M with murine C3 revealed a 25% overall acid signal peptide at the NH2 terminus of the precursor pro- sequence homology (17, 18, 22), supporting the proposal of a2M. At the 3' end of the insert a poly(A) addition signal A-A- their common evolutionary origin. The human C3 gene is lo- T-A-A-A and part of the poly(A) tail of the messenger RNA cated on chromosome 19 (23), and the human C4 genes map were found. The protein sequence deduced from the nucleotide to the histocompatibility complex on chromosome 6 (24). sequence agrees with the published a2M amino acid sequence In addition to binding to proteinases, a2M can also adhere for all except three residues. The a2M locus was assigned to to small molecules, including lectins, cations (zinc), basic human chromosome 12 by Southern blot analysis with DNA proteins (myelin basic protein, histone H4), and platelet-de- from a panel of mouse/human somatic cell hybrids, using a2M rived growth factor (PDGF) (1, 25). The binding mechanisms cDNA as a hybridization probe. involved are probably unrelated to the trapping of protein- ases because adherence of small molecules does not prevent a2-Macroglobulin (a2M) is a serum glycoprotein and a major the proteinase binding (1). A physiological role for the com- plasma proteinase inhibitor with a wide specificity. a2M-re- plex formation between a2M and these small molecules has lated proteins are present in all vertebrate species (1-4). Hu- not been established. The striking similarity of the amino man a2M is a tetramer of four identical 185-kDa subunits, acid sequences of PDGF and p28Sis, the transforming protein arranged as a pair of dimers each consisting of two disulfide- of the simian sarcoma virus (26, 27), suggests that viral and linked monomers (5, 6). The a2M polypeptide has a so-called other cellular transformation mechanisms may be mediated bait region and an internal thiol ester bond, which account by growth factor-like proteins. Thus, a2M may serve as an for its properties as a proteinase inhibitor. The bait region, important role in limiting the effects of PDGF-like mitogens. composed of a series of target peptide bonds for plasma pro- The amino acid sequence of a2M has recently been report- teinases (7, 8), is located close to the center of the monomer- ed (28). For our future studies of the structural properties of ic subunit. The thiol ester bond is a covalent linkage between a2M that underlie its functions and of the regulation of the y-carboxyl function of glutamate residue 975 and the sul- expression of the a2M gene, it is necessary to prepare cDNA hydryl group of cysteine residue 972 (9-12). Cleavage of the clones and to derive the complete cDNA sequence. Here we bait region by proteinases leads to a conformational change report the isolation and sequence analysis of human a2M of the a2M molecule. This conformational change activates cDNA clones and the assignment of the human a2M locus to the thiol ester bond, leading to its hydrolysis and as a conse- chromosome 12. quence to covalent binding between the proteinase and a2M via the reactive glutamate 975. a2M receptors are present on MATERIALS AND METHODS the surfaces of fibroblasts and macrophages. These receptor proteins preferentially bind a2M-proteinase complexes or Materials. Restriction enzymes were from Boehringer a2M molecules with hydrolyzed thiol ester bonds (13, 14). Mannheim, DNA polymerase I (Klenow fragment) was from This suggests that the conformational change, which accom- Bethesda Research Laboratories, and the 17-nucleotide uni- panies the complex formation between a2M and proteinases versal sequencing primer was from Collaborative Research. and the hydrolysis of the thiol ester bond, exposes regions of T4 DNA polymerase and dideoxy- and deoxynucleotide tri- the a2M molecule that are recognized by these receptors. phosphates were from P-L Biochemicals. The radionucleo- Receptor-mediated endocytosis of proteinase-a2M com- tides [a-32P]dCTP (>400 Ci/mmol; 1 Ci = 37 GBq) and plexes by macrophages and liver cells leads to clearance of deoxyadenosine 5'-(a-[35S]thio)triphosphate (>400 Ci/ the complexes from the circulation. mmol) were from Amersham. T4 DNA ligase and the Esche- Internal thiol ester bonds are also found in the comple- richia coli strain JM101-TG1 were gifts from D. Bentley and ment proteins C3 and C4 (15), which are derived from pre- T. Gibson. Oligonucleotide probes were synthesized by Cre- cursor polypeptides of size similar to a2M (180-200 kDa). ative Biomolecules (South San Francisco, CA). Their thiol ester sites are found in positions comparable to Synthetic Oligonucleotide Probes and Isolation of cDNA those in a2M, and the amino acid sequences of all three thiol Clones. The construction of the human liver cDNA library enriched for mRNA species over 2000 nucleotides in length The publication costs of this article were defrayed in part by page charge payment. This article must therefore be hereby marked "advertisement" Abbreviations: C3 and C4, third and fourth components of comple- in accordance with 18 U.S.C. §1734 solely to indicate this fact. ment; a2M, a2-macroglobulin; kb, kilobase pair(s). 2282 Biochemistry: Kan et aL Proc. Natl. Acad. Sci. USA 82 (1985) 2283 with the plasmid vector pAT 153 Pvu II-8 has been previous- Schuell) (43). Filters were hyridized with nick-translated (44) ly described (19). Two methionine residues at positions 1378 a2M cDNA in 2.5x NaCl/Cit (lx NaCl/Cit is 150 mM sodi- and 1385 of the human a2M sequence (28) and their sur- um chloride/15 mM trisodium citrate), lOx Denhardt's solu- rounding sequences were chosen as probe sites because they tion (2% Ficoll/2% bovine serum albumin/2% polyvinylpyr- were encoded by nucleotide triplets with minimal degenera- rolidone), 0.1% sodium dodecyl sulfate, 6% polyethylene 1378 glycol 6000 (Fisons, Loughborough, England) or 10% dex- cy. These amino acid sequences are Asn-Met-Ala-Ile- 1385 tran sulfate, and denatured salmon sperm DNA at 50 ,ug/ml. Val-Asp and Asp-Val-Lys-Met-Val. The corresponding Hybridization was carried out at 65°C overnight. Filters oligonucleotide mixtures were were washed at 65°C successively with 2x, 1x, 0.3x, and AA 0.lx NaCl/Cit, three times for 20 min each. ST'-T-C-N-A-C-G-A-T-N-G-C-C-A-T-G _T-T-3' RESULTS (N represents all four nucleotides) and Identification and Sequence Analysis of cDNA Clones. A S5'-A-C-C-A-T- C_T-T-N-A-C-G T-C-3'. human liver cDNA library was screened with two mixed synthetic oligodeoxynucleotide probes, a 17-mer and a 14- This cDNA library consisted of about 15,000 different mer constructed on the basis of known amino acid se- recombinant clones; 30,000 colonies were plated on nitrocel- quences. Of 30,000 colonies screened, 34 clones hybridized lulose discs. A set of replica filters were prepared from the with both the 17-mer and 14-mer probes. master filters (29). Colonies on replica filters were grown, After restriction enzyme analysis of the cDNA inserts, six then transferred to chloramphenicol plates to amplify the recombinant plamids were selected for detailed analysis. plasmid. The colonies were processed (30, 31) and hybrid- One of them, pa2M1, carries a 4.6-kilobase pair (kb) insert ized with kinase 32P-labeled (32) 14-mer and 17-mer oligonu- whose nucleotide sequence was determined. The nucleotide cleotide mixtures. Positive colonies were rescreened until sequence from the 5' nontranslated region to the poly(A) tail genetic homogeneity was reached. The length of cDNA in- of the mRNA and the derived amino acid sequence are given serts was determined by restriction analysis with the en- in Fig. 1. The complete coding sequence was determined on zymes HindIII and Sal I. both DNA strands; each character was determined 3.75 DNA Sequence Analysis. Plasmid DNAs carrying the long- times on average. A methionine encoding triplet (nucleotides est cDNA inserts were digested with HindIII and Sal I. The 44 to 46) was identified as the translational initiation site. fragments were separated by electrophoresis in low-melting This methionine, together with the following 22 amino acid agarose gels and recovered from the agarose by extraction residues, forms the signal peptide of the a2M protein, which with phenol.