CANCER RESEARCH56. 2539-2545. June I, 19961 Molecular Modeling of the Amino-Terminal Ring Domain of BRCA1

Rachelle J. Bienstock,1 Tom Darden, Roger Wiseman, Lee Pedersen, and J. Carl Barrett

National institute of Environmental Health Sciences, Laboratories of Quantitative and Computational Biology fR. J. B.. T. D., L P.] and Molecular Carcinogenesis IR. W.. J. C. B.J, Research Triangle Park. North Carolina 27709

ABSTRACT in Fig. 1. Although a common functionality for the zinc ring domain has not been identified, it appears to interact with DNA either through The equine herpes virus zinc ring domain nuclear magnetic resonance direct binding or indirectly by mediating protein-protein interactions. structure was used for homology-based modeling of the amino-terminal The zinc ring domain of several proteins has been shown to exhibit zinc ring domain of the BRCA1 breast and ovarian cancer susceptibility . The zinc ring domain of BRCAJ is of particular interest because it DNA-binding activity (10) or nuclear localization (11—13). is the location of significant and frequently occurring missense (Cys6tGly, The BRCA1 zinc ring domain and its structure and function are of Cys―Gly,andCys―Tyr)andframeshift (lSSdelAG) mutations observed particular interest, because it is the location of some of the most In several high-risk kindreds. The BRCA1 zinc ring domain possesses frequently occurring mutations linked to breast and ovarian cancer. 54% sequence similarity with the equine herpes virus zinc ring domain. The l85delAG mutation (deletion oftwo nucleotide base pairs in exon The model structure undergoes little conformational variance after 140 ps 2) in the zinc ring domain has been shown to occur in 1 in 100 of solvated molecular dynamics. This model proposes BRCA1 zinc ring Ashkenazi Jewish individuals (14, 15). This is in comparison with the domain residues that may play a role in DNA binding and/or protein estimated I in 833 frequency of all BRCAI mutations in the general protein interactions. These predictions provide a point of departure for population (14, 15). In this article, a three-dimensional model for the the design of mutants to probe BRCA1 zinc ring domain functionality. zinc ring domain of BRCA1 is presented. This model proposes po tential BRCAI zinc ring domain residues that may play a role in the INTRODUCTION function of this domain, either binding directly to DNA or mediating DNA binding through protein-protein interactions. BRCAJ, the breast and ovarian cancer susceptibility gene on human chromosome l7q21, was recently identified (1, 2). Mutations in this gene account for approximately 95% of families with increased mci MATERIALS AND METHODS dence of early onset breast and ovarian cancer and 45% of families with increased incidence of breast cancer alone (3, 4). BRCAJ is Sequence Alignment. The three-dimensional structures of two zinc ring believed to encode a tumor suppressor protein, because most muta domains, the EHV2and the acute PML proto-oncoprotein, have recently been tions appear to result in loss of function. This gene is expressed in determined by multidimensional NMR (8, 9, 1 1). The sequence alignment several tissues, including breast and ovary, and encodes a I863-amino between the PML and EHV zinc ring domains is shown in Fig. 2 along with the secondary structural elements identified for each protein from the solved acid protein. NMR structures. Several criteria were used to select the EHV ring domain The amino-terminal domain of BRCA1 possesses high sequence structure rather than the PML structure as the basis for the BRCA1 model. The homology with a topology (1). This zinc finger topology, BRCAI zinc ring domain, for example, possesses greater sequence similarity C3HC4,isreferredtoasthe“zinc-ringdomain,―“RINGfinger,―or to EHV than to PML. Using a sequence alignment of Smith and Waterman (16) A-Box (5) and is found in several viral proteins, proto-oncoproteins, with standard default parameters, the BRCA I zinc ring domain possesses 54% and regulatory and transcription factors (6). These include develop sequence similarity and 38% sequence identity with the EHV zinc ring domain, mental regulatory proteins [XNF-7, SINA, PSC, SU(z)2, and DG17}, compared with 42.8% sequence similarity and 30.6% sequence identity to the cell regulatory proteins (lAP and DROC3H), signal transduction PMLdomain.Fig. 2 shows the sequencealignmentbetween the BRCA1 and proteins (CRAF1), human oncoproteins (MDM2, MEL18, BMI-l, EHV and PML ring domains,respectively. RET, T18, CBL, and PML), proteins that regulate gene expression A BLASTP (Basic Local Alignment Search Tool for Proteins; Ref. 17) search of the Brookhaven Data Bank selected and aligned the EHV with the (RPT-l, PAS4, TLR, and HIP116), viral proteins, including some BRCA1 zinc ring domain, and yielded a P of 9.6 X l0@ (chance likelihood viral gene expression proteins, (lEl 10, VZ61, CG3O, PE-38, EPO, of the two sequences relationship). Folding search programs, such as Pro EHV, LCMV, and P11), and proteins involved in DNA repair or files-3D by Luthy et a!. (18), also indicated that the EHV and BRCA1 zinc ring recombination (RAD5, RADI 8, RAD16, RAG- 1, and UVS2). protein domains have similar folds. Hydrophilicity plots of Kyte and Doolittle The zinc ring family, first identified in 1991 (7), was found to (19), surface probability [calculated according to the method of Emini et a!. possess two finger-like domains connected by a linking region and to (20)], chain flexibility predictions (Karplus-Schulz), secondary structure pre require zinc for folding. The zinc ring C3HC4 sequence folds inde dictions of turns and a helices by Chou and Fasman (21), turns prediction by pendently and is a distinct stable domain (8, 9). This domain differs Gamier et al. (22), and the antigenicity index of Jameson and Wolf (23) all structurally from other zinc finger domains in that the first and third support greater similarity of the EHV ring domain structure to BRCAI than to pairs of cysteines form one zinc-binding site, and the second and PML(datanot shown). A multiple-sequencealignmentof the threesequences (the Pileup algorithm from the package below; Refs. 24 and 25) yields pairwise fourth pairs of cysteines form the second metal-binding site. This similarity scores ofO.44 for BRCA1 and EHV, 0.37 for BRCAI and PML, and crossed pattern is similar in configuration to a 1—3, 2—4 disulfide 0.42 for EHV and PML. Therefore, EHV was used as the basis for developing bond pattern. A schematic of the BRCA1 amino-terminal zinc ring a model for the BRCA1 zinc ring domain. domain with cysteine and histidines ligated to zinc indicated is shown Several multiple sequence alignments have been published previously, aligning the BRCAI zinc ring domain sequence to the sequences of other Received I2/28/95; accepted 4/3/96. known ring domains. The alignment by Borden et a!. (11) also shows greater The costs of publication of this article were defrayed in part by the payment of page sequence similarity between BRCA1 and EHV than between BRCA1 and charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. I To whom requests for reprints should be addressed, at National Institute of Envi 2 The abbreviations used are: EHV, equine herpes virus; PML, promyelocytic leuke ronmental Health Sciences, P.O. Box 12233, Mail Drop 10-03, Research Triangle Park, mia; NMR, nuclear magnetic resonance; CVFF, consistent valence force field; kcal, NC 27709. Phone: (919) 541-3397; Fax: (919) 541-1578; E-mail: rachelle@ kilocalorie; RMS, root-mean-square; HSV, herpes simplex virus; CRD, cysteine rich picard.niehs.nih.gov. domain of protein kinate C; WT1,Wilms' tumor gene; TFIIIA, transcription factor III A. 2539

Downloaded from cancerres.aacrjournals.org on September 30, 2021. © 1996 American Association for Cancer Research. MOLECULAR MODELING OF BRCAI AMINO-TERMINALZINC RING DOMAIN

The two zinc ions were entered into the structure by superposition with the herpes virus structure, which gave a general positioning for the zinc ions in the protein. The resulting structure was energy minimized in vacuo using a distance-dependent dielectric constant, as described by Guenot and Kollman (28). The ligand-zinc ion distances were constrained in the minimization to the distances found between the corresponding ligands and zinc in the herpes virus protein. The zinc-cysteine sulfur atoms were pulled together using 500 cycles of conjugate gradients with a force of 100 kcal/mol, followed by minimization with 500 cycles of steepest descents and 1000 cycles of conjugate gradients until the maximum derivative was less than 0.1 kcal/A. All constraints were R then removed, and the structure was reminimized with 1000 cycles of conju F SQ L gate gradients until the maximum derivative was less than 0. 1 kcal/A. Charges were assigned to the zinc ions and cysteine ligands Sy and histidine Nbl . Because of the ionic and covalent nature of the zinc ligand interactions, the zinc ion charges are reduced and assigned values of + I.5 in the four cysteine-ligand environment and + 1.9 in the three-cysteine- and one-histidine ligand environment. All cysteine Sys have negative partial charges, because a hydrogen is abstracted on binding with zinc. The histidine N81 also has a negative partial charge on binding with zinc. In this model, the cysteine Sys in L the four-ligand case were assigned partial charges of —0.275,andcharges of I —0.450wereassigned in the three-ligand case. The histidine N bound to the MQK zinc was assigned a partial charge of —0.670. Distances were constrained in a several-step minimization (in vacuo) until

Fig. 1. Schematic of the BRCA1 zinc ring domain with cysteine and histidines ligated the ligand-zinc distance criteria were met. This procedure minimized energy to zinc indicated. strain and prevented bumping and overlap of atoms. The average zinc ligand distance in the constrained minimized structure is 2. 1 A. The structure was then reminimized with no constraints. The model was considered acceptable PML. The sequence alignment by Zhou et a!. (26) was not designed as a when reasonable zinc ligand distances could be maintained after removal of structural alignment and does not consider the EHV sequence in the alignment. constraints and reminimization. Pairwise sequence alignments were performed using the Genetics Computer Molecular Dynamics. Moleculardynamicssimulationswereperfonnedto Group Wisconsin Sequence Analysis package, version 8, Bestflt alignment test the stability of the zinc ring BRCA1 model structure. The structure algorithm of Smith and Waterman (16). The Pileup (24, 25) alignment algo obtained from constrained minimization was solvated in a s-A layer of water rithm was used for multiple sequence alignments. and subjected to 140 ps of molecular dynamics. No distances were constrained The EHV-l (Brookhaven Data Bank entry file, 1CHC) zinc ring domain during the molecular dynamics. The molecular dynamics simulation (using the was aligned with that of the BRCA1 protein as described in Fig. 2. Residues Biosym CVFF) was performed with nonperiodic boundary conditions using 1—67ofthe 1CHCprotein aligned with residues 18—86ofthe BRCAI protein. the cell multipole method (fourth-order multipole) for treatment of nonbonded Alignment of the cysteine and histidine residues that form the stabilizing Coulombic and van der Waals interactions. A distance-dependent dielectric ligand-zinc interactions is maintained in both proteins. Secondary structural constant was used (28). The structure was initially reminimized with solvent, elements in the EHV are shown above the sequence. The EHV possesses an and dynamics were performed beginning with the energy-minimized, solvated antiparallel 13sheet that begins at the amino terminus, followed by an a helix structure. The dynamics step size was I fs and structures, were saved along the and a final single (3sheet. Fig. 3 is a graphical sketch of the EHV, illustrating trajectory every 2 ps. Calculations were performed on a Silicon Graphics the secondary structural elements and their relationship to the zinc ions and Indigo2 (MIPS R4400; 150-MHz central processing unit) computer. ligands. Homology Modeling. A three-dimensional structural model was devel RESULTS o_ for the amino terminus (residues 18—86)ofBRCA1 constituting the zinc ring domain using homology modeling. The coordinates of the average NMR Structural differences have been identified between the PML and structure (mean average over 40 NMR structures) of the EHV (entry file, the EHV NMR-determined zinc ring structures (11). The PML struc 1CHC; Brookhaven Protein Data Bank) served as the basis structurefor ture is composed of a @3strandfollowed by two antiparallel @3strands, @ homology modeling. The coordinates of the aligned secondary structural one turn of a helix, and a fourth f3 strand. The EHV structure elements were applied to the BRCA1 sequence. The secondary structural consists of two antiparallel @3strands followed by an a helix and a elements proposed for BRCA1 correspond to fJ sheet 1 (residues 34—38),@3 third @3sheet. Although EHV and PML structural differences have sheet 2 (residues 41—45), a helix (residues 46—54),and f3 sheet 3 (residues been used to support the idea that the ring motif is structurally diverse, 71—76). There were two insertions in the BRCA1 structure, residues 58—60 and 69. Therefore, it was necessary to insert protein loops. Homology mod two-layer a/;3 proteins such as EHV and f3proteins such as PML have cling was performed using the Biosym Technologies homology module, ver been shown to have overall common folds (29). Both PML and EHV sion 2.3.5. BRCA1-inserted loops were modeled by beginning with the loop structures possess major conserved features: the “cross-brace―Zn2@ search procedure of the homology modeling. Coordinates were suggested for binding arrangement; the topology of the central antiparallel @3 the protein loops, and the lowest energy loop structures were selected. The strands; and packing of the conserved residues in the hydrophobic structure developed was energy minimized after splicing the loops to resolve core (11). In both structures, the 6-nitrogen of the histidine is ligated discontinuities between the loops and a helix and @3sheet structures and to the zinc in place of the more common c-nitrogen. The C@RMS overlaps of side chains using steepest descents and conjugate gradient conver deviation between the two structures for residues comprising the first gence algorithms. zinc-binding site (Cys24, Cys27, Cys44, and Cys47) is 1.93 A, Calculations:EnergyMinimization.Calculationswereperformedusing indicating that this binding site is somewhat similar in both structures. the Biosym Technologies Discover version 94.0 program, using the CVFF modified to include nonbonded parameters for zinc. Only nonbonded zinc The majority of structural differences between PML and EHV occur interactions were considered. Parameters for the Lennard-Jones constant [po between the central (3strands and second zinc-binding sites, where the tential distance (Rij), 2.2 A; potential well (a), 0.0125 kcal/mol] were adapted sequences of the two ring-domains are dissimilar. from those used for carbonic anhydrase II (27). These values were used to Several subfamilies are beginning to arise within the zinc ring calculate the A and B constants required for the CVFF. protein family. Although the zinc ring occurs near the amino-terminus 2540

Downloaded from cancerres.aacrjournals.org on September 30, 2021. © 1996 American Association for Cancer Research. MOLECULAR MODELING OF BRCA1 AMINO-TERMINALZINC RING DOMAIN

region of greatest structural dissimilarity

t—@1l I@2—Ilahelixl I—f@3—I BRCA1 18 MQKILE ..CPICLELIKE .PVSTKCDHIFCKFCMLKLLNQKKGPSQCPLCKN ..DITKRSLQESTRFSQLVEEL 86 @ :1 111111 .:.:..IIIJ.l::::.l... liii EHV 1 MATVAE .RCPICLEDPSNYSMALPCLHAFCYVCITRWIRQNPT . . .CPLCKV . .PV .ESWHTIESDSEFGDQL 67 11.1.:: :..JlIII.:I.I:...I 11:1.: I: :: I PML 49 EEEFQFLRCQQCQAEAK CPKLLPCLHTLCSGCLEASGMQ CPICQAPWPL.GADTPALDNVFFESLQR 114 I-f311 1J321 1f331 ———i I3i@I I@4I @ (helical turn) helix)

Fig. 2. Sequence alignment of EHV, PML, and BRCAI zinc ring domains illustrating secondary structural elements and the region of greatest structural dissimilarity. This alignment was used as the basis for developing a homology model for the BRCAI zinc ring domain. BRCA1 possess 54% sequence similarity and 38% sequence identity with the EHV zinc ring domain. BRCAI possessed less sequence similarity, 42.8%, and less sequence identity, 30.6%, with the PML zinc ring domain than with EHV, as shown in the alignment. Multiple sequence alignment of the BRCA1, EHV, and PML zinc ring domains yields pairwise similarity scores of 0.44 for BRCA1 and EHV, 0.37 for BRCAI and PML, and 0.42 for EHV and PML.

ring finger protein and the HSV type 1 ring domain are not function ally interchangeable. (HSV is required for activation of viral gene expression.) Neither a chimeric HSV protein that contained the PML ring finger nor PML was able to activate gene expression in assays (31). This work suggests that ring finger domains do not possess a common functionality. Different subcategories of zinc ring proteins may reflect structural classes of zinc ring domains with differing functionality. The final minimized BRCA1 zinc ring domain model is shown in Fig. 4 with cysteine and histidine zinc ligands. Superimposing the two minimized BRCA1 structures before and after removing the distance constraints, the RMS deviation is 0.02 A (all atoms), and the backbone

D67

@o6o

Fig. 3. Graphical sketch of the EHV illustrating secondary structural elements from the solved NMR structure. The zinc ring domain is a two-layer a/(3 protein. which from the @ NH2 terminus possesses an antiparallel sheet followed by an a helix, a large loop, and @ a final single sheet.

in most protein family members, there are several proteins for which the zinc ring occurs near the middle of the sequence (SINA, PE38, and Zn blndkg sits I XNF7) or near the carboxyl terminus (CRAF1, PAS4, CBL, PSC, RAD16, RAG-i, and PAF-l). Several zinc ring domain proteins have ATPase motifs (HIP116, RAD16, and RAD5). One ring protein, MSL-2 (male specific lethal-2) has a metallothionein domain in addition to the zinc ring domain (26). PML belongs to a subfamily of ring proteins that have been shown to possess another cysteine bistidine motif, identified as a “B-box―followingthe zinc ring domain Fig.4. BRCAImodelhighlightingresiduesproposedtoparticipateinBRCA1DNA (30). PML has two B-box domains followed by an a helical, coiled- bindingandprotein-proteininteractions.Residuespredictedtoparticipateinprotein @ coil domain downstream from the zinc ring domain (1 1). The B-box proteininteractions(Lys3t,GInte, Asp67)areshowninred.Residuespredictedtobe ...... involved in DNA binding (GlnM and Lys5°) are shown in pink. Zinc ions are shown in motif and additional a helical, coiled-coil domains are absent in @eenCysligands(tozinc)areshownin yellow,andHisligands(tozinc)areshownin BRCA1 and EHV. It has recently been demonstrated that the PML blue.TheCys6'andCysMmissensemutationsitesarelocatedintheZn2 bindingsite. 2541

Downloaded from cancerres.aacrjournals.org on September 30, 2021. © 1996 American Association for Cancer Research. MOLECULAR MODELING OF BRCA1 AMINO-TERMINALZINC RING DOMAIN

Table 1 RMS deviationaverageNMR between secondary structural elements of the EHV with the structure. A value of —0.4is approximately equivalent to an structureComparison structure and minimized BRC'AI model X-ray crystal structure with a resolution of 2 A (33). NMRstructure of each of the secondary structural elements in the EHV average eachsecondarywith the minimized BRCA1 model structure. By individually comparing WHAT IF checks interatomic distances, atomic packing, torsion antiparallel13structural element of the two molecules, it is possible to see that the and side chain angles, and unsatisfied hydrogen bond donors and duetosheetis wherethetwostructuresexhibitthegreatestdifferenceinstructure.Thisis acceptors and gives a measure of the overall quality of the structure. @ differsfromthe cis-Pro in the @3turnbetween sheets 1and 2 in the EHV structure. This turn @ the more commonly found type I turn in the BRCA1 structure.No. The WHAT IF overall quality score for this BRCA1 model structure was —2.8. backboneSecondary of structural RMS deviation atoms superimposed The EHV structure on which this BRCA1 model was based gave a 0)f3sheet element (A) (N, Ca, C, and WHAT IF quality score of —2.60.Asis reasonable, the overall quality 40sheet1 1.46 and validity of this model appears comparable to that of the NMR 32Antiparallel2 0.66 structure on which it was based. 72a f3 sheets ((3 sheets 1 and 2) 1.60 72@3sheet3helix 0.69 Molecular Dynamics: Testing the Solvated Structure. Fig. 6 is a 4013 0.71 plot of the backbone RMS deviation versus time across the 140-ps 16813sheets1and2—ahelix 1.81 sheets1—3-ahelix 1.85 216 trajectory. The backbone RMS deviation of the dynamics structure compared with the starting structure equilibrated between 110 and 140 PS to an average value of 1.48 A. The backbone RMS deviation RMS deviation is 0.018 A (N, Ca, C, and 0). This demonstrates that between the starting and ending structure was 1.65 A. A backbone the zinc ligand distances are maintained without constraints, and that superposition of all seventy structures saved along the molecular the structure does not change significantly when the distance restraints dynamics trajectory shown in Fig. 7 illustrates that there are no major are removed. structural reorganizations along the trajectory. The zinc ligand dis Superimposing the minimized BRCA1 model structure with the tances are maintained within tolerance (±0.30 A) along the 140 ps. EHV structure yields a total backbone RMS deviation of 3.0 A. This Overall, the structure is remarkably stable. includes the protein loops, where there is the greatest difference The backbone RMS deviation between the BRCA1 final average between the two structures. A backbone (N, Ca, C, and 0) superpo molecular dynamics structure and the EHV structure is 3.1 A, which sition of the separate secondary structural elements reveals much includes protein loops in the structure. Superposition of the separate greater similarity in the structures (as shown in Table 1). secondary structural elements of these structures, as with the mini Fig. 5 shows a superposition of the minimized BRCA1 model mized BRCA1 structure, also reveals much greater similarity in the structure and EHV average NMR structure. The loop regions of the structures (as shown in Table 2). modeled protein are the most difficult to define, because they exhibit Fig. 5 shows a superposition of the final average molecular dynam the greatest flexibility. The smallest RMS deviations between the ics BRCA1 model structure with the minimized BRCA1 model struc EHV structure and BRCA1 model are in regions of the protein that contain a single, well-defined secondary structure (a helix or single f3 sheet). The greatest difference between the secondary structural dc ments of the minimized BRCAI structure and the EHV structure is in the antiparallel f3 sheet. The @3turn between the two antiparallel (3 sheets differs in both structures. In the EHV structure, this turn (Ala21, Leu22, Pro23, and Cys24) contains a cis-Pro in the i + 2 position and has the characteristic ij and 4 angles of a type V lb (3turn (32). In the BRCA1 model, this turn does not contain a cis-Pro (Thr@7,Lys3t, Cys39, and Asp40) and has the characteristics of the more commonly found type I (3turn. In the loop region ofthe protein [which is between the a helix and the final (3 sheet ((3 sheet 3), consisting of residues 55—71in BRCA1I, the BRCA1 model contains two cis-Pro residues (cis-Pro58 and cis-Pro62), the corresponding prolines in the EHV structure are both trans. These prolines must be cis in the BRCA1 structure to accommodate the zinc ligand distances in the structure. The accuracy of the model was validated by subjecting the model structure to PROCHECK version 3.0 (33), and WHAT IF version 4.99 (34). PROCHECK assesses the stereochemical quality of the struc ture, nonbonded contacts within the structure, side-chain conforma tions, bond lengths and angles, and peptide bond planarity (w) and reports the deviations from values in well-refined structures. In the BRCA1 model structure, all 4 and i@,angles fell within allowed regions of the Ramachandran plot. All the main chain parameters (the peptide bond planarity, number of bad nonbonded interactions, hy drogen bond energies, and a-carbon tetrahedral distortion) were all within acceptable limits from ideal values (in well-refined structures). All of the side chain parameters (x' gauche minus and plus and X' trans- and f trans-angle values) were comparable to those for high resolution structures. The overall G-factor value given by PRO CHECK for this structure was —0.4.Ideally, scores above (less negative than) —0.5 indicate good-quality structures, and values Fig. 5. Superposition of the BRCAI minimized model (yellow), final average molec greater than (more negative than) —1.0 are indicative of problems ular dynamics BRCA1 model structure (blue), and EHV average NMR structure (red). 2542

Downloaded from cancerres.aacrjournals.org on September 30, 2021. © 1996 American Association for Cancer Research. MOLECULAR MODELING OF BRCAI AMINO-TERMINALZINC RING DOMAIN

Backbone RMSdeviation vs. time

2 @ 1.8 @ 1.6 1.4 Fig. 6. Plot of the BRCA1 backbone RMS deviation versus 1.2 time for the 140-ps molecular dynamics trajectory. The average structure (110—140ps) has a backbone RMS to the starting ! 0.8 structure of 1.48 A. @ 0.6 @ 0.4 @ 0.2 0

Tim•(p.)

nonspecifically to DNA-cellulose (10, 35). Several zinc ring proteins (i.e., ICPO, RO-52K, and PML) have been shown to exhibit nuclear localization (13). It has been suggested that PML is involved in promoting protein-protein interactions that stabilize PML nuclear body formation (11). Other zinc finger motifs in addition to the ring domain that possess two independent zinc-binding sites are the LIM domain and the CRD (36). The CRD domain has interleaved zinc ligands, and its recently solved NMR structure revealed the CRD to have a fold similar to that of the ring domain. Both domains consist of a three-stranded (3 sheet and an a helix. The CRD is in the protein kinase C regulatory domain that is believed to contain the binding sites for diacylglycerol and phorbol esters and to be critical for regulation of kinase activity. The role that the CRD plays, however, is unknown (37). Many of the proteins containing the LIM domain are transcription factors (the name “LIM―isderived from the three transcription factors in which it was first observed: Lin-il, Isl-l, and Mec-3. However, like the ring domain, the LIM domain has not been shown to have biologically relevant DNA-binding activity. Recent studies (38) have demonstrated a specific protein-protein interaction between two LIM domain proteins, zyxin and CRP. Therefore, it is possible that the ring Fig. 7. Backbone superposition of all 70 structures along the molecular dynamics trajectory. domain may function similarly to the LIM domain, and that protein protein interactions in which the ring domain participates may play a role in gene expression regulation. Both the LIM and ring domains are tare and EHV average NMR structure. The superposition of the bipartite structures with two zinc-binding domains; therefore, it is average molecular dynamics and minimized BRCA1 model structures possible that they may function similarly to the glucocorticoid recep illustrates that the major region of flexibility in the BRCA1 model tor, in which one zinc site is required for nucleic acid binding, and the structure is the loop region between the a helix and (3sheet 3 (residues other mediates protein-protein interaction, so that the receptor forms 55—71).The interleaved zinc ligand structure of the ring domain is a homodimer. This may be the case, because the HSV Vmwl 10 zinc often compared in its ability to stabilize a structure to the cross-linked ring protein has been suggested to be a multimer in solution (35). disulfide bond conformation. This is demonstrated here by the small If BRCA1 functions as a moderator of protein-protein interactions, changes exhibited by the BRCA1 model structure along this molec it may act similarly to the CRD or LIM domain. In the CRD, residues ular dynamics trajectory. (Coordinates for the minimized and average molecular dynamics BRCA1 model structures are available by con tacting the authors.) TableaverageNMR 2 RMS deviation between secondary structural elements of the EHV structureComparisonstructure and the BRCAI final average molecular dynamics NMRstructure of each of the secondary structural elements in the EHV average DISCUSSION eachsecondarywith the BRCA1 final average molecular dynamics structure. By comparing antiparallel13structural element of the two molecules it is possible to see that the A specific common functionality for the zinc ring domain has not exhibitthesheet,asin theminimizedBRCA1modelstructure,iswherethetwostructures been identified to date. The domain appears to interact with DNA greatest difference in structure.No. either through direct binding or indirectly by mediating protein backboneSecondary of structural RMS deviation atoms superimposed protein interactions. No member of the zinc ring domain family has 0)p element (A) (N, C@,C, and been shown to bind DNA in a sequence-dependent manner. Also, the sheet140sheet 1.35 EHV zinc ring domain did not bind to DNA or RNA under assay 32Antiparallel2 0.59 conditions (9). However, a synthetic zinc ring peptide was shown to 72a f3 sheets (@ sheets 1 and 2) 1.38 7213helix 0.60 bind to a subset of randomized DNA sequences, and the DNA-protein 4013sheet3 0.96 complex formation was zinc dependent (6). Likewise, the EHV, 16813sheetsIand2—ahelix 1.87 Mel-18, and XNF7 zinc ring domains have been shown to bind sheets1—3--ahelix 1.95 216 2543

Downloaded from cancerres.aacrjournals.org on September 30, 2021. © 1996 American Association for Cancer Research. MOLECULARMODELINGOF BRCAI AMINO-TERMINALZINCRING DOMAIN that are part of the central antiparallel (3 sheet and a helix play a currently linked to breast and/or ovarian cancer. If the BRCA1 zinc significant role in protein binding (37). Although the BRCA1 zinc ring structure is ultimately shown to bind DNA in a manner similar to ring domain possesses only 15% sequence identity and 31% sequence other zinc finger transcription factors, mutations at these positions similarity with CRD, in an alignment by Smith and Waterman (16), may inhibit binding to DNA. Molecular modeling of interactions the central four BRCA1 zinc cysteine ligands, Cys@, Cys47, Cys61, between BRCA1 and DNA will be useful in the design of mutants and Cys@, align with the CRD zinc cysteine ligands Cys―5 Cys'18 aimed at understanding BRCA1 function. Cys'32, and Cys'35. This corresponds to the central antiparallel (3 Several predisposing BRCA1 mutations linked to breast and ovar sheet region of the molecule. Cys'32 and Cys'35 are required for the ian cancer have been identified as occurring in the zinc ring domain. phorbol ester affinity of CRD. Val'@, also required for phorbol ester One of the original BRCA1 mutations (kindred 1901) contains a binding of CRD, is replaced by Leu73 in BRCA1, an aliphatic residue frameshift in exon 2 (l88delll; Ref. 1), which essentially removes the with similar chemical properties. A group of CRD conserved residues zinc ring domain from the protein. In addition, the most common (Tyr'°@,Phe'14 Lys131, and Asn'38) have been identified (37) as BRCA1 mutation seen to date is l85delAG (C24ST0P39 in codon 24; being positioned for interactions with protein kinase C activators. In Ref. 45), which essentially disrupts the BRCAJ gene product precisely BRCA1, Tyr'°@is replaced by Lys38, another ionizable residue; at the first residue of the C3HC4 domain. Three frequent missense 14 is conserved; Lys13' is replaced by Gln@°; and Asn138 is mutations, Cys61Gly, Cys@Gly, and Cys@Tyr; Ref. 45) would also replaced by Asp67. Asn can potentially deamidate to Asp, and these disrupt the zinc ring domain structure. The zinc ion in proteins residues are structurally similar. The Lys38, Gln°°,andAsp67 residues interacts strongly with several ligands, the sulfur in Cys, the nitrogen are all on the surface of the BRCA1 zinc ring model and have side in His, and the oxygen in Asp, Glu, and water. If the Cys residues chains with extended orientations favorable for interaction with other were mutated to Gly or Tyr, the zinc ion coordination, no longer proteins (Fig. 4). The Phe43 residue is buried in the BRCA1 model. tetrahedral, would be unstable. The zinc ion would either be lost from These predicted surface residues in BRCA1 (Lys38, Gln@°,andAsp67) the structure, and the protein would assume a different fold, or water may participate in protein-protein interactions if the mechanism of would serve as the fourth ligand for zinc, altering the conformation of BRCA1 zinc ring domain protein interaction is similar to that of the the protein. Because the correct protein fold would no longer be CRD. Mutations of these residues would probe the BRCA1 zinc ring maintained, the mutated zinc ring would no longer interact with DNA domain functionality as a mediator of protein-protein interactions. and/or protein in the same manner. Studies are under way to ascertain A variety of zinc finger crystal structures [zif268 (mouse immediate whether the zinc ring domain of BRCA1 can bind DNA in a sequence early protein), human glioblastoma, Gal4 (yeast transcriptional acti specific manner, or whether BRCA1 protein-protein interactions are vator), WT1 , and estrogen and glucocorticoid receptors] have shown of biological significance. interactions between the protein zinc finger a helix and the DNA major groove (39—42). Although there are significant differences ACKNOWLEDGMENTS between the zinc ring and zinc finger structures, a comparison be tween the EHV zinc ring and the classic TFIIIA-type ring finger (9) We are grateful to Howard Smith for his assistance and support with showed some structural similarity. Backbone atom superposition of computer graphics. the human enhancer-binding protein TFIIIA-type zinc finger and the EHV zinc ring finger yielded an RMS deviation of 2.0 A (9). The first REFERENCES two (3 strands and the a helix of EHV overlapped well in length and orientation with those of the human enhancer-binding protein TFHIA I. Mild, Y., Sensen, J., Shattuck-Eidens, D., Futreal, P. A., Harshman, K., Tavtigian, S., Liu, Q., Cochran, C., Bennett, L. M., Ding, W., Bell, R., Rosenthal, J., Hussey, C., type zinc finger. Hydrophobic core residues and positively charged Tran, T., McClure, M., Frye, C., Hattier, T., Phelps, R., Haugen-Strano, A., Katcher, residues on the outer surface of the a helix also superimposed in the H., Yakumo, K., Gholami, Z., Shaffer, D., Stone, S., Bayer, S., Wray, C., Bogden, R., two structures. It has been shown in zinc finger domain proteins that Dayananth, P., Ward, J., Tonin, P., Narod, S., Bristow, P. K., Norris, F. H., Helvering, L.. Morrison, P., Rosteck, P., Lai, M., Barrett, J. C., Lewis, C., Neuhausen, S., the a helix of the zinc finger domain interacts with DNA, usually via Cannon-Aibright, L., Golgar, D., Wiseman, R., Kamb, A., and Skolnick, M. H. A hydrogen-bonding between the amino acid side chains of the protein strong candidate for the breast and ovarian cancer susceptibility gene BRCAI. Science and guanine DNA bases (39—42). The tumor suppressor gene WT1, (Washington DC). 266: 66—71,1994. 2. Futreal, P. A., Liu, Q., Shattuck-Eidens, D., Cochran, C., Harshman, K., Tavtigian, S., which encodes a transcription factor, contains a zinc finger domain in Bennett, L. M., Haugen-Strano, A., Swensen, J., Mild, Y., Eddingion, K., McClure, which many mutations linked to Wilms' tumors occur (43). WT1 M., Frye, C., Weaver-Feldhaus, J., Ding, W., Gholami, Z., Soderkvist, P., Terry, L., Jhanwar, S., Berchuck, A., Iglehart, J. D., Marks, J., Ballinger, D. G., Barren, J. C., possesses four zinc fingers and binds to guanine-rich DNA sequences. Skolnick, M. H., Kamb, A., and Wiseman, R. BRCAJ mutations in primary breast and Cancer-linked mutations in WT1 have been shown to change zinc ovarian carcinomas. Science (Washington DC), 266: 120—122,1994. finger DNA-binding properties (44). 3. Narod, S. A., Ford, D., Devukee, P., Barkardottir, R. B., Lynch, H. T., Smith, S. A., Ponder, B. A. J., Weber, B. L., Gather, J. E., Birch, J. M., Cornelis, R. S., Kelsell, We can extend our model to propose that the a helix of the BRCA1 D. P., Spurr, N. K., Smyth, E., Haites, N., Sobol, H., Bignon, Y. J., Change-Claude, zinc finger could bind in the major groove of B-DNA. Mutations of J., Hamann, U., Lindblom, A., Borg, A., Piver, M. S., Gallion, H. H., Struewing, J. P., polar residues in the a helix of the BRCA1 zinc ring domain presum Whittemore, A., Tonin, P., Goldgar, D. E., and Easton, D. F. An evaluation of genetic heterogeneity in 145 breast-ovarian cancer families. Am. J. Hum. Genet., 56: 254— ably could interfere with proposed protein-DNA binding. The a helix 264, 1995. of the BRCA1 zinc ring domain consists of Cys@, Lys45, Phe46, 4. Easton, D. F., Bishop. D. T., Ford, D., and Crockford, G. P. Genetic linkage analysis in familial breast and ovarian cancer. Results from 214 families. Am. J. Hum Genet., Cys47, Met48, Leu49, Lys50, Le5' , Leu52, Asn53, and Gln54. This a 52: 678—701,1993. helix contains two amino acids with ionizable side chains, Lys45 and 5. Von Arnim, A. G., and Deng, X. W. Ring finger motif ofArabidopsis thaliana COP1 Lys50, and two amino acids with hydrophilic side chains, Asn53 and defines a new class ofzinc-binding domain. J. Biol. Chem., 268: 19626—19631,1993. 6. Lovering, R., Hanson, I. M., Borden, K. L B., Martin, S., O'Reilly, N. J., Evan, G. I., Gin54. The residues that would be involved in DNA binding, based on Rahman, D., Pappin, D. J. C., Trowsdale, J., and Freemont, P. S. Identification and the assumption that the BRCA1 Zinc ring domain a helix inserts in the preliminary characterization of a protein motif related to the zinc finger. Proc. Nail. DNA major groove, are G1n54 and Lys5°(hydrophilic residues with Aced. Sci.USA, 90: 2112—2116,1993. 7. Freemont. P. S., Hanson, I. M., and Trowsdale, J. A novel cysteine-rich sequence side chains that extend toward the surface; Fig. 4). If BRCA1 follows motif. Cell, 64: 483—484,1991. other zinc finger DNA-binding proteins, the side chains of these 8. Everett, R. D., Barlow, P. N., Milner, A., Luisi, B., Orr, A., Hope, G., and Lyon, D. hydrophilic residues would hydrogen bond with the DNA guanine. A novel arrangement of zinc-binding residues and secondary structure in the C3HC4 motif of an a herpes virus protein family. J. Mol. Biol., 234: 1038—1047,1993. Neither of these positions, however, corresponds to BRCA1 mutations 9. Barlow, P. N., Luisi, B., Milner, A., Elliot, M., and Everett, R. D. Structure of the 2544

Downloaded from cancerres.aacrjournals.org on September 30, 2021. © 1996 American Association for Cancer Research. MOLECULARMODELINGOF BRCAI AMINO-TERMINALZINCRING DOMAIN

C3HC4 domain by ‘H-nuclearmagneticresonance spectroscopy. J. Mol. Biol., 237: 28. Guenot, J., and Kollman, P. A. Conformational and energetic effects of truncating 201—211,1994. nonbonded interactions in an aqueous protein dynamics simulation. J. Comp. Chem., 10. Tagawa, M., Sakamoto, T., Shigemoto, K., Matsubara, H., Tamura, Y., Ito, T., 14:293—311,1993. Nakamura, I., Okitsu, A., Imai, K., and Taniguchi, M. Expression of novel DNA 29. Efimov, A. V. Structural similarity between two-layer a/f3 and @3-proteins.J. Mol. bindingproteinwithzincfingerstructureinvarioustumorcells.J. Biol.Chem.,265: Biol., 245: 402—415,1995. 20021—20026, 1990. 30. Reddy, B. A., and Etkin, L. D. A unique bipartite cysteine-histidine motif defines a 11. Borden, K. L. B., Boddy, M. N., Lally, J., O'Reilly, N. J., Martin, S., Howe, K., subfamily of potential zinc-finger proteins. Nucleic Acids Res., 19: 6330, 1991. Solomon, E., and Freemont, P. S. The solution structure of the RING finger domain 31. Everett, R. D., Maul, G. G., Orr, A., and Elliott, M. The cellular ring Finger protein from the acute promyelocytic leukemia proto-oncoprotein PML. EMBO J., 14: PML is not a functional counterpart of the herpes simplex virus type 1 ring finger 1532—1541,1995. protein Vmwl 10. J. Gen. Virol., 76: 791—798,1995. 12. Petty, L J., Rixon, F. J., Everett, R. D., Frame, M. C., and McGeoch, D. J. 32. Wilmont, C. M., and Thornton, J. M. Analysis and prediction of the different types of Characterization ofthe JEIJO gene of herpes simplex virus type I. J. Gen. Virol., 67: p3-turn in proteins. J. Mol. Biol., 203: 221—232,1988. 2365—2380,1986. 33. Laskowski, R. A., MacArthur, M. W., Moss, D. S., and Thornton, J. M. PROCHECK: 13. Koken, M., Linares-Cruz, G., Quignon, F., Viron, A., Chelbi-Alix, M., Sobczak a program to check the stereochemical quality of protein structures. J. AppI. Cryst., Thepot, 3., Juhlin, L., Degos, L., Calvo, F., and de The, H. The PML growth 26: 283—291,1993. suppressor has an altered expression in human oncogenesis. Oncogene. 10: 1315— 34. Vriend, G., and Sander, C. Quality control of protein models. Directional atomic 1324,1995. contact analysis. J. AppI. Cryst., 26: 47—60,1993. 14. Goldgar, D. E., and Reilly, P. R. A common BRCAJ mutation in the Ashkenazim. 35. Everett, R. D., Orr, A., and Elliott, M. High level expression and purification of Nat. Genet., 11: 113—114,1995. herpes simplex virus type I immediate early polypeptide Vmwl 10. Nucleic Acids 15. Struewing, J. P., Abeliovich, D., Peretz, T., Avishai, N., Kaback, M. M., Collins, Res., 19: 6155—6161, 1991. F. S., and Brody, L. C. The carrier frequency of the BRCAJ l85delAG mutation is 36. Schwabe, J. W. R., and Klug, A. Zinc mining for protein domains. Nature Struct. approximately 1% in Ashkenazi Jewish individuals. Nat. Genet.. 11: 198—200,1995. Biol., /: 345—349,1994. 16. Smith, T. F., and Waterman, M. S. Comparison of biosequences. Adv. Appl. Math., 37. Hommel, U., Zurini, M., and Luyten, M. Solution structure of the cysteine rich 2:482—489,1981. domain of rat protein kinase C. Nature Struct. Biol., 1: 383—387,1994. 17. Altschul, S. F., Gish, W., Miller, W., Myers, E., and Lipman, D. J. Basic local alignmentsearchtool.J. Mol.Biol.,215:403—410,1990. 38. Schmeichel, K. L., and Beckerle, M. C. The LIM domain is a modular protein-binding interface. Cell, 79: 211—219,1994. 18. Luthy, R., McLachlan, A. D., and Eisenberg, D. Secondary structure-based profiles. 39. Suzuki, M. Common features in DNA recognition helices of eukaryotic transcription Use of structure-conserving scoring tables in searching protein sequence databases for factors. EMBO J., 12: 3221—3226, 1993. structural similarities. Proteins Struct. Funct. Genet., 10: 229—239, 1991. 40. Suzuki, M. A. Framework for the DNA-protein recognition code of the probe helix in 19. Kyte, J., and Doolittle, R. F. A simple method for displaying the hydropathic character of a protein. J. Mol. Biol., 157: 105—132,1982. transcription factors: the chemical and stereochemical rules. Structure, 2: 317—326, 20. Emini, E. A., Hughes, J. V., Perlow, D. S., and Boger, 3. Induction of hepatitis A 1994. virus-neutralizing antibody by a virus-specific synthetic peptide. J. Virol., 55: 836— 41. Pavletich, N., and Pabo, C. 0. Zinc finger-DNA recognition: crystal structure of a 839, 1985. Zif268-DNA complex at 2.1 A. Science (Washington DC), 252: 809—817,1991. 21. Chou, P. Y., and Fasman, G. D. Prediction ofthe secondary structure ofproteins from 42. Pavletich, N., and Pabo, C. 0. Crystal structure of a five-finger GLI-DNA complex: their amino acid sequence. Adv. Enzymol., 47: 45—148,1978. new perspectives on zinc fingers. Science (Washington DC), 261: 1701—1707.1993. 22. Gamier, J., Osguthorpe, D. J., and Robson, B. Analysis of the accuracy and impli 43. Bickmore, W. A., Oghene, K., Little, M. H., Seawright, A., van Heyningen. W., cationsofsimplemethodsforpredictingthesecondarystructureofglobularproteins. and Hastie, N. D. Modulation of DNA binding specificity by alternative splicing J. Mol. Biol., 120: 97—120,1978. ofthe Wilms tumor wtl gene transcript. Science (Washington DC), 257: 235—237, 23. Jameson, B. A., and Wolf, H. The antigenic index: a novel algorithm for predicting 1992. antigenicdeterminants.Comput.Appi.Biosci.,4: 181—186,1988. 44. Little, M. H., Prosser, J., Condie, A., Smith, P. J., Heyningen, V. V., and Hastie, N. D. 24. Feng, D. F., and Doolittle, R. F. Progressive sequence alignment as a prerequisite to Zinc finger point mutations within the WI'! gene in Wilms tumor patients. Proc. NatI. correct phylogenetic trees. J. Mol. Evol., 25: 351—360,1987. Acad. Sci. USA, 89: 4791—4795, 1992. 25. Higgins, D. G., and Sharp, P. M. Fast and sensitive multiple sequence alignments on 45. Shattuck-Eidens, D., McClure, M., Simard, J., Labne, F., Narod, S., Couch, F., a microcomputer. Comput. AppI. Biosci., 5: 151—153,1989. Hoskins, K., Weber, B., Castilla, L., Erdos, M., Brody, L., Friedman, L., Ostermeyer, 26. Zhou, S., Yang, Y., Scott, M. J., Pannuti, A., Fehr, K. C, Eisen, A., Koonin, E. V.. E., Szabo, C., King, M. C., Jhanwar, S., Offit, K., Norton, L., Gilewski, T., Lubin, M., Fouts, D. L., Wrightsman, R., Manning, J. E., and Lucchesi, J. C. Male-specific lethal Osborne, M., Black. D., Boyd, M., Steel, M., Ingles, S., Haile, R., Lindblom, A., 2, a dosage compensation gene of Drosophila. undergoes sex-specific regulation and Olsson, H., Borg, A., Bishop, D. T., Solomon, E., Radice, P., Spatti, G., Gayther, S., encodesa proteinwitha RINGfingerand a metallothionein-likecysteinecluster. Ponder, B., Warren, W., Stratton, M., Liu, Q., Fujimura, F., Lewis, C., Skolnick, EMBO J., 14: 2884—2895,1995. M. H., and Goldgar, D. E. A collaborative survey of 80 mutations in the BRCAJ breast 27. Merz, K. M. CO2 Binding to human carbonic anhydrase H. J. Am. Chem. Soc., 1/3: and ovarian cancer susceptibility gene. Implications for presymptomatic testing and 406—411,1991. screening. JAMA. 15: 535—541,1995.

2545

Downloaded from cancerres.aacrjournals.org on September 30, 2021. © 1996 American Association for Cancer Research. Molecular Modeling of the Amino-Terminal Zinc Ring Domain of BRCA1

Rachelle J. Bienstock, Tom Darden, Roger Wiseman, et al.

Cancer Res 1996;56:2539-2545.

Updated version Access the most recent version of this article at: http://cancerres.aacrjournals.org/content/56/11/2539

E-mail alerts Sign up to receive free email-alerts related to this article or journal.

Reprints and To order reprints of this article or to subscribe to the journal, contact the AACR Publications Subscriptions Department at [email protected].

Permissions To request permission to re-use all or part of this article, use this link http://cancerres.aacrjournals.org/content/56/11/2539. Click on "Request Permissions" which will take you to the Copyright Clearance Center's (CCC) Rightslink site.

Downloaded from cancerres.aacrjournals.org on September 30, 2021. © 1996 American Association for Cancer Research.