<<

Analysis of the structural consensus of the coordination centers of structures ⁎ Kirti Patel, Anil Kumar, Susheel Durani

Department of Chemistry, Indian Institute of Technology Bombay, Mumbai-400076, India

Abstract

In a recent sequence-analysis study it was concluded that up to 10% of the human could be comprised of zinc , quite varied in the functional spread. The native structures of only few of the proteins are actually established. The elucidation of rest of the sequences of not just human but even other actively investigated genomes may benefit from knowledge of the structural consensus of the zinc-binding centers of the currently known zinc proteins. Nearly four hundred X-ray and NMR structures in the database of zinc– structures available as of April 2007 were investigated for geometry and conformation in the zinc-binding centers; separately for the structural and catalytic proteins and individually in the zinc centers coordinated to three and four amino-acid . Enhanced involvement in agreement with the observation in human proteome has been detected in contrast with previous reports. Deviations from ideal coordination geometries are detected, possible underlying reasons are investigated, and correlations of geometry and conformation in zinc-coordination centers with protein function are established, providing possible benchmarks for putative zinc-binding patterns of the burgeoning genome data.

Keywords: Metalloprotein; Zinc–protein; Zinc binding pattern; Zinc coordination; Structural zinc-center; Catalytic zinc-center

1. Introduction processes, like, transcription, translation, and [3–8]. Based on homology based searches of human genome, Andreini The second most abundant transition element in biological et al. [9] concluded that ∼2800, i.e., ∼10%, of the proteins could systems after , zinc is essential for life. With closed d-shell, be zinc proteins, encompassing all the six International Union of unlike iron and , zinc (II) finds biological relevance as a Biochemistry recognized classes of [10]; in fact zinc is -stable element in metalloprotein structures. Interacting the only so widespread in the distribution. The zinc-finger mainly with amino-acid side chains and occasionally with non- type transcription regulators, comprising more than 4% of protein ligands in mostly tetra, penta, and hexa-coordinate human proteome [9], call upon zinc as a tetra-coordinating complexes, zinc contributes both structural and catalytic roles in element; involvements of zinc as catalytic element, on other metalloprotein structures [1,2]. Progressive studies of zinc hand, call upon its electrophilic character, in stabilizing anionic proteins have established their involvements in diverse cellular intermediates, and dynamic coordination geometry, in facilitat- ing bond rearrangements. In protein engineering and design applications, zinc could be an aid in control of both con- formation and function [11]. The currently known zinc centers could provide the structural consensus for characterization of the newer zinc-binding sequences, as well as serve as the guidelines for protein engineering and design applications. Reviews of geometry and stereochemistry of zinc-coordination centers have appeared from time to time [12–21]. However, the structural database of zinc proteins has grown at least five times since the last survey 1248

Table 1 PDB codes of 228 structural zinc proteins, 102 NMR (Panel a) and 126 X-ray determined (Panel b), and 154 catalytic zinc proteins, 7 NMR (Panel c) and 147 X-ray determined (Panel d), investigated in this study Panel a a 1A1T 1FRE 1M36 1PFT 1U2N 1VYX 1XPA 1ZFO 2AYJ 2FNF 1AJY 1G25 1M9O 1PXE 1U6P 1WIG 1XRZ 1ZGW 2AZH 2GAT 1BBO 1G47 1MM2 1PYC 1U7J 1WII 1XWH 1ZR9 2BL6 2GHF 1BOR 1IBI 1N0Z 1R9P 1U85 1WJ0 1Y0J 1ZU1 2C6A 2HGH 1DSQ 1IML 1NCS 1RXR 1UW0 1WJ2 1Y8F 1ZW8 2CON 2I5O 1DSV 1IYM 1NEE 1SRK 1UW2 1WO3 1YUI 2A20 2CQE 2IDA 1DX8 1J2O 1NJ3 1T3K 1VA1 1WPK 1YWS 2A23 2DB6 2IHX 1DXW 1JJD 1NYP 1TF3 1VA2 1WWD 1Z60 2A51 2F8B 2JM3 1E4U 1JN7 1OVX 1TM6 1VA3 1XF7 1ZE9 2AF2 2FEJ 2JNE 1EF4 1LPV 1P7M 1TOT 1VD4 1XJH 1ZFD 2AQC 2FK4 2ODX 1EXK 1LV3

Panel b b 1A2P 1NW2 1R1H 1U5K 1YW4 2AU3 2CYE 2FU5 2H6L 2OGW 1A7W 1OAO 1R2Z 1VJ0 1Z05 2AYD 2DGE 2FYG 2HCV 2OH3 1BTK 1OJ7 1R5Y 1W4R 1Z83 2AZ4 2DS5 2G84 2HJN 2OIK 1DVP 1ONW 1RUT 1WWR 1Z84 2B3J 2ESL 2GC0 2I13 2OMH 1ENR 1OQJ 1RYQ 1WY2 1ZED 2B4Y 2EV6 2GFE 2I9W 2OSO 1GL4 1P3J 1SW1 1X0T 1ZKP 2B8T 2F5N 2GFO 2IMR 2OU3 1H3N 1P5D 1T0B 1XC8 1ZSW 2B9D 2F5P 2GLZ 2IQJ 2OWA 1INN 1P6O 1T64 1XM8 1ZZM 2BOQ 2FE3 2GNR 2J21 2OX0 1J30 1PZW 1T9H 1XTM 2A1K 2C1D 2FE8 2GPY 2J6A 2P57 1JW9 1Q08 1TDZ 1XV2 2AP1 2C2U 2FEA 2GVI 2J9U 2PEB 1KWG 1Q1A 1TJL 1XWY 2APO 2CIH 2FPR 2GWG 2O6D 2UVL 1L1T 1Q74 1TQX 1YC5 2AQ2 2CJS 2FQP 2GWN 2O8J 4KMB 1MA3 1Q9U 1U0A 1YQD 2AS9 2CS7

Panel c c 1DXW 1FFW 1XXE 1XYD 2AF2 2HDP 2JMO

Panel d d 1A4M 1GVF 1KO3 1P5X 1TAZ 1XOV 1Z1L 2BO0 2FR5 2HPO 1AMP 1H2B 1KQ9 1Q2O 1TBF 1XP2 1Z5R 2BZ1 2G3F 2HSI 1ATL 1HET 1LML 1Q7L 1TQW 1XRT 1ZDE 2C1I 2G64 2I0O 1C7K 1HKK 1LR5 1QH5 1U2W 1XRU 1ZKL 2CC0 2G7N 2IW0 1C8Y 1HP1 1LU0 1QIP 1UDV 1XVX 1ZY7 2CDC 2GFO 2J9A 1CA1 1IA9 1M65 1QWR 1UUF 1Y0Y 1ZZ1 2CFU 2GIV 2JG6 1D8W 1IE0 1M7J 1QWY 1V4P 1Y2K 2A21 2CKI 2GMW 2NX9 1ED9 1IM5 1MFM 1R55 1VHH 1Y7W 2A7M 2CKL 2GSO 2NXF 1EU3 1ITU 1MXG 1R5T 1VYK 1Y93 2A97 2CTB 2GU1 2O1Q 1EVL 1J79 1NTO 1S2Z 1W5Q 1Y9A 2AFW 2DDF 2H6F 2OB3 1F4T 1JD0 1OHT 1S4B 1WCZ 1YB0 2AIO 2DVT 2HBV 2OOT 1FRP 1JIW 1OI0 1SG0 1WN5 1YIX 2BC2 2F4M 2HC9 2ORW 1FUA 1JJT 1OJR 1SG6 1X8H 1YLK 2BGX 2FOU 2HEK 2OVY 1GKP 1K7H 1ONW 1SR9 1XEM 1YM3 2BIB 2FPQ 2HF1 8TLN 1GUD 1K9Z 1OYW 1T0A 1XOC 1YT3 2BNM a The dataset of 102 structural zinc proteins (NMR structures). b The dataset of 126 structural zinc proteins (X-ray structures). c The dataset of 7 catalytic zinc proteins (NMR structures). d The dataset of 147 catalytic zinc proteins (X-ray structures).

[18]. Specifically, explosive growth of the NMR characterized 2. Materials and methods zinc–protein structures has been witnessed in addition to the “ ” X-ray characterized structures [22]. The combined resource of Protein Data Bank (PDB) [22] was queried for keyword ZN with a sequence identity cut-off ≤50% cross checked on PISCES1 server of Dunbrack about 400 structures will possibly better capture the consensus [23], furnishing a total of 1321 structures. Restricting to high-resolution X-ray in geometry and conformation in the newly uncovered zinc- structures (≤2.0 Å resolution, b45 average B-factor, and b0.25 R-value) but binding patterns of diverse genomes. We have examined the zinc including all NMR structures, gave 382 PDB entries corresponding to 228 centers available in Protein Data Bank [22] as of April 2007 for structural proteins and 154 catalytic zinc–proteins. Among the structural details of structure and conformation and summarize the results in this report. 1249

coordination, 31% in C5 coordination, and 11% in C6 coordination. The C4 coordination sites are always tetrahedral. Majority of the C5 coordination sites are trigonal bipyramidal, while the C6 coordination sites are always octahedral. Presumably electro- static and steric interactions among the ligands define the coordination geometries. Deviations from ideal geometry occur for some of the following reasons:

(1) A extraneously H-bonded often in secondary coordination sphere and sometime with solvent. (2) Bidentate coordination of Asp (D) and Glu (E) carbox- ylates via both ; among 400 carboxyl ligands, 67 instances of bidentate arrangement were observed. (3) Simultaneous ligation of backbone N and CO to Zn: this was observed in 3 cases. (4) Multi-Zn sites involving Asp, Glu, or His (H) as bridging ligand between Zn atoms.

Fig. 1. Ideal geometries for 4, 5, and 6 coordinated Zinc. Ax =axial, Eq=equatorial. The structural Zn sites are generally tetrahedral with all the four ligands from protein. In the catalytic Zn sites or inhibitor usually occupies the fourth ligation site. In structural proteins, 102 entries were NMR structures and 126 were X-ray structures, while zinc proteins conservation of length of the spacer between the among 154 catalytic proteins, 7 entries were NMR structures and 147 were X- residues ligated with zinc is observed. The ray structures. Classification of the protein structures as the structural or catalytic structures was made on basis of the available descriptions in specific PDB conservation is less stringent among the catalytic zinc sites, entries. The NMR structures were only considered for conformation in the zinc- possibly due to the requirement of conformational flexibility for coordination centers, while the X-ray structures were considered for both catalytic function. Short and long spacers generally link the geometry and conformation. The overall dataset includes apo-enzymes, protein ligands in both structural and catalytic sites. The shorter – inhibitor complexes, and mutants. The PDB codes of the proteins spacers, generally 2 to 7 residues, occur between the first and analyzed, their classification, and the technique of structure determination are summarized in Table 1. second zinc ligand in both the structural and catalytic sites, and The analysis performed with in-house programs2 involved searching PDB between the third and fourth zinc ligand in the structural coordinates for the amino-acid residues within 3.0 Å cut-off of zinc atom, proteins. A comparatively longer N20 residue spacer occurs calculating bond lengths and angles of the atoms coordinated with zinc, and between the second and third zinc ligand in both the structural evaluating sequence positions of the amino acids coordinated with zinc. The and catalytic sites (Tables S1–S4), similar to the observations of angle of planes of coordinating with metal ion was also determined with in-house program. ϕ, ψ Dihedrals of zinc coordinated amino- Vallee et al. [24]. acid residues were extracted with PROSS program.3

3. Results and discussion

3.1. Primary coordination sphere of Zn-binding sites

Zn sites are classified according to the coordination number in the primary coordination sphere. Zn readily forms four, five, and six coordinate complexes, as shown in Fig. 1, but tetra- coordinated structures (C4) in tetrahedral geometry are the most prevalent. Among penta-coordinated structures trigonal-bipy- ramidal and square-pyramidal geometry prevails, while among hexa-coordinated structures (C6) octahedral geometry prevails. As noted in Fig. 2, among 126 structural proteins (X-ray structures) a majority of the zinc sites (82%) are in C4 coordination; only 14% being in C5 and 4% in C6 coordination. Of 147 catalytic proteins (X-ray structures), 58% are in C4

Fig. 2. Frequency of specific coordination numbers of Zinc sites. 1250

Zinc sites are classified according to the patterns of the coordinated amino acids, i.e., the type of amino acid and the position along sequence. Frequencies of the observed zinc- binding patterns (ZBP) considering only the protein residues are shown in Figs. 4 and 5. The tetra-coordinated ZBP patterns are more varied among the catalytic than structural zinc proteins; among the structural proteins the top 10 patterns contribute ∼75% of the sites, while among the catalytic proteins the top 10 patterns contribute only 46% of the sites. Notably, even among ZBPs of identical ligands, e.g., two Cys and two His in the structural proteins, there is often a stronger preference for a given order in amino acid sequence, e.g., the pattern CCHH is much more prevalent than HHCC as noted in Fig. 5a, but there Fig. 3. Amino acid propensities (%) to be Zinc ligands in X-ray and NMR structures. Panel a: structural zinc proteins. Panel b: catalytic zinc proteins. is no such prevalence observed in case of three C and one H. As noted in Fig. 5a CCHC, CHCC, and HCCC are with similar propensities. There are specific preferences noted among the 3.2. Properties of Zn ligands catalytic sites; e.g., HHHD and HHDD are more prevalent then HDHH and DHDH. Among tri-coordinated ZBPs, the patterns The statistics of protein residues serving as Zn ligands are in are similar in both the structural and catalytic Zn sites. As Fig. 3. A higher prevalence of Cys (C) than previously known expected, the tri-coordinated sites are 2.5 times as prevalent in [8,12,14,17,18,21] is detected in conformity with the results of catalytic proteins as in structural proteins. The observed Andreini et al. [9] for human genome. In particular, increasing statistics are similar to those in human proteome [9]. number of cysteine coordinated structural proteins have been The propensities of zinc coordinated His, Cys, and Asp characterized in recent years [9,10,25] including zinc-finger residues for specific secondary structure are summarized in proteins and metallothianines with 3–9 mainly cysteine-coordi- Table 2. His ligands reflect a clear preference for the helical- nated zinc ions. type ϕ, ψs in the structural proteins, but comparable preferences

Fig. 4. Tri-coordinated Zinc Binding Patterns (ZBP). ZBPs in structural proteins (panel a), and catalytic proteins (panel b). In specific ZBPs, the Zinc ligands are in order of appearance in sequence (from N to C terminal). Other ZBPs correspond to diverse combinations of the ligands C, H, E, D, K, N and W; with C for cysteine, H for , E for , D for , K for lysine, N for aspargine, and W for tryptophan (the amino acids K, N and W are coordinated through the backbone N/O atoms). Panel a: structural proteins. Panel b: catalytic proteins. 1251

Fig. 5. Tetra-coordinated Zinc Binding Patterns (ZBP). ZBPs in structural proteins (panel a), and catalytic proteins (panel b). In specific ZBPs, the Zinc ligands are in order of appearance in sequence (from N to C terminal). Other ZBPs correspond to diverse combinations of the ligands C, H, E, D, K, N and W; with C for cysteine, H for histidine, E for glutamic acid, D for aspartic acid, K for lysine, N for aspargine, and W for tryptophan (the amino acids K, N and W are coordinated through the backbone N/O atoms). for the helical and strand-type ϕ, ψs in the catalytic proteins. NandZn–O bonds are of comparable length, while Zn–S bonds Cys ligands reflect comparable preferences for the helical and are appreciably longer, due to larger atomic size of . strand-type ϕ, ψs, while Asp ligands reflect a greater preference The data on angles between Zn and coordinating atoms are in for the strand-type ϕ, ψs. Table 3. Deviations from ideal values occur and could involve Among 228 structural proteins, 37 Zn sites are coordinated to a pair repulsions, or effects like bidentate coordination of water molecule, while 15 Zn sites are coordinated to an inorganic carboxyl group, extraneous H-bonding to secondary coordina- moiety. Among 154 catalytic proteins, 106 Zn sites are with tion sphere or solvent, and bridging ligands in multi-Zn sites. In metal-bound water or ion and 60 sites are with a Zn- the dataset of angles, O–Zn–O angles of protein residues are bound inhibitor, with the inhibitor having either displaced water classified in two categories, the angle involving atoms or increased the coordination number of zinc. Among the catalytic from two different residues and the angles involving both proteins, the Zn sites tend to be solvent accessible, while among oxygen atoms from the same residue, i.e., bidentate arrange- the structural proteins, they tend to be solvent sequestered with ment, denoted by O–Zn–O′. The latter angles are smaller with exception of insulin which has one site exposed to solvent. average value of ∼53° for Eq–Zn–Ax (Here, Eq=Equitorial, His side chain can ligate in alternate tautomeric forms, either Ax=Axial) and in smaller deviations than other angles. Some via δ tautomer with δ protonated or via ε tautomer with important observations are as follows. ε nitrogen protonated. About 73% of His residues are coor- dinated via ε nitrogen and 27% via δ nitrogen. Being similar in coordination geometry, the tautomers are treated together from Table 2 Amino acid propensities to occur in specific secondary structures now on. Imidazole rings display ±10° deviations from the mean plane of metal coordination. % Helix % Strand % Others Aspstr 21 31 48 3.3. Geometry of primary coordination sphere of Zn Aspcat 19 33 48 Cysstr 19 18 63 Cyscat 18 28 54 – There are no significant variations in the Zn X bond lengths (X Hisstr 34 20 47 being any coordinating atom), except with X as sulfur. The data on Hiscat 29 26 45 bond lengths, including standard deviations in the bond lengths, Asp — Aspartic acid, Cys — Cysteine, His — Histidine. Subscript denotes the are summarized in Table S3 under Supplementary Materials. Zn– type of Proteins: str —Structural proteins, cat — Catalytic proteins. 1252

3.4. Zn in catalytic sites O(i) – Zn

– Among C4 sites, though the average angles are close to ideal, some distortions like O–Zn–ONN–Zn–O∼N–Zn–Narenoted. This could be explained on the basis of electron pair repulsion O(w) O

– model. In C5 coordination sites, the Eq–Zn–Ax angles are nearer Zn

– the ideal value but are greater in the range of deviation. The Ax– O 167 (5), 2 Zn–Ax angles are farther from the ideal and with greater standard ′ O

– deviation. The angles in octahedral geometry are generally farther Zn =Square Pyramidal, C6=Octahedral, from the ideal value. The observed deviations are similar to those ′ –

56, 1 82 (17), 3 in C4 and C5 geometries. In contrast of the angle deviations in C5 geometry, the diagonal angles in C6 are nearer to the ideal values. OO –

Zn 3.5. Zn in structural sites –

onal Bipyramidal, C5 The trends are similar to those in the catalytic proteins. The

O(i) O average values in C4 Zn sites are close to the ideal tetrahedral – c sites in crystal structures of enzymes. Zn geometry. In C4 coordination sites, the observed trend in angles – involving nitrogen and oxygen is N–Zn–N∼N–Zn–ObO– Zn–O, in agreement of simple electron pair repulsion model, but O(w) S

– converse of the reports of Albert et al. [18]. A possible explanation Zn

– is that in the Albert et al. study the monodentate and bidentate angles involving oxygen atom were clubbed, while these are

OS analyzed separately in this study; as a result of the separation from – the bidentate O–Zn–O′ case, the average O–Zn–Oangleis Zn – increased. The S–Zn–S angles are closer to the ideal tetrahedral value, in contrast of the angles in catalytic proteins, whereas – – S!S S Zn N angles are similar to those in the catalytic proteins. –

Zn A comparatively greater deviation of C5 angles from the – ideal geometry is observed. Particularly, Ax–Zn–Ax angles could be up to 30° deviated from the ideal angle 180°. The

O(i) S O–Zn–O angles are on the higher side for Eq–Zn–Ax when –

Zn the bidentate arrangements are excluded, in accordance with – the earlier reported observation [18]. The observed deviations in C6 geometry are similar to those

O(w) N in C4 and C5 geometries. With absence of Cys in C6 systems, –

Zn there is an increased propensity for oxygen donors. The angle –

Table 4 ON

– Ligand to metal distances in bidentate arrangements in Zinc proteins Zn

– Co-ord Aspartate Glutamate 156 (13), 5 164 (10), 8146 (18), 157 (10), 4 10155 (14), 8 168 (6), 11 167 (8), 16 165 (8), 4 164 (6), 10No. 147, 1 160 (9), 10 160 (10), 10 163 165 161 (12), (10), (7), 22 7 2 156 (4), 3 169 (6), 4 158 (10), 10 160, 1 Zn–O′ Zn–Oʺ Zn–O′ Zn–Oʺ

SN C4 1.97 (0.17), 4 2.69 (0.10), 4 2.14 (0.02), 2 2.52 (0.28), 2 –

Zn 2.02 (0.20), 11 2.62 (0.25), 11 2.07 (0.08), 8 2.60 (0.29), 8 – C5′ 2.20 (0.15), 4 2.52 (0.23), 4 2.07 (0.12), 7 2.47 (0.17), 7 C5 1.92 (0.0), 2 2.76 (0.13), 2 2.05 (0.14), 8 2.67 (0.19), 8 1.99 (0.06), 4 2.58 (0.28), 4 2.04 (0.08), 6 2.48 (0.39), 6 NN

– C6 2.08,1 2.31,1 2.05 (0.0), 2 2.50 (0.04), 2

Zn 2.08 (0.07), 5 2.52 (0.31), 5 1.99 (0.12), 3 2.64 (0.29), 3 – 107 (8), 71106 (7), 122 109 (6), 111 113 (7), 48 106 (15), 91 106 (16), 149 112 110 (17), (10), 61 38 112 (14), 113 (18), 34 25 111 (7), 85 110 (8), 402 107 107 (7), 22 (8), 30 107 (7), 106 18 (6), 18 117 (10), 10 108 (9), 111 24 (22), 66 53 109 (4), (15), 12 35 111 (22), 25 53 (2), 6 108 (12), 14 111 (24), 19 111 (20), 11 97 (4), 1395 (7), 44 92 (6), 34 93 (8), 77 97 (7), 6 92 (7), 21 88 92 (14), (8), 7 26 103, 1 97 (10), 5 93 (6), 18 54 (3), 18 90 (12), 34 89 (10), 8 93 (7), 19 54 (4), 8 88 (12), 14 89 (14), 7 112 (5), 6116 (12), 27 109 (2), 3 118 (7), 25 120 (9), 35 118 (8), 120 20 (9), 17 133, 1 123 (8), 15 128, 1 124, 1 118 (10), 13 120 (10), 13 129 (12), 3 118 (7), 8 119 (14), 4 122 (15), 3 99 (6), 25169, 1 100 (5), 297 (5), 94 9 (9), 4298 (6), 18 95 (10), 20 96 (7), 20 140, 1 93 (8), 92 37 (8), 83 141 (6), 2 94 95 (6), (7), 16 23 87 93 (7), (9), 5 17 91, 1 97 (16), 21 56 (3), 7 90 (11), 24 92 (8), 12 99 (17), 28 107 (17), 12 54 (4), 57 15 (4), 6 92 (8), 39 91 (11), 29 93 (13), 47 85 (11), 11 165, 1166 (9), 4 156 158 (10), (13), 13 7 162 (6), 3 165 (8), 5 151, 1 159 (10), 11 156 (16), 164 6 (9), 3 170 (12), 5 137, 1 156 (2), 2 91 (10), 4 91 (11), 8 98 (12), 7 O′ and Oʺ denotes the two carboxylate oxygen atoms bound to zinc in a bidentate arrangement. Zn–O′ shows the average distance of the nearer oxygen

180 atom while Zn–Oʺ denotes the average distance between farther oxygen of

– carboxylate and zinc ion. Distances are in Å, standard deviations given in the Ax 90 Eq 120 – – parentheses followed by the number of observations. Values in italics type are Zn Zn Zn – for structural zinc sites in protein crystal structures from the PDB and values in – – bold are for catalytic zinc sites in protein crystal structures. Coordination Eq Ax Opposite 180 Opposite 180 Ax Adjacent 90 ′ ′ numbers 4, 5, 6 are denoted by C4, C5 and C6 respectively. C5 denotes square Table 3 Angles of Zinc coordinated atoms Angles N Eq-Equatorial, Ax-Axial. Values in italics type are for structural zinc sites in protein crystal structures from the PDB and values in bold are for zin C4C5 Eq 109.5 C6 Adjacent 90 Angles are given in degrees, standard deviations are given in parentheses followed by the number of observations. C4=Tetrahedral geometry, C5=Trig C5 pyramidal geometry while C5 denotes trigonal bipyramidal geometry. 1253 between diagonally placed atoms is closer to the ideal 180° than References in C5 geometry. [1] J.M. Berg, Y. Shi, The galvanization of biology: a growing appreciation for 3.6. Bidentate carboxylate groups the roles of zinc, Science 271 (1996) 1081–1085. [2] D.S. Auld, Structural zinc sites, Handbook of , vol. 3, – Among different kinds of bidentate arrangements possible, 2004, pp. 403 415. [3] J.E. Coleman, Zinc proteins: enzymes, storage proteins, transcription the most common are the simple and the bridging. Nearly all- factors, and replication proteins, Ann. Rev. Biochem. 61 (1992) 897–946. bidentate interactions of Zn are of simple type; both oxygen [4] H. Cho, S. Ramaswamy, B.V. Plapp, Flexibility of liver alcohol atoms of Glu or Asp tend to ligate the same Zn. As noted in dehydrogenase in stereoselective binding of 3-butylthiolane 1-oxides, Table 4, the Zn–O distances in bidentate carboxylates are quite Biochemistry 36 (1997) 382–389. varied; typically one of the oxygen tends to be nearer to Zn, but [5] S. Lindskog, A. Liljas, and the role of orientation in , Curr. Opin. Struck. Biol. 3 (1993) 915–920. there are also cases of similar distance; the distances are nearly [6] U. Derewenda, Z. Derewenda, E.J. Dodson, G.G. Dodson, C.D. Reynolds, the same irrespective of the coordination number. The average G.D. Smith, C. Sparks, D. Swenson, Phenol stabilizes more helix in a new distances for Zn coordination are shorter in the catalytic zinc symmetrical zinc insulin hexamer, 338 (1989) 594–596. proteins than in the structural zinc proteins, with aspartate as the [7] L. Banci, I. Bertini, F. Cramaro, C.R. Del, M.S. Viezzoli, Solution structure exceptions. A significant observation is that the bond length of of Apo Cu,Zn dismutase: role of metal ions in protein folding, – ′ – Biochemistry 42 (2003) 9543–9553. Zn O in bidentate arrangement is shorter than the Zn O [8] J.S. Hanas, D.J. Hazuda, D.F. Bogenhagen, F.Y. Wu, C.W. Wu, Xenopus distances in monodentate arrangement, which may result from A requires zinc for binding to the 5 S RNA gene, J. Biol. the smaller O–Zn–O′ angle allowing a greater penetrations of Chem. 258 (1983) 14120–14125. the ligand atom into the Zn coordination sphere. [9] C. Andreini, L. Banci, I. Bertini, A. Rosato, Counting the zinc–proteins encoded in the human genome, J. Proteome Res. 5 (2006) 196–201; C. Andreini, L. Banci, I. Bertini, A. Rosato, Zinc through the three domains 4. Conclusion of life, J. Proteome Res. 11 (2006) 3173–3178. [10] D.S. Auld, Zinc coordination sphere in biochemical zinc sites, BioMetals Extending previous reports [8,12,14,17–21,26], we have 14 (2001) 271–313. analyzed Zn coordination centers in the database of currently [11] L. Regan, Protein design: novel metal-binding sites, Trends Biochem. Sci. – available NMR and X-ray protein structures for specific features 20 (1995) 280 285. [12] D.W. Christianson, Structural biology of zinc, Adv. Protein Chem. 42 (1991) of geometry and conformation. Structural databases will be 281–355. inherently biased against the proteins unstructured and the [13] S. Karlin, Z.Y. Zhu, Classification of mononuclear zinc metal sites in proteins recalcitrant to structure characterization. But for this protein structures, Proc. Natl. Acad. Sci. U. S. A. 94 (1997) 14231–14236. caveat, the present analysis encompassing a larger dataset of [14] D.W. Christianson, R.S. Alexander, Another catalytic triad? Nature better-filtered structures perhaps better captures both the 19 (346) (1990) 225. [15] P. Chakrabarti, Geometry of interaction of metal ions with histidine diversity and the consensus for possible structures in the residues in protein structures, Protein Eng. 4 (1990) 57–63. currently known ZBPs. Although excellent in the exposition of [16] A. Vedani, D.W. Huhta, A new force field for modeling metalloproteins, structure and catalysis the currently available reviews of the J. Am. Chem. Soc. 112 (12) (1990) 4759–4767. structure in zinc centers [8,12–18,21] do not adequately capture [17] B.L. Vallee, D.S. Auld, Zinc: biological functions and coordination motifs, – the diversity and consensus for zinc binding and function. Many Acc. Chem. Res. 26 (10) (1993) 543 551. – [18] I.L. Alberts, K. Nadassy, S.J. Wodak, Analysis of zinc binding sites in more Zn S4 coordinated structures have been documented in the protein crystal structures, Protein Sci. 7 (1998) 1700–1716. recent years, establishing Cys as the prominent Zn ligand over [19] M.M. Harding, The geometry of metal–ligand interactions relevant to His in structural proteins [9,10]. Better capturing the consensus proteins. II. Angles at the metal atom, additional weak metal–donor of structure [9], and the subtleties of metal coordination, the interactions, Acta Crystallogr., D Biol. Crystallogr. 56 (2000) 857–867. present study will possibly better serve the cause of character- [20] M.M. Harding, Small revisions to predicted distances around metal sites in proteins, Acta Crystallogr., D Biol. Crystallogr. 62 (2006) 678–682. ization, engineering, and design of Zn proteins. [21] D.W.Christianson, R.S. Alexander, Carboxylate–histidine–zinc interactions in protein structure and function, J. Am. Chem. Soc. 111 (16) (1989) 6412–6419. Acknowledgements [22] H.M. Berman, J. Westbrook, Z. Feng, G. Gilliland, T.N. Bhat, H. Weissig, I.N. Shindyalov, P.E. Bourne, The Protein Data Bank, Nucleic Acids Res. – This work was supported by grant from Board of Research in 28 (2000) 235 242. [23] G. Wang, R.L. Dunbrack Jr., PISCES: a protein sequence culling server, Nuclear Sciences, India (BRNS) and Council of Scientific and Bioinformatics 19 (2003) 1589–1591. Industrial Research (CSIR), Government of India. KP and AK [24] B.L. Vallee, D.S. Auld, Short and long spacer sequences and other structural are recipients of fellowship from CSIR, India. features of zinc binding sites in zinc enzymes, FEBS Lett. 257 (1) (1989) 138–140. Appendix A. Supplementary data [25] B.L. Vallee, J.E. Coleman, D.S. Auld, Zinc fingers, zinc clusters, and zinc twists in DNA-binding protein domains, Proc. Natl. Acad. Sci. U. S. A. 88 (3) (1991) 999–1003. Supplementary data associated with this article can be found, [26] B.L. Vallee, D.S. Auld, Zinc coordination, function, and structure of zinc in the online version, at doi:10.1016/j.bbapap.2007.07.010. enzymes and other proteins, Biochemistry 29 (24) (1990) 5647–5659.