2E7z Lichtarge Lab 2006
Total Page:16
File Type:pdf, Size:1020Kb
Pages 1–13 2e7z Evolutionary trace report by report maker February 13, 2010 4.3.1 Alistat 13 4.3.2 CE 13 4.3.3 DSSP 13 4.3.4 HSSP 13 4.3.5 LaTex 13 4.3.6 Muscle 13 4.3.7 Pymol 13 4.4 Note about ET Viewer 13 4.5 Citing this work 13 4.6 About report maker 13 4.7 Attachments 13 1 INTRODUCTION From the original Protein Data Bank entry (PDB id 2e7z): Title: Acetylene hydratase from pelobacter acetylenicus CONTENTS Compound: Mol id: 1; molecule: acetylene hydratase ahy; chain: a; ec: 4.2.1.71 1 Introduction 1 Organism, scientific name: Pelobacter Acetylenicus; 2e7z contains a single unique chain 2e7zA (727 residues long). 2 Chain 2e7zA 1 2.1 Q71EW5 overview 1 2.2 Multiple sequence alignment for 2e7zA 1 2.3 Residue ranking in 2e7zA 1 2.4 Top ranking residues in 2e7zA and their position on the structure 1 2 CHAIN 2E7ZA 2.4.1 Clustering of residues at 25% coverage. 2 2.4.2 Overlap with known functional surfaces at 2.1 Q71EW5 overview 25% coverage. 3 From SwissProt, id Q71EW5, 98% identical to 2e7zA: 2.4.3 Possible novel functional surfaces at 25% Description: Acetylene hydratase Ahy. coverage. 7 Organism, scientific name: Pelobacter acetylenicus. Taxonomy: Bacteria; Proteobacteria; Deltaproteobacteria; Desulfu- 3 Notes on using trace results 11 romonadales; Pelobacteraceae; Pelobacter. 3.1 Coverage 11 3.2 Known substitutions 12 3.3 Surface 12 3.4 Number of contacts 12 3.5 Annotation 12 2.2 Multiple sequence alignment for 2e7zA 3.6 Mutation suggestions 12 For the chain 2e7zA, the alignment 2e7zA.msf (attached) with 44 sequences was used. The alignment was downloaded from the HSSP 4 Appendix 12 database, and fragments shorter than 75% of the query as well as 4.1 File formats 12 duplicate sequences were removed. It can be found in the attachment 4.2 Color schemes used 12 to this report, under the name of 2e7zA.msf. Its statistics, from the 4.3 Credits 13 alistat program are the following: 1 Lichtarge lab 2006 Fig. 1. Residues 4-245 in 2e7zA colored by their relative importance. (See Fig. 3. Residues 488-730 in 2e7zA colored by their relative importance. (See Appendix, Fig.15, for the coloring scheme.) Appendix, Fig.15, for the coloring scheme.) 2.4 Top ranking residues in 2e7zA and their position on the structure In the following we consider residues ranking among top 25% of residues in the protein . Figure 4 shows residues in 2e7zA colored by their importance: bright red and yellow indicate more conser- ved/important residues (see Appendix for the coloring scheme). A Pymol script for producing this figure can be found in the attachment. Fig. 2. Residues 246-487 in 2e7zA colored by their relative importance. (See Appendix, Fig.15, for the coloring scheme.) Format: MSF Number of sequences: 44 Total number of residues: 30237 Smallest: 648 Largest: 727 Average length: 687.2 Alignment length: 727 Average identity: 32% Most related pair: 98% Most unrelated pair: 21% Most distant seq: 34% Furthermore, 1% of residues show as conserved in this alignment. The alignment consists of 2% archaean sequences. (Descriptions of some sequences were not readily available.) The file containing Fig. 4. Residues in 2e7zA, colored by their relative importance. Clockwise: front, back, top and bottom views. the sequence descriptions can be found in the attachment, under the name 2e7zA.descr. 2.4.1 Clustering of residues at 25% coverage. Fig. 5 shows the 2.3 Residue ranking in 2e7zA top 25% of all residues, this time colored according to clusters they The 2e7zA sequence is shown in Figs. 1–3, with each residue colored belong to. The clusters in Fig.5 are composed of the residues listed according to its estimated importance. The full listing of residues in Table 1. in 2e7zA can be found in the file called 2e7zA.ranks sorted in the attachment. 2 Table 1. continued cluster size member color residues 641,642,645,648,652,655,661 672 green 6 9,12,16,46,48,49 Table 1. Clusters of top ranking residues in 2e7zA. 2.4.2 Overlap with known functional surfaces at 25% coverage. The name of the ligand is composed of the source PDB identifier and the heteroatom name used in that file. MGD binding site. Table 2 lists the top 25% of residues at the interface with 2e7zAMGD801 (mgd). The following table (Table 3) suggests possible disruptive replacements for these residues (see Section 3.6). Table 2. res type subst’s cvg noc/ dist (%) bb (A˚ ) 418 N N(95) 0.02 30/17 3.01 D(2) Fig. 5. Residues in 2e7zA, colored according to the cluster they belong to: red, followed by blue and yellow are the largest clusters (see Appendix for E(2) the coloring scheme). Clockwise: front, back, top and bottom views. The 141 C C(88) 0.08 23/2 2.50 corresponding Pymol script is attached. S(4) I(4) T(2) Table 1. 604 G G(88) 0.08 1/1 4.67 cluster size member .(2) color residues T(4) red 79 141,154,170,171,197,199,200 P(2) 201,202,204,207,210,211,213 A(2) 214,217,218,219,220,221,223 445 F F(79) 0.11 3/1 4.31 224,225,226,227,231,232,233 W(9) 235,237,238,239,242,243,246 Y(9) 249,250,253,256,257,260,262 M(2) 265,268,269,270,276,279,280 443 D D(77) 0.12 48/13 2.65 281,283,284,288,289,294,298 E(22) 308,313,315,317,318,319,324 447 T T(77) 0.12 5/0 3.34 554,555,557,558,559,560,561 S(9) 562,563,566,573,575,576,577 N(13) 578,580 613 S S(79) 0.12 19/8 2.60 blue 71 61,62,65,66,68,70,73,74,78 A(4) 81,82,83,84,85,86,87,90,93 L(2) 94,102,104,111,114,124,125 Q(6) 129,130,131,132,133,406,407 T(6) 408,411,416,418,432,433,437 606 R R(72) 0.14 22/0 2.90 438,440,443,445,447,448,449 G(9) 453,454,456,457,458,464,465 S(2) 487,490,494,497,501,502,505 H(9) 606,613,616,622,628,676,677 K(4) 678,679,681,682 P(2) yellow 15 597,598,599,601,634,636,637 676 H H(65) 0.16 57/2 3.01 continued in next column continued in next column 3 Table 2. continued Table 3. res type subst’s cvg noc/ dist res type disruptive (%) bb (A˚ ) mutations R(11) 418 N (Y)(FWH)(T)(R) S(4) 141 C (R)(K)(E)(H) A(2) 604 G (R)(K)(E)(H) Y(11) 445 F (K)(E)(TQD)(R) V(2) 443 D (R)(FWH)(YVCAG)(K) G(2) 447 T (R)(K)(FWH)(M) 416 A G(43) 0.17 47/34 3.14 613 S (R)(K)(H)(FW) A(29) 606 R (D)(T)(YE)(SVCLAPIG) N(6) 676 H (E)(Q)(D)(M) S(18) 416 A (R)(K)(E)(Y) T(2) 111 T (KR)(Q)(H)(M) 111 T G(63) 0.22 12/1 2.64 465 R (TD)(Y)(EVA)(CLPIG) S(11) 114 N (Y)(FW)(H)(T) A(2) 424 A (R)(K)(Y)(EH) V(9) E(2) Table 3. List of disruptive mutations for the top 25% of residues in Y(2) 2e7zA, that are at the interface with MGD. H(2) T(4) L(2) 465 R E(2) 0.22 8/0 2.86 R(65) H(15) T(4) Q(4) K(2) Y(2) V(2) 114 N R(29) 0.24 39/13 2.93 N(11) K(11) .(18) M(2) S(6) G(13) H(2) T(4) 424 A S(9) 0.24 1/1 4.94 P(25) G(18) A(40) E(2) T(2) Fig. 6. Residues in 2e7zA, at the interface with MGD, colored by their rela- V(2) tive importance. The ligand (MGD) is colored green. Atoms further than 30A˚ away from the geometric center of the ligand, as well as on the line of sight to the ligand were removed. (See Appendix for the coloring scheme for the Table 2. The top 25% of residues in 2e7zA at the interface with protein chain 2e7zA.) MGD.(Field names: res: residue number in the PDB entry; type: amino acid type; substs: substitutions seen in the alignment; with the percentage of each type in the bracket; noc/bb: number of contacts with the ligand, with the num- Figure 6 shows residues in 2e7zA colored by their importance, at the ber of contacts realized through backbone atoms given in the bracket; dist: interface with 2e7zAMGD801. distance of closest apporach to the ligand. ) MGD binding site. Table 4 lists the top 25% of residues at the interface with 2e7zAMGD802 (mgd). The following table (Table 5) suggests possible disruptive replacements for these residues (see Section 3.6). 4 Table 4. Table 4. continued res type subst’s cvg noc/ dist res type subst’s cvg noc/ dist (%) bb (A˚ ) (%) bb (A˚ ) 200 D D(100) 0.01 164/31 1.80 S(2) 221 D D(100) 0.01 81/5 1.46 H(9) 219 G G(97) 0.02 29/29 2.88 K(4) R(2) P(2) 201 P P(97) 0.03 88/14 3.01 171 G G(84) 0.17 88/88 2.79 S(2) W(4) 12 C .(15) 0.06 5/0 4.65 S(2) C(81) A(6) H(2) N(2) 220 T T(86) 0.06 5/5 4.10 199 L V(52) 0.18 34/27 2.68 S(13) L(11) 48 K R(43) 0.07 23/0 2.94 I(36) K(50) 204 T N(9) 0.23 11/0 3.31 G(4) S(20) S(2) T(65) 218 Y S(6) 0.07 145/19 2.69 I(4) A(4) P(86) Table 4.