Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Science and Technology 1484

Extending the Reach of Computational Approaches to Model Enzyme Catalysis

BEAT ANTON AMREIN

ACTA UNIVERSITATIS UPSALIENSIS ISSN 1651-6214 ISBN 978-91-554-9816-0 UPPSALA urn:nbn:se:uu:diva-314686 2017 Dissertation presented at Uppsala University to be publicly examined in A1:111a, BMC, Husargatan 3, Uppsala, Friday, 24 March 2017 at 09:15 for the degree of Doctor of Philosophy. The examination will be conducted in English. Faculty examiner: Prof. Adrian Mulholland (University of Bristol).

Abstract Amrein, B. A. 2017. Extending the Reach of Computational Approaches to Model Enzyme Catalysis. Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Science and Technology 1484. 67 pp. Uppsala: Acta Universitatis Upsaliensis. ISBN 978-91-554-9816-0.

Recent years have seen tremendous developments in methods for computational modeling of (bio-) molecular systems. Ever larger reactive systems are being studied with high accuracy approaches, and high-level QM/MM calculations are being routinely performed. However, applying high-accuracy methods to large biological systems is computationally expensive and becomes problematic when conformational sampling is needed. To address this challenge, classical force field based approaches such as free energy perturbation (FEP) and empirical valence bond calculations (EVB) have been employed in this work. Specifically:

1. Force-field independent metal parameters have been developed for a range of alkaline earth and transition metal ions, which successfully reproduce experimental solvation free energies, metal- oxygen distances, and coordination numbers. These are valuable for the computational study of biological systems. 2. Experimental studies have shown that the epoxide hydrolase from Solanum tuberosum (StEH1) is not only an enantioselective enzyme, but for smaller substrates, displays enantioconvergent behavior. For StEH1, two detailed studies, involving combined experimental and computational efforts have been performed: We first used trans-stilbene oxide to establish the basic reaction mechanism of this enzyme. Importantly, a highly conserved and earlier ignored histidine was identified to be important for catalysis. Following from this, EVB and experiment have been used to investigate the enantioconvergence of the StEH1-catalyzed hydrolysis of styrene oxide. This combined approach involved wildtype StEH1 and an engineered enzyme variant, and established a molecular understanding of enantioconvergent behavior of StEH1. 3. A novel framework was developed for the Computer-Aided Directed Evolution of Enzymes (CADEE), in order to be able to quickly prepare, simulate, and analyze hundreds of enzyme variants. CADEE’s easy applicability is demonstrated in the form of an educational example.

In conclusion, classical approaches are a computationally economical means to achieve extensive conformational sampling. Using the EVB approach has enabled me to obtain a molecular understanding of complex enzymatic systems. I have also increased the reach of the EVB approach, through the implementation of CADEE, which enables efficient and highly parallel in silico testing of hundreds-to-thousands of individual enzyme variants.

Keywords: epoxide hydrolase, enantioselectivity, regioselectivity, enantioconvergence, biocatalysis, empirical valence bond, computational directed evolution

Beat Anton Amrein, Department of Cell and Molecular Biology, Box 596, Uppsala University, SE-75124 Uppsala, Sweden.

© Beat Anton Amrein 2017

ISSN 1651-6214 ISBN 978-91-554-9816-0 urn:nbn:se:uu:diva-314686 (http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-314686) /LVW RI SDSHUV

7KLV WKHVLV LV EDVHG RQ WKH IROORZLQJ SDSHUV ZKLFK DUH UHIHUUHG WR LQ WKH WH[W E\ WKHLU 5RPDQ QXPHUDOV

, 'XDUWH ) %DXHU 3 %DUUR]R $ $PUHLQ % $ 3XUJ 0 cTYLVW - DQG .DPHUOLQ 6 & /  )RUFH )LHOG ,QGHSHQGHQW 0HWDO 3DUDPHWHUV 8VLQJ D 1RQERQGHG 'XPP\ 0RGHO - 3K\V &KHP % ±

,, $PUHLQ % $ %DXHU 3 'XDUWH ) -DQIDON &DUOVVRQ c 1DZRU\WD $ 0RZEUD\ 6 / :LGHUVWHQ 0 DQG .DPHUOLQ 6 & /  ([SDQGLQJ WKH &DWDO\WLF 7ULDG LQ (SR[LGH +\GURODVH DQG 5HODWHG (Q]\PHV $&6 &DWDO ±

,,, %DXHU 3 -DQIDON &DUOVVRQ c $PUHLQ % $ 'REULW]VFK ' :LGHUVWHQ 0 DQG .DPHUOLQ 6 & /  &RQIRUPDWLRQDO 'LYHUVLW\ DQG (QDQWLRFRQYHUJHQFH LQ 3RWDWR (SR[LGH +\GURODVH , 2UJ %LRPRO &KHP ±

,9 $PUHLQ % $ 6WHIIHQ0XQVEHUJ ) 6]HOHU , 3XUJ 0 .XONDUQL < DQG .DPHUOLQ 6 & /  &$'(( &RPSXWHU$LGHG 'LUHFWHG (YROXWLRQ RI (Q]\PHV ,8&U- ±

5HSULQWV ZHUH PDGH ZLWK SHUPLVVLRQ IURP WKH SXEOLVKHUV $GGLWLRQDO 3XEOLFDWLRQV

9 'XDUWH ) $PUHLQ % $ DQG .DPHUOLQ 6 & /  0RGHOLQJ &DWDO\WLF 3URPLVFXLW\ LQ WKH $ONDOLQH 3KRVSKDWDVH 6XSHUIDPLO\ 3K\V &KHP &KHP 3K\V ±

9, 'XDUWH ) $PUHLQ % $ %ODKD1HOVRQ ' DQG .DPHUOLQ 6 & /  5HFHQW $GYDQFHV LQ 4000 )UHH (QHUJ\ &DOFXODWLRQV 8VLQJ 5HIHUHQFH 3RWHQWLDOV %%$ ± *HQHUDO 6XEMHFWV ±

5HSULQWV ZHUH PDGH ZLWK SHUPLVVLRQ IURP WKH SXEOLVKHUV &RQWHQWV

 ,QWURGXFWLRQ    &KHPLVWU\ DQG &DWDO\VLV    %LRFKHPLVWU\ %LRFDWDO\VLV DQG (Q]\PHV    5HJLR DQG (QDQWLRVHOHFWLYLW\    6RODQXP 7XEHURVXP (SR[LGH +\GURODVH     0ROHFXODU 0RGHOLQJ $SSURDFKHV    2XWOLQH  

 0HWKRGV DQG 7KHRUHWLFDO %DFNJURXQG    (Q]\PH .LQHWLFV    6HTXHQFH $QDO\VLV    6HTXHQFH 6SDFH    6WUXFWXUDO $QDO\VLV DQG 7RROV    2EWDLQLQJ 3URWHLQ 6WUXFWXUHV    /LPLWDWLRQV RI +LJK5HVROXWLRQ 6WUXFWXUHV    6WUXFWXUHV RI 3URWHLQ 9DULDQWV    7UDQVLWLRQ 6WDWH 7KHRU\    0ROHFXODU 6LPXODWLRQV    0ROHFXODU 0HFKDQLFDO 6LPXODWLRQV    0ROHFXODU '\QDPLFV 6LPXODWLRQV    4XDQWXP 0HFKDQLFDO 6LPXODWLRQV    0XOWLVFDOH 0RGHOLQJ    )UHH (QHUJ\ 3HUWXUEDWLRQ    (PSLULFDO 9DOHQFH %RQG $SSURDFK  

 0HWKRGRORJ\ $GYDQFHV    3DSHU ,    3DSHU ,9    0XWDWDJHQVLV    (IILFLHQW 3DUDOOHO ([HFXWLRQ    $XWRPDWLF $QDO\VLV    6XJJHVWHG :RUNIORZ  

 (9% $SSOLFDWLRQV    3DSHU ,,    3URWRQDWLRQ RI + DQG +    &RQVHUYDWLRQ RI ( DQG +    2[\DQLRQ +ROHV    (4 +1 DQG 'RXEOH 0XWDQW    3DSHU ,,,    (QJLQHHUHG 9DULDQW DQG &U\VWDO 6WUXFWXUH    %LQGLQJ 0RGHV  

 &RQFOXVLRQV DQG )XWXUH 3HUVSHFWLYH  

 3RSXOlUYHWHQVNDSOLJ 6DPPDQIDWWQLQJ  

 $FNQRZOHGJPHQWV  

 5HIHUHQFHV   $EEUHYLDWLRQV

c cQJVWU|P −10 P '% 'DWD %DQN ')7 'HQVLW\ )XQFWLRQDO 7KHRU\ HH (QDQWLRPHULF ([FHVV (9% (PSLULFDO 9DOHQFH %RQG IV )HPWRVHFRQG 10−15 V  kcat &DWDO\WLF 5DWH 8QGHU 2SWLPXP 6DWXUDWLQJ &RQGLWLRQV V .0 0LFKDHOLV0HQWHQ &RQVWDQW 0' 0ROHFXODU '\QDPLFV 00 0ROHFXODU 0HFKDQLFV 02±40 0ROHFXODU 2UELWDO 4XDQWXP 0HFKDQLFV QV 1DQRVHFRQG 10−9 V 3'% 3URWHLQ 'DWD %DQN SV 3LFRVHFRQG 10−12 V 40 4XDQWXP 0HFKDQLFV 4000 4XDQWXP 0HFKDQLFV  0ROHFXODU 0HFKDQLFV 6W(+ 6RODQXP WXEHURVXP (SR[LGH +\GURODVH , 9% 9DOHQFH %RQG &RPSXWHU 3URJUDPV 8VHG LQ WKLV :RUN

1DPH 'HVFULSWLRQ /LQN %DVK 8QL[ 6KHOO KWWSVZZZJQXRUJVRIWZDUHEDVK &$'(( &RPSXWHU$LGHG 'LUHFWHG (YROXWLRQ RI (Q]\PHV KWWSVJLWKXEFRPNDPHUOLQODEFDGHH &KHP'UDZ 0ROHFXOH (GLWRU KWWSZZZFDPEULGJHVRIWFRPVRIWZDUHRYHUYLHZDVS[ &KLPHUD 0ROHFXODU 0RGHOLQJ 6\VWHP KWWSVZZZFJOXFVIHGXFKLPHUD &OXVWDOΩ 0XOWLSOH 6HTXHQFH $OLJQPHQW KWWSVZZZHELDFXN7RROVPVDFOXVWDOR *520$&6 0ROHFXODU '\QDPLFV 6LPXODWLRQ 3DFNDJH KWWSZZZJURPDFVRUJ *LW 9HUVLRQ &RQWURO 6\VWHP KWWSVJLWVFPFRP 0DHVWUR *UDSKLFDO ,QWHUIDFH IRU 0ROHFXODU 0RGHOOLQJ E\ 6FKU|GLQJHU //& KWWSVZZZ6FKU|GLQJHUFRPPDHVWUR 02/$5,6 0ROHFXODU '\QDPLFV (9% KWWSODHWURXVFHGXVRIWZDUHKWPO

3523.$ 3UHGLFWLRQ RI S.D 9DOXHV RI ,RQL]DEOH *URXSV LQ 3URWHLQV KWWSVJLWKXEFRPMHQVHQJURXSSURSND 3\02/ 0ROHFXODU *UDSKLFV 6\VWHP KWWSVZZZS\PRORUJ 3\WKRQ 6FULSWLQJ /DQJXDJH KWWSVZZZS\WKRQRUJ 4 0ROHFXODU '\QDPLFV 3DFNDJH KWWS[UD\EPFXXVHaDTZZZT 6&:5/ 3URWHLQ 6LGH &KDLQ 3UHGLFWLRQ KWWSGXQEUDFNIFFFHGXVFZUO 90' 9LVXDO 0ROHFXODU '\QDPLFV KWWSZZZNVXLXFHGX5HVHDUFKYPG  ,QWURGXFWLRQ

 &KHPLVWU\ DQG &DWDO\VLV 7KH ZRUOG LV IXOO RI FKHPLFDO UHDFWLRQV DQG WKH SURGXFWV RI FKHPLFDO UHDFWLRQV DUH HYHU\ZKHUH DURXQG XV LQ WKH IRUP RI FORWKLQJ PDGH IURP V\QWKHWLF ILEHUV OLNH Q\ORQ WKH 3(7 ERWWOHV WKDW EHYHUDJHV DUH VWRUHG LQ WKH IXHO WKDW LV EXUQHG ERWK WR SURSHO FDUV DQG WUXFNV RU WR FDUDPHOL]H VXJDU RQ WRS RI GHOLFLRXV WUHDWV 0DQNLQG KDV XVHG FKHPLFDO UHDFWLRQV IRU WKRXVDQGV RI \HDUV 3UHKLVWRULF KX PDQV XVHG FKHPLFDO UHDFWLRQV WKDW UDQJHG IURP VLPSOH FRPEXVWLRQ IRU KHDW DQG FRRNLQJ > @ WR IHUPHQWDWLRQ WR FUHDWH FKHHVH > @ RU WR FUDIW DOFRKROLF EHYHUDJHV > @ $W WKH EHJLQQLQJ RI WKH WK FHQWXU\ D IXQGDPHQWDO FDWDO\VW ZDV GLVFRY HUHG E\ )ULW] +DEHU WKDW ZDV ODWHU EURXJKW WR LQGXVWULDO VFDOH E\ &DUO %RVFK WKH +DEHU%RVFK SURFHVV XVHG IRU WKH LQGXVWULDO IL[DWLRQ RI QLWURJHQ >±@ 7R GD\¶V LQGXVWULDO UHDFWLRQV WKDW LQYROYH KHWHURJHQHRXV FDWDO\VWV DUH XVHGRQYHU\ ODUJH VFDOHV VHH HJ > @  :KLOH LQGXVWULDO VFDOH UHDFWLRQV OLNH WKH +DEHU%RVFK SURFHVV DUH YHU\ LP SRUWDQW UHDFWLRQV D WUHQG WRZDUGV JUHHQ FKHPLVWU\ LV REVHUYHG DQG V\QWKHWLF SDWKZD\V IRU PRUH DWRP HIILFLHQF\ DUH VRXJKW DIWHU SDWKZD\V LQYROYLQJ OHVV WR[LF LQWHUPHGLDWHV IHZHU E\SURGXFWV DQG DYRLGLQJ WR[LF VROYHQWV ,Q WKLV FRQ WH[W ELRFDWDO\VWV KDYH IRXQG WR KDYH JUHDW SRWHQWLDO IRU LQGXVWULDO XVDJH >@ VHH DOVR 7DEOH  ,Q JHQHUDO FDWDO\VWV ORZHU WKH DFWLYDWLRQ EDUULHU IRU D UHDF WLRQ FRPSDUHG WR WKH VDPH UHDFWLRQ LQ D GLIIHUHQW HQYLURQPHQW LQ RWKHU ZRUGV FDWDO\VWV DUH LQFUHDVLQJ WKH UDWH DW ZKLFK D UHDFWLRQ RFFXUV RIWHQ E\ PDQ\ RU GHUV RI PDJQLWXGH >@ 8QGHUVWDQGLQJ KRZ D FDWDO\VW IXQFWLRQV FDQ SRLQW WRZDUG KRZ LW FDQ EH LP SURYHG 7KH UHDFWLRQ PHFKDQLVP RI D UHDFWLRQ FDQ SRWHQWLDOO\ OHDG WR DQ XQGHU VWDQGLQJ RI KRZ WKH UHDFWLRQ FDQ EHVW EH LPSURYHG &RQVHTXHQWO\ JUHDW H[SHU LPHQWDO HIIRUWV DUH XQGHUWDNHQ WR LGHQWLI\ UHDFWLRQ PHFKDQLVPV >  ±@ 6LQFH WKH HQG RI WKH WK FHQWXU\ VLJQLILFDQW FRQWULEXWLRQV WR WKH ILHOG RIUH DFWLRQ NLQHWLFV DQG VWUXFWXUH GHWHUPLQDWLRQ KDYH EHHQ PDGH E\ 6 $UUKHQLXV 9 +HQUL 0LFKDHOLV DQG 0HQWHQ (\ULQJ (YDQV DQG 3RODQL 5 )UDQNOLQ :DWVRQ DQG &ULFN .HQGUHZ DQG 3HUXW] 0RGHUQ DQDO\WLFDO PHWKRGV DOVR FRQWULEXWH WR H[SHULPHQWDO UHVXOWV ZKLFK FDQ KHOS WR H[SODLQ UHDFWLRQ PHFKDQLVPV %XW HYHQ ZLWK PRGHUQ DQDO\WLFDO PHWKRGRORJLHV LW FDQ SURYH WR EH H[WUHPHO\ GLI ILFXOW WR XQGHUVWDQG D UHDFWLRQ H[SHULPHQWDOO\ IRU PXOWLSOH UHDVRQV 'DWD PD\ EH YHU\ GLIILFXOW WR REWDLQ H[SHULPHQWDOO\ )RU H[DPSOH VWUXFWXUDO LQIRUPDWLRQ PD\ EH YHU\ KDUG WR REWDLQ RU LW PD\ EH XQDYDLODEOH HJ PHPEUDQH SURWHLQV

 [17] or intrinsically unstructured proteins [18]). Also, the experimental tools needed to obtain information may not exist (e.g., as when a substrate has many possible but experimentally indistinguishable conformations [19]. As we will see later in this thesis, thanks to the computing power of hardware and that is available today (and which is undergoing continuous development), it is possible to simulate reactions in silico to provide additional insight [20, 21]. For some specialized problems, computational approaches have even shown potential for the design of new catalysts [22, 23]).

1.2 Biochemistry, Biocatalysis and Enzymes Biological processes adapt to their native environments, namely the tempera- ture, pressure, and solvent of the biological cell. Proteins that are biocatalyti- cally active (i.e., enzymes) that can handle reactions proficiently under the con- ditions in living cells, even at the diffusion limit, have evolved [24–26]. At the same time, although highly specific and efficient enzymes are needed for some reactions, biological organisms need to adapt new functions (e.g., to respond to new molecules in their environments). As early as 1976, Jensen suggested that catalytic promiscuity (i.e., the ability of some enzymes to turnover multi- ple substrates) is important to evolve new functions [27]. Today, this natural process is mimicked by protein engineers and facilitates a better understanding of the evolution of new functions [28]. In this context, various computational tools have been developed to predict enzyme promiscuity and also to recon- struct ancestral proteins [29, 30]. (For more details refer to our perspective on the catalytic promiscuity of the alkaline phosphatase superfamily [31]). Biocatalysts are catalysts that work in aqueous solutions, usually in the cell cytoplasm, with high concentrations of protein and other biological molecules present [32, 33]. For many reactions it is therefore crucial for an organism to have very selective catalysts. This, together with their high efficiency and the ability to function without precious metals, makes biocatalysts potent con- tributors to greener chemistry. Also, enzymes can be very selective in terms of chemo-, regio- and enantioselectivity. This can allow for greater atom- efficiency in chemistry because cleaning a product mixture both wastes mole- cules and is tedious. This, together with high efficiency, allows some enzymes to be used in industrial applications (see Table 1.1), and the number of enzymes applicable to industrial processes is increasing [34, 35]. Important factors of this development are that enzymes work under mild conditions and that they can be very selective. The application of enzymes can, however, be limited by the properties of their environment, namely phys- iological conditions with low substrate and enzyme loads. Enzymes are hence increasingly tuned toward process requirements [43, 44], but many challeng- ing problems remain unsolved [44]. Various computational approaches have been put forward to address those challenges [23, 45–48].

10 6WDUWLQJ &DWDO\VW 3URGXFWV $SSOLFDWLRQV 0DWHULDOV κ&DVHLQ &K\PRVLQ SDUDκ&DVHLQ &KHHVH κ0DFURSHSWLGH 3URGXFWLRQ >@ UDF 0HWK\O .HWRUHGXFWDVH 5 0HWK\O 3KDUPD YDOHUDOGHK\GH SHQWDQRO /LTXLG &U\VWDOV >@ .HWRQHV 7UDQVDPLQDVH &KLUDO $PLQHV 6LWDJOLSWLQ 0DQXIDFWXUH >@ )DWV /LSDVH )UHH IUHH IDWW\ )RRG 'LYHUVH 7ULDJO\FHURO DFLGV DQG >@ JO\FHURO &HOOXORVH &HOOXODVH 'LYHUVH %LRIXHO 7H[WLOHV >@ 3URWHLQ 3URWHDVH 3HSWLGHV )RRG ,QGXVWU\ >@ 6WDUFK $P\ODVH 0DOWRVH )RRG ,QGXVWU\ >@ 1LDFLQ $FKURPREDFWHU +\GUR[\ ,QVHFWLFLGHV [\ORVR[LGDQV QLFRWLQLF DFLG SUHFXUVRU >@ /. 7DEOH  /LVWHG DUH YDULRXV ELRFDWDO\WLFDOO\ LPSRUWDQW UHDFWLRQV DQG WKHLU FRUUH VSRQGLQJ HQ]\PHV %LRFDWDO\WLF SURFHVVHV DUH XVHG LQ LQGXVWU\ IRU D ZLGH UDQJH RI UHDFWLRQV UDQJLQJ IURP FKHPLFDO V\QWKHVLV WR ELRIXHOV 3OHDVH QRWH WKDW PRVW UHIHU HQFHV DUH UHYLHZV RI WKH FODVV RI HQ]\PHV

,Q WKLV ZRUN , IRFXV RQ WKH SRWDWR HQ]\PH 6RODQXP WXEHURVXP HSR[LGH K\ GURODVH  6W(+  ZKLFK LV D PHPEHU RI D SURWHLQ IDPLO\ RI Įȕ K\GURODVH IROG HQ]\PHV $ UDQJH RI HSR[LGH K\GURODVHV KDYH EHHQ LGHQWLILHG ZKLFK DUH FRQYHUWLQJ HSR[LGHV WR YLFLQDO GLROV >±@ $Q H[DPSOH RI DQ HSR[LGH LV VKRZQ LQ )LJXUH  (SR[LGH K\GURODVHV DUH FDSDEOH RI HQDQWLRVHOHFWLYH DQG HYHQ HQDQWLRFRQYHUJHQW V\QWKHVLV RI D YDULHW\ RI HSR[LGHV > @ 7KH SURG XFW RI WKHVH FDWDO\VWV DUH YLFLQDO GLROV ZKLFK DUH XVHG IRU WKH V\QWKHVLV RIILQH FKHPLFDOV >  @

 5HJLR DQG (QDQWLRVHOHFWLYLW\ 6HOHFWLYLW\ LQ FKHPLFDO UHDFWLRQV LV FHQWUDO WR WKH GHVLJQ RI QHZ FKHPLFDO SDWK ZD\V DQG WR WKH LPSURYHPHQW RI H[LVWLQJ UHDFWLRQV 6HOHFWLYLW\ LV GLVWLQJXLVKHG LQ FKHPR UHJLR DQG HQDQWLRVHOHFWLYLW\ &KHPRVHOHFWLYLW\ XVXDOO\ UHIHUV WR D UHDFWLRQ WKDW RFFXUV RQO\ DW D VSHFLILF IXQFWLRQDO JURXS RI DWRPV DQG ERQGV

 RI D PROHFXOH 5HJLRVHOHFWLYLW\ PHDQV WKDW LI WZR HTXDOO\ IXQFWLRQDO JURXSV DUH DYDLODEOH GHWHUPLQLQJ ZKLFK RI WKH WZR ZLOO UHDFW GHSHQGV RQ WKH VXEVWL WXWLRQ DW WKH IXQFWLRQDO JURXS )LQDOO\ HQDQWLRVHOHFWLYLW\ UHIHUV WR D UHDFWLRQ WKDW LQYROYHV FKLUDO PROHFXOHV RI ZKLFK RQH HQDQWLRPHU LV IRUPHG SUHIHUDEO\ RYHU WKH RWKHU RQH 8VLQJ WKH &DKQ,QJROG3UHORJ QRPHQFODWXUH VHH HJ >@ HQDQWLRVHOHFWLYLW\ LV WKH SUHIHUUHG IRUPDWLRQ RI RQH HQDQWLRPHU LQ VXFKDZD\ WKDW WKH UDWLR EHWZHHQ WKH 5 DQG 6 HQDQWLRPHU LV QRW  (QDQWLRVHOHFWLYLW\ UHTXLUHV XVXDOO\ D FKLUDO FDWDO\VW RU FRIDFWRU WR LQGXFH FKLUDOLW\ $Q H[DPSOH RI D FKLUDO PROHFXOH LV GLVSOD\HG LQ )LJXUH  ,PSRUWDQWO\ ELRFDWDO\VWV VXFK DV HQ]\PHV DUH EXLOW IURP FKLUDO EXLOGLQJ EORFNV WKH QDWXUDO DPLQR DFLGV DQG SURYLGH D FKLUDO VFDIIROG 7KH ZRUG FKLUDO LV GHULYHG IURP WKH *UHHN ZRUG ȤİȚȡ NKHLU IRU KDQG

)LJXUH  7KH WZR VWHUHRLVRPHUV RI VW\UHQH R[LGH 7KH 5 HQDQWLRPHU LV RQ WKH OHIW DQG WKH 6 HQDQWLRPHU LV RQ WKH ULJKW 6WHUHRLVRPHU\ RU FKLUDOLW\ DOVR UHIHUUHG WRDV HQDQWLRLVRPHU\ LV D SURSHUW\ RI PROHFXOHV WKDW FRQVLVW RI WKH VDPH ERQGV DQG DWRPV EXW WKDW DUH QRW VXSHULPSRVDEOH WKH\ DUH PLUURU LPDJHV RI HDFK RWKHU

(QDQWLRSXUH PROHFXOHV GLVSOD\ RSWLFDO URWDWLRQ RI OLJKW LI SRODUL]HG OLJKW LV GLUHFWHG WKURXJK WKHP ,Q PDQ\ FKLUDO FRPSRXQGV D VLQJOH DWRP LV WKH RUL JLQ RI WKH SURSHUW\ DQG WKHVH DWRPV DUH UHIHUUHG WR DV ³FKLUDO´ 6RPHWLPHV WKLV LV LQGLFDWHG ZLWK DQ DVWHULVN  VHH HJ )LJXUHV  DQG   8QGHU QRQ HQDQWLRVHOHFWLYH UHDFWLRQ FRQGLWLRQV FKLUDO FHQWHUV DUH XVXDOO\ IRUPHGDVDQ HTXDO PL[WXUH RI 5 DQG 6 SURGXFWV ,I D FDWDO\VW LV DEOH WR VKLIW WKH HTXLOLE ULXP EHWZHHQ 5 DQG 6 SURGXFWV WKH FDWDO\VW LV VDLG WR EH HQDQWLRVHOHFWLYH ,W LV LPSRUWDQW WR QRWH WKDW DOO FKHPLFDO SURSHUWLHV RI WZR HQDQWLRPHUV DUH HTXDO ZLWK WKH H[FHSWLRQ RI RSWLFDO DFWLYLW\ 7KLV PHDQV WKDW LW LV LPSRVVLEOH WR VHSD UDWH D UDFHPLF PL[WXUH E\ FRQYHQWLRQDO PHDQV HJ YLD GLVWLOODWLRQ  7KH FRP SRXQGV PXVW HLWKHU ILUVW EH FRQYHUWHG WR GLDVWHUHRLVRPHUV LH E\ DGGLQJ DQ DGGLWLRQDO VWHUHRFHQWHU VR WKDW WKH VWHUHRLVRPHUV DUH QR ORQJHU PLUURU LPDJHV  DQG WKHQ VHSDUDWHG DQG FRQYHUWHG EDFN WR WKH RULJLQDO FRPSRXQGV RU FKLUDO FKURPDWRJUDSKLF PHWKRGV PXVW EH HPSOR\HG ,Q PDQ\ FDVHV VSHFLILF HQDQ WLRPHUV PXVW EH LVRODWHG HJ LQ SKDUPDFHXWLFDO DSSOLFDWLRQV  DQG LW LV FUXFLDO

 WR DYRLG UDFHPL]DWLRQ ZKHQHYHU SRVVLEOH WR DYRLG KDYLQJ LQDFWLYH ³ZDVWHG´ PROHFXOHV DQG GLIILFXOW SXULILFDWLRQ VWHSV

$ %

& '

)LJXUH  (QDQWLRPHULF PROHFXOHV $ 7KH 5 DQG 6 HQDQWLRPHUV RI OLPRQHQH DUH UHVSRQVLEOH IRU WKH RGRU RI RUDQJHV DQG OHPRQV UHVSHFWLYHO\ % 7KH 5 DQG 6  HQDQWLRPHUV RI FDUYRQH DUH UHVSRQVLEOH IRU WKH RGRU RI FDUDZD\ DQG PLQW UHVSHFWLYHO\ & ,EXSURIHQ LV D SRWHQW FKLUDO DQWLLQIODPPDWRU\ GUXJ WKDW LV VROG DV D UDFHPLF PL[ WXUH RI 5 DQG 6 LEXSURIHQ 7KDQNV WR DQ LVRPHUDVH WKDW PDPPDOV SRVVHVV WKH OHVV DFWLYH 5 HQDQWLRPHU LV FRQYHUWHG LQWR WKH DFWLYH 6 HQDQWLRPHU LQ YLYR >@ ' 7KDOLGRPLGH ZDV RQFH XVHG DV D FXUH IRU PRUQLQJ VLFNQHVV >@ 8QIRUWXQDWHO\ WKH FRPSRXQG LV WHUDWRJHQLF DQG KDV VHYHUH HIIHFWV RQ WKH IHWXV OHDGLQJ WR VSHFLILF ELUWK GHIHFWV LI WDNHQ GXULQJ SUHJQDQF\ VHH HJ >@  ,W ZDV VXJJHVWHG WKDW SULPDULO\ RQO\ RQH HQDQWLRPHU LV UHVSRQVLEOH IRU WHUDWRJHQLF HIIHFWV >@ EXW WKLV LV VXEMHFW WR GHEDWH >@ $GGLWLRQDOO\ WKH FKLUDO FHQWHU LV XQVWDEOH LQ SURWRQDWHG PHGLD DQG UDFHPL]D WLRQ RFFXUV LQ YLYR 7KDOLGRPLGH LV QR ORQJHU XVHG WR WUHDW PRUQLQJ VLFNQHVV EXW LW LV FXUUHQWO\ XVHG WR FXUH OHSURV\ >@ DQG WR WUHDW FDQFHU >@

 6RODQXP 7XEHURVXP (SR[LGH +\GURODVH  (SR[LGH K\GURODVHV FDWDO\]H WKH FRQYHUVLRQ RI HSR[LGHV WR YLFLQDO GLROV 'H SHQGLQJ RQ WKH RUJDQLVP WKH ELRORJLFDO IXQFWLRQV RI WKLV HQ]\PH FODVV FRYHU D ZLGH UDQJH IURP GHWR[LILFDWLRQ RYHU FDWDEROLVP WR FHOOXODU VLJQDOLQJ >@ 6RPH HSR[LGH K\GURODVHV FRYHU D ZLGH UDQJH RI VXEVWUDWHV DQG LQWHUHVWLQJO\ HQDQWLRVHOHFWLYLW\ ZDV DOVR REVHUYHG >@ $ FDQGLGDWH RI SDUWLFXODU LQWHUHVW IRU VWXGLHV LV WKH SRWDWR 6RODQXP WXEHURVXP HSR[LGH K\GURODVH  6W(+ > @ 7KH ZLOGW\SH SURWHLQ DQG LWV DFWLYH VLWH DUH VKRZQ LQ )LJXUH  ,Q HDUOLHU VWXG LHV WKLV HQ]\PH ZDV IRXQG WR \LHOG RSWLFDOO\ SXUH SURGXFWV IRU VPDOO VXEVWUDWHV >@ (QJLQHHUHG SURWHLQ YDULDQWV WRJHWKHU ZLWK HDUOLHU H[SHULPHQWV DQG UHODWHG HSR[LGH K\GURODVHV KDYH EHHQ XVHG WR JDLQ LQVLJKW LQWR WKH UHDFWLRQ PHFKDQLVP RI WKH HQ]\PH ([SHULPHQWDO HYLGHQFH VXSSRUWV D WKUHHVWHS UHDFWLRQ PHFKD QLVP DIWHU VXEVWUDWH ELQGLQJ 0LFKDHOLV &RPSOH[  7KH LQLWLDO DWWDFN RI ' RSHQV WKH HSR[LGH DW HLWKHU HSR[LGH FDUERQ DQG LV WKHQ IROORZHG E\ D JHQHUDO EDVHDFWLYDWHG FRQVHUYHG DFWLYHVLWH ZDWHU PROHFXOH ZKLFK LV EHOLHYHG WR EH

 (A) (B)

Figure 1.3. The potato (Solanum tuberosum) epoxide hydrolase 1 (StEH1), PDB- ID: 2CJP. (A) The wild-type StEH1 displayed as surface view with part of the active site visible in the center of the image. (B) The substrate-free active site of wild-type StEH1 featuring the catalytically relevant residues: For the first step, these are the nucleophile D105, the oxyanion-hole tyrosines Y154 an Y235, and histidine H300 and the nucleophilic water. activated by H300. Finally, the tetrahedral intermediate is converted to product and then released to close the catalytic cycle (see Figure 1.4). The mechanistic cycle is unable to explain the origin of the regio- and enan- tioselectivity of StEH1. This is exactly where computational modeling unfolds its potential. A variety of substrates that were subjected to kinetic measure- ments with different protein variants and structures of some protein variants allowed us to build a computational model that could reproduce the experimental data. With the help of this computational model, we were able to establish the mechanism in even more detail and identified a previously ne- glected catalytically important residue – H104 – in the active site, see Paper II. We have then, thanks to the established details of the reaction mechanism, obtained insight to detailed regio- and enantioselectivity, see Paper III. As the trans-stilbene oxide (TSO) substrate, styrene oxide (SO) can be open- ed at both epoxide carbons (C1 and C2). In contrast to TSO, however, the product is not always the meso-diol, but it is experimentally distinguishable. It was found in earlier work that the wild-type enzyme is enantioconvergent and converts both (R)-SO and (S)-SO preferably to the (R)-product (see also Figure 1.5) [53, 69]. StEH1 was investigated using a combined experimental and computational approach and different enzyme variants have been tested with all stereoiso- mers of the substrate. In Paper II and Paper III EVB calculations have been performed on StEH1 with different mutant structures and two substrates, trans- stilbene oxide (TSO, Paper II) and styrene oxide (SO, Paper III), see also Fig-

14 )LJXUH  7KH JHQHUDOL]HG UHDFWLRQ PHFKDQLVP RI WKH FDWDO\WLF F\FOH FDWDO\]HG E\ 6W(+ 7KH FRPSXWDWLRQDO ZRUN IRFXVHG RQ WKH ILUVW WZR UHDFWLRQV VWHSV WKH QXFOH RSKLOLF HSR[LGH ULQJRSHQLQJ DQG WKH IRUPDWLRQ RI WKH WHWUDKHGUDO LQWHUPHGLDWH 7KH WKLUG VWHS ZDV QRW PRGHOHG EHFDXVH WKH EUHDNGRZQ RI WKH WHWUDKHGUDO LQWHUPHGLDWH LV QRW H[SHFWHG WR EH UDWHOLPLWLQJ >@

XUH  7KH FDOFXODWLRQV IRFXVHG RQ WKH ILUVW WZR VWHSV RI WKH UHDFWLRQ PHFKD QLVP DIWHU WKH HQ]\PH KDV IRUPHG WKH 0LFKDHOLV &RPSOH[ %HFDXVH WKH (9% DSSURDFK UHOLHV RQ DQ HVWDEOLVKHG UHDFWLRQ PHFKDQLVP LW LV XVHIXO WR KDYH H[ SHULPHQWDO GDWD IURP GLIIHUHQW HQ]\PH YDULDQWV WR HVWDEOLVK DQG YDOLGDWH WKH UHDFWLRQ PHFKDQLVP RI DQ HQ]\PH 7KH H[SHULPHQWDO GDWD LQFOXGLQJ NLQHWLF PHDVXUHPHQWV RI ERWK VWHDG\VWDWH NLQHWLFV DQG SUH±VWHDG\VWDWH NLQHWLFV DO ORZHG XV WR REWDLQ D JRRG XQGHUVWDQGLQJ RI 6W(+ ZLWK WKH 762 VXEVWUDWH 2QFH ZH HVWDEOLVKHG WKH UHDFWLRQ PHFKDQLVP IRU 762 ZH VHW RXW WR ZRUN RQ D VXEVWUDWH ZLWK GLVWLQJXLVKDEOH SURGXFWV QDPHO\ 62

 )LJXUH  7KH 62 VXEVWUDWH DV LOOXVWUDWLRQ RI KRZ D FKLUDO FDWDO\VW¶V HQDQWLRFRQYHU JHQFH LV H[SODLQHG LI DV LQ WKH FDVH IRU WKH ZLOGW\SH YDULDQW RI 6W(+ 6 62 LV RSHQHG RQ & WKH VWHUHR FHQWHU LV LQYHUWHG DQG WKH 5 GLRO IRUPV ,I DW WKH VDPH WLPH D FDWDO\VW SUHIHUDEO\ RSHQV WKH 5 62 DW & DJDLQ WKH 5 GLRO ZLOO EH WKH SURGXFW $V D FRQVHTXHQFH RQO\ 5 GLRO LV IRUPHG DQG HQDQWLRFRQYHUJHQW EHKDYLRU LV REVHUYHG

)LJXUH  7KH VXEVWUDWHV LQYHVWLJDWHG LQ WKLV ZRUN DUH WUDQVVWLOEHQH R[LGH 762 DQG VW\UHQH R[LGH 62  %HFDXVH 762 LV D V\PPHWULF VXEVWUDWH LWV K\GURO\VLV SURGXFW LV WKH VXSHULPSRVDEOH PHVRGLRO DQG VR LWV UHJLRVHOHFWLYLW\ FDQQRW EH GHWHUPLQHG H[ SHULPHQWDOO\ 6W\UHQH R[LGH LV QRW V\PPHWULF DQG UDWLRV EHWZHHQ 5  DQG 6 GLROV FDQ EH GHWHUPLQHG H[SHULPHQWDOO\

 0ROHFXODU 0RGHOLQJ $SSURDFKHV ,Q PRGHUQ VRFLHW\ FRPSXWDWLRQ LV XVHG WR JDLQ DGGLWLRQDO LQVLJKW LQWR V\V WHPV RI LQWHUHVW ,Q IDFW HDUO\ FRPSXWDWLRQDO FKHPLVWU\ ZDV XVHG E\ KXPDQV ORQJ EHIRUH FRPSXWHUV EHFDPH DYDLODEOH DQG VRPH RI WKRVH FRPSXWDWLRQDO DS SURDFKHV DUH XVHG WRGD\ RQ PRGHUQ FRPSXWHUV ZLWK HYHU PRUH FRPSOH[ V\V WHPV (DUO\ DSSURDFKHV LQ FRPSXWDWLRQDO FKHPLVWU\ GLG UHO\ RQ KXPDQ FRPSX WDWLRQDO SRZHU DV H[HPSOLILHG E\ WKH DFNQRZOHGJPHQWV LQ 5REHUW 0XOOLNHQ¶V ZRUN RQ HOHFWURQLF SRSXODWLRQ DQDO\VLV ZKLFK LV VWLOO LQ XVHG WR FDOFXODWH 0XO OLNHQ FKDUJHV  ³7KH ZULWHU LV LQGHEWHG WR 0U : -DXQ]HPLV IRU FDOFXODWLRQV ZKRVH UHVXOWV DUH HPERGLHG LQ WKH WDEOHV´ >@ ,Q WKH ODVW  \HDUV WUHPHQGRXV H[SRQHQWLDO JURZWK RI FRPSXWLQJ SRZHU ZDV DFKLHYHG 6LQFH QHDUO\  \HDUV WKLV JURZWK KDV EHHQ DFKLHYHG E\ DGGLQJ

 more and more processor-cores and so code has to be optimized to run on multiple processors, see Figure 1.8). Two major computational modeling approaches are in use today. Approach- es based on molecular mechanics (MM) rely on force fields to simulate a molecule with Newtonian limitations and scale, depending on the implemen- tation details between O(n log(n)) to O(n2), see also Figure 1.7. While the O(n· log(n)) algorithm involves a computationally expensive reverse Fourier transformation, it scales much better with larger problem sizes.

Figure 1.7. It is important to estimate the computational costs of algorithms. The big O notation describes the worst-case performance of an algorithm, when the argument n tends to infinity, and it provides a worst-case estimate of how many operations an algorithm will need to arrive at a result given a problem of size n. For example, O(n) means that the computational cost will increase linearly with the problem size, and O(n2) that it will increase to the square, as the problem size is increased. The big O notation estimates only how many iterations the algorithm needs. In practice, a higher- order algorithm may be faster for a range of problem sizes if the costs per iteration are lower.

One limitation of classical force fields is that they are unable to describe ex- cited states and thus usually limited to nonreactive simulations (i.e., no bonds can be formed or broken). To describe excited states numerical approxima- tions to solve the Schrödinger equation are employed. These approaches are referred as quantum mechanical (QM) approaches and scale, depending on the implementation details, usually between O(n3) (DFT) to O(n7) (CCSD(T)), O(n8) (CCSDT) [71, 72]. Many of the QM approaches used today are hence limited to system sizes ranging from between a few dozens to about 1,000 atoms, depending on the available resources and approach used. Efforts are ongoing to reduce the or- der of quantum mechanical approaches required and to achieve linear scal-

17 )LJXUH  7KH IDVWHVW VXSHUFRPSXWHUV RYHU WKH FRXUVH RI WKH SDVW  \HDUV 1RWDEO\ D VPDUWSKRQH SURFHVVRU FRXOG RXWUXQ WKH IDVWHVW FRPSXWHUV IURP  DQG DQ RIILFH FRPSXWHU HVSHFLDOO\ LI HTXLSSHG ZLWK D PRGHUQ JUDSKLF SURFHVVLQJ XQLW *38  FDQ FRPSHWH ZLWK VXSHUFRPSXWHUV IURP WKH V $QRWKHU LPSRUWDQW SRLQW LV WKDW WKH VSHHG JDLQ ZLWK VLQJOHFRUH VLOLFRQ FKLSV KDV VORZHG DURXQG  DQG WKLV KDV EHHQ FRPSHQVDWHG IRU E\ WKH LQWURGXFWLRQ RI SDUDOOHO FRPSXWLQJ ZLWK PXOWLSOH SURFHVVRU FRUHV 7KH ¶7ULROLWK&OXVWHU¶ ZDV WKH IDVWHVW VXSHUFRPSXWHU RU FOXVWHU XVHGLQWKLV ZRUN

LQJ $FWXDOO\ OLQHDU VFDOLQJ LPSOHPHQWDWLRQV H[LVW WKDW VFDOH ZLWK PRUH WKDQ  DWRPV >±@ DQG VFDOLQJ KDV HYHQ EHHQ GHPRQVWUDWHG IRU PLOOLRQV RI DWRPV >@ 7KH VFDOLQJ LV KRZHYHU RQO\ DFKLHYHG ZKHQ DSSUR[LPDWLRQV RI ORQJUDQJH LQWHUDFWLRQV DUH HPSOR\HG DQG ZRUNV EHVW IRU YHU\ ODUJH V\VWHPV ZKHUH DJDLQ WKH ORQJUDQJH LQWHUDFWLRQV FDQ EH DSSUR[LPDWHG LH ³QHDUVLJKW HGQHVV´ > @ 7KH ODUJH SUHH[SRQHQWLDO IDFWRUV PDNH WKHVH DSSURDFKHV XQWLO QRZ HYHQ WKRXJK OLQHDUO\ VFDOLQJ VWLOO RYHUO\ H[SHQVLYH 7KLV EHFRPHV SDUWLFXODUO\ DUWLFXODWH ZKHQ ODUJH V\VWHPV ZLWK WKRXVDQGV RI DWRPV DQG PDQ\ GLIIHUHQW FRQIRUPDWLRQV DOO DFFHVVLEOH DW DPELHQW WHPSHUDWXUH DUH FRQVLGHUHG )RU WKHVH V\VWHPV LW LV LPSRUWDQW WR VDPSOH GLIIHUHQW FRQIRUPDWLRQV WR JHW DQ RYHUYLHZ RI WKH HQHUJHWLF ODQGVFDSH DQG FRQILJXUDWLRQV WKDW DUH DFFHVVLEOH 0RGHOLQJ FRPSOHWH HQ]\PHV LV WKHUHIRUH QRW IHDVLEOH ZLWK 40RQO\ PHWKRGV , ZRXOG OLNH WR PHQWLRQ KHUH WKDW VHPLHPSLULFDO PHWKRGRORJLHV HJ $0 WKDW LQFUHDVH UHDFK H[LVW DOWKRXJK DJDLQ DFFXUDF\ LV OLPLWHG >±@ +HQFH PRGHOLQJ FKHPLFDO UHDFWLRQV LQ ODUJH V\VWHPV LV YHU\ GLIILFXOW WR GR XVLQJ 40 DSSURDFKHV 6RPH UHVHDUFKHUV KDYH SXEOLVKHG ZRUN WKDW FRQVLGHUV RQO\ D VXEVHW RI DWRPV LQYROYHG LQ D UHDFWLRQ LQ DQ HQ]\PDWLF V\VWHP LHWKH FDWDO\WLFDOO\ FUXFLDO DWRPV LQ WKH DFWLYH VLWH ZKLOH WKH UHPDLQGHU RI WKH HQ]\PH LV QHJOHFWHG > @ :KLOH VXFK FRQWLQXXP PRGHO DSSURDFKHV PD\ \LHOG UH VXOWV WKH\ VXIIHU IURP WKH VHULRXV GUDZEDFN WKDW D YDVW SDUW RI WKH SURWHLQ LV LJQRUHG LHPRVWnd VKHOO UHVLGXHV DQG WKHLU FRQWULEXWLRQV  $GGLWLRQDOO\ DWRPV KDYH WR EH DQFKRUHG DQGRU DUH IUR]HQ DQG VDPSOLQJ LV QRW REWDLQHG

 7R VXP XS PRGHOLQJ UHDFWLRQV LQ ODUJH V\VWHPV VXFK DV SURWHLQV LV LQHIIL FLHQW ZKHQ GRQH ZLWK 40RQO\ PHWKRGV :KLOH LW FDQ EH GRQH LW LV RIWHQ YHU\ H[SHQVLYH VXIILFLHQW VDPSOLQJ LV KDUG WR DFKLHYH DQG UHVXOWV PD\ QRW EH DF FXUDWH 2QH DOWHUQDWLYH LV WR XVH UHDFWLYH IRUFH ILHOGV VXFK DV 5HD[)) ZKHUH WKH IRUFH ILHOG LV EDVHG RQ ERQG RUGHUV QRW H[SOLFLW ERQGV DQG FDQ XQGHUJR UHDFWLRQV IRU FODVVHV RI SDUDPHWHUL]HG UHDFWLRQV >@ ,GHDOO\ RQH ZRXOG XVH D PHWKRG WKDW VFDOHV DV 00EDVHG DSSURDFKHV GR EXW ZLWK WKH DFFXUDF\ RI D 40EDVHG DSSURDFK 2QH SRVVLEOH VROXWLRQ WR WKLV SUREOHP LV PXOWLVFDOH PRGHOLQJ ZKLFK ZDV XVHG DOUHDG\ LQ WKH V ,Q PXOWLVFDOH PRGHOLQJ D FRPELQDWLRQ RI GLIIHUHQW OHYHOV RI DFFXUDF\ IRU GLI IHUHQW SDUWV RI D V\VWHP LV XVHG )RU H[DPSOH WZR OHYHOV RI WKHRU\ FRXOG EH DSSOLHG 40 IRU WKH UHDFWLYH SDUW RI WKH V\VWHP DQG 00 IRU WKH UHVW RI WKH V\VWHP 7KH 40 SDUW LV RIWHQ VLPXODWHG EDVHG RQ PROHFXODU RUELWDO 40 DS SURDFKHV ,W LV KRZHYHU DOVR SRVVLEOH WR XVH D YDOHQFH ERQG EDVHG DSSURDFK DQG WR FRXSOH LW ZLWK 00 ,Q WKLV WKHVLV WKH PDLQ ZRUNKRUVH LV WKH HPSLULFDO YDOHQFH ERQG DSSURDFK DV LPSOHPHQWHG LQ 4 ZKLFK ZLOO EH GHVFULEHG LQ WKH PHWKRGV VHFWLRQ

 2XWOLQH 7KH PDLQ PRWLYDWLRQ RI WKLV WKHVLV LV WR JDLQ D GHHSHU XQGHUVWDQGLQJ RI WKH UHDF WLRQ PHFKDQLVP RI WKH SRWDWR HQ]\PH 6RODQXP WXEHURVXP HSR[LGH K\GURODVH  6W(+  ,W DOVR LQYROYHG WKH DVVRFLDWHG PHWKRG GHYHORSPHQW RI &$'(( WR DXWRPDWH LQ VLOLFR HQ]\PH HYROXWLRQ ,Q WKH QH[W FKDSWHU , ZLOO GLVFXVV WKH PHWKRGV XVHG DQG WKH WHFKQLFDO EDFNJURXQG ,Q WKH PHWKRG GHYHORSPHQW FKDS WHU , SUHVHQW ZRUN RQ WKH RFWUDKHGUDO GXPP\ PRGHO ZKLFK , LQLWLDOO\ SODQQHG WR XVH LQ RWKHU PHWDOFRQWDLQLQJ HQ]\PHV VXFK DV $'+$ , ZLOO WKHQ SUHVHQW &$'(( &RPSXWHU$LGHG 'LUHFWHG (YROXWLRQ RI (Q]\PHV D IUDPHZRUN WR VLPSOLI\ SUHSDUDWLRQ DQG SDUDOOHO FRPSXWDWLRQ RI LQ VLOLFR JHQHUDWHG SURWHLQ YDULDQWV )LQDOO\ , ZLOO GLVFXVV WKH HSR[LGH K\GURODVH 6W(+ ZKLFK ZDV LQ YHVWLJDWHG XVLQJ D FRPELQDWLRQ RI NLQHWLF PHDVXUHPHQWV FU\VWDOORJUDSKLF LQ YHVWLJDWLRQV DQG FRPSXWDWLRQDO ZRUN

  0HWKRGV DQG 7KHRUHWLFDO %DFNJURXQG

 (Q]\PH .LQHWLFV 5HDFWLRQ NLQHWLFV FDQ EH GHWHUPLQHG E\ PHDVXULQJ WKH FRQFHQWUDWLRQ RI UHDFWDQW RU SURGXFW RYHU WLPH 7KLV LV VWUDLJKWIRUZDUG SDUWLFXODUO\ ZKHQ WKH EDFNZDUG UHDFWLRQ LV YHU\ VORZ RU ZKHQ WKH SURGXFW LV UHPRYHG IURP WKH UHDFWLRQ DQG WKH EDFNZDUG UHDFWLRQ FDQ EH LJQRUHG $ UHODWLRQVKLS EHWZHHQ WHPSHUDWXUH DQG UHDFWLRQUDWH ZDV VXJJHVWHG E\ $UUKHQLXV DQG LV NQRZQ DV WKH $UUKHQLXV HTXDWLRQ /DWHU WKH (\ULQJ3RODQL HTXDWLRQ ZDV LQWURGXFHG WR GHVFULEH UHDF WLRQ NLQHWLFV 2I SDUWLFXODU XVH IRU HQ]\PDWLF UHDFWLRQV DUH WKH FRQWULEXWLRQV RI 0LFKDHOLV DQG 0HQWHQ >@ ZKR GHYHORSHG D ZHOONQRZQ VLPSOLILHG PRGHO WKDW LV XVHG WR FKDUDFWHUL]H HQ]\PH NLQHWLFV VHH DOVR )LJXUH  

[S]  K M v = Vmax = kcat[E0] Vmax

Vmax [S] = K M v = V 2

Vmax[S] [S]  K M v = v0 = KM+[S]

KM [S] )LJXUH  ,I NLQHWLF PHDVXUHPHQWV DUH SHUIRUPHG RQ HQ]\PHV W\SLFDO GDWD REWDLQHG XVLQJ WKH 0LFKDHOLV0HQWHQ NLQHWLFV DUH WKH 0LFKDHOLV0HQWHQ FRQVWDQW .0 DQG NFDW D PHDVXUH RI WKH UDWH RI FDWDO\WLF SURGXFWIRUPDWLRQ XQGHU RSWLPXP FRQGLWLRQV 7KH\ FDQ EH REWDLQHG E\ SORWWLQJ WKH VXEVWUDWH FRQFHQWUDWLRQ >6@ DJDLQVW WKH UHDFWLRQ YHORFLW\ V  7KLV SORW LOOXVWUDWHV VSHFLILF FDVHV IRU ORZ DQG KLJK FRQFHQWUDWLRQ RI VXEVWUDWH DQG IRU .0 >6@

7KH 0LFKDHOLV0HQWHQ FRQVWDQW .0 LV WKH FRQFHQWUDWLRQ RI VXEVWUDWH QHHGHG IRU WKH HQ]\PH WR SHUIRUP DW KDOI RI LWV PD[LPXP UHDFWLRQ UDWH $V PHDVXUH RI WKH FDWDO\WLF SURGXFWIRUPDWLRQ XQGHU RSWLPXP FRQGLWLRQV NFDW LV XVHG ,W LV UHSUHVHQWLQJ WKH QXPEHU RI SURGXFW PROHFXOHV IRUPHG SHU VHFRQG V−1 DQGLQ FRPSOH[ V\VWHPV ZLWK VHYHUDO VWHSV NFDW LV WKH UDWH FRQVWDQW RI WKH UDWHOLPLWLQJ VWHS $ PHDVXUH IRU FDWDO\WLF HIILFLHQF\ LV NFDW.0 ZKLFK WDNHV ERWK VXEVWUDWH ELQGLQJ DQG FRQYHUVLRQ WR SURGXFW LQWR DFFRXQW

 $VVXPLQJ D VLPSOH PRGHO V\VWHP DQ HQ]\PDWLF UHDFWLRQ FDQ EH YLHZHG DV DQ LQLWLDO HTXLOLEULXP RI HQ]\PH DQG VXEVWUDWH (6 WKDW HQFRXQWHU DQG IRUP DQ HQ]\PHVXEVWUDWH FRPSOH[ (6  UHDFW DQG ILQDOO\ VHSDUDWH DV HQ]\PH DQG SURGXFW (3 VHH 6FKHPH  DQG )LJXUH   7KH EDFNZDUG UHDFWLRQ RI WKH UDWHOLPLWLQJ VWHS N  LV KHUH DVVXPHG WR EH QHJOLJLEOH ZKLFK LV QDWXUDOO\ QRW QHFHVVDULO\ WKH FDVH

‡ Guncat

G‡

Energy cat

G0 E+S ES EP E+P Reaction Coordinate )LJXUH  7KH UHODWLRQVKLS EHWZHHQ DQ XQFDWDO\]HG UHDFWLRQ DQG DQ HQ]\PH FDWD O\VW 5DWH DFFHOHUDWLRQ LV DFKLHYHG E\ UHGXFLQJ WKH DFWLYDWLRQ HQHUJ\ ΔG‡) QHHGHG WR RYHUFRPH WKH KLJKHVW EDUULHU DQG WKH UHDFWLRQ IUHH HQHUJ\ ΔG0 GRHV QRW FKDQJH

N N (6 (6 (3 6FKHPH  N  7KH LQLWLDO YHORFLW\ v0) RI SURGXFW IRUPDWLRQ PD\ EH REVHUYHG H[SHULPHQ WDOO\ DQG LW GHSHQGV RQ WKH FRQFHQWUDWLRQ RI HQ]\PHVXEVWUDWH FRPSOH[ >(6@ DQG k2  

v0 = k2[ES]  7KH FRQFHQWUDWLRQ RI IUHH HQ]\PH LV HTXDO WR WKH WRWDO DPRXQW RI HQ]\PH PLQXV WKH HQ]\PH LQ HQ]\PHVXEVWUDWH FRPSOH[

[E]=[E]tot − [ES]  7ZR DVVXPSWLRQV DUH PDGH KHUH QDPHO\ WKDW WKH FRQFHQWUDWLRQ RI VXEVWUDWH LV PXFK KLJKHU WKDQ WKH FRQFHQWUDWLRQ RI SURGXFW DQG FRQFHQWUDWLRQ RI HQ]\PH ,I WKH FRQFHQWUDWLRQ RI WKH HQ]\PHVXEVWUDWH FRPSOH[ LV FRQVWDQW WKH V\VWHP LV UHIHUUHG DV EHLQJ LQ VWHDG\ VWDWH

k−1[ES]+k2[ES]=k1[E][S] 

 7R GHILQH KM  ZH VHSDUDWH WKH FRQFHQWUDWLRQV DQG UDWHV

(k−1 + k2)/k1 =[E][S]/[ES] 

$QG GHILQH KM DV

KM =(k−1 + k2)/k1 

7KHQ ZH VXEVWLWXWH KM EDFN LQWR HT  WR DUULYH DW

KM =[E][S]/[ES]  6XEVWLWXWLQJ  LQWR HT  JLYHV

KM =([E]tot − [ES])[S]/[ES]  1H[W ZH LVRODWH >(6@

[ES]([S]+KM )=[E]tot[S] 

[E] [S] [ES]= tot  KM +[S] 6XEVWLWXWLQJ  LQWR HT  JLYHV

[E]tot[S] v0 = k2  KM +[S] 7KH PD[LPXP YHORFLW\ RI WKH HQ]\PDWLF UHDFWLRQ LV JLYHQ E\

Vmax = k2[E]tot 

5HSODFLQJ k2 ZLWK  \LHOGV WKH 0LFKDHOLV0HQWHQ HTXDWLRQ

Vmax[S] v0 =  KM +[S] :KHUH WKH LQLWLDO UDWH RI WKH UHDFWLRQ v0 LV VHW LQ UHODWLRQ WR WKH VXEVWUDWH FRQFHQWUDWLRQ [S] ZKLFK DOORZV WKH GHWHUPLQDWLRQ RI PD[LPXP UDWH Vmax DQG WKH 0LFKDHOLV0HQWHQ FRQVWDQW .0 $GGLWLRQDOO\ E\ XVLQJ PRUH VRSKLVWLFDWHG H[SHULPHQWDO DSSURDFKHV LW LV SRVVLEOH WR GHWHUPLQH WKH EDUULHUV RI LQWHUPHGLDWH (, VWHSV VHH 6FKHPH  

N N N (6 (6 (, (3 6FKHPH  N  N  7KLV FDQ EH DFKLHYHG E\ XVLQJ SUH±VWHDG\VWDWH NLQHWLF PHDVXUHPHQWV LI WKH LQWHUPHGLDWH LV H[SHULPHQWDOO\ REVHUYDEOH 7KH HQ]\PH UHDFWLRQ LV RIWHQ PHD VXUHG XVLQJ VWRSSHGIORZ SURFHGXUHV VXEVWUDWH LV H[SRVHG WR DQ HQ]\PH DQG

 WKH LQWHUPHGLDWH LV REVHUYHG RYHU WLPH ,W LV SRVVLEOH GHSHQGLQJ RQ WKH UHDFWLRQ VHWXS WR REVHUYH GLIIHUHQW VWHSV RI D UHDFWLRQ 'HSHQGLQJ RQ WKH PHFKDQLVPRI WKH UHDFWLRQ LQYHVWLJDWHG D PRGHO IRU WKH REVHUYHG UDWH FDQ EH FRQVWUXFWHG )RU D FDVH OLNH GHSLFWHG LQ 6FKHPH  DVVXPLQJ >(6@ DQG >(6@ DUH LQ HTXLOLEULXP IRU H[DPSOH k2[S] kobs = + k−2 + k3  KS +[S]

+HUH KS UHSUHVHQWV WKH GLVVRFLDWLRQ FRQVWDQW k/k1  7KH VPDOOHU WKH GLVVRFLDWLRQ FRQVWDQW WKH KLJKHU WKH SUREDELOLW\ WKDW SURGXFW ZLOO EH IRUPHG DIWHU WKH 0LFKDHOLV FRPSOH[ KDV IRUPHG 7KLV GDWD FDQ WKHQ EH XVHG RIWHQ LQ FRQMXQFWLRQ ZLWK RWKHU H[SHULPHQWDO REVHUYDEOHV LH ZLWK D ' VWUXFWXUH RI D SDUWLFXODU HQ]\PH WR DUWLFXODWH DQG YDOLGDWH D UHDFWLRQ PHFKDQLVP DQG WR REWDLQ LQIRUPDWLRQ RQ KRZ DQ HQ]\PH FDQ EH LPSURYHG

 6HTXHQFH $QDO\VLV 7KH VHTXHQFH RI D SURWHLQ LV WKH GLPHQVLRQDO FRQVWUXFWLRQ SODQ RI WKH VDPH SURWHLQ ,W LV WKHUHIRUH SRVVLEOH WR FRPSDUH SURWHLQV E\ FRPSDULQJ WKH VH TXHQFHV RI UHODWHG SURWHLQV WR HDFK RWKHU &OHDUO\ FRPSDULQJ WZR SURWHLQV FDQ EH HDV\ LI WKHLU VHTXHQFHV DUH YHU\ FORVHO\ UHODWHG HJ LI RQO\ D VLQJOH SRLQW PXWDWLRQ LV WKH GLIIHUHQFH  :KLOH WZR LGHQWLFDO VHTXHQFHV FDQ EH FRPSDUHG PDQXDOO\ LW EHFRPHV YHU\ WHGLRXV WR FRPSDUH WZR RU PRUH VHTXHQFHV WKDW PD\ KDYH LQVHUWLRQV DQG GHOHWLRQV RU ORZ VHTXHQFH VLPLODULW\ ZKLFK LV DOVR UHIHUUHG WR DV VHTXHQFH LGHQWLW\  7KLV LV HPSKDVL]HG E\ WKH QXPEHU RI NQRZQ SUR WHLQ VHTXHQFHV 7RGD\ PRUH WKDQ  XQLTXH VHTXHQFHV DUH DYDLODEOH RQ ZZZXQLSURWFRP DV RI -DQ   PRUH WKDQ  PDQXDOO\ DQQR WDWHG DQG  DXWRPDWLFDOO\ DQQRWDWHG VHTXHQFHV DUH DFFHVVLEOH RQOLQH  7RROV ZHUH GHYHORSHG WR PDQDJH WKHVH SURWHLQ VHTXHQFHV WR RUJDQL]H DQG DOLJQ VHTXHQFHV WR HDFK RWKHU DQG WR FRPSDUH VHTXHQFHV IRU VLPLODULW\ $PRQJ WKH EHVW NQRZQ DSSURDFKHV DUH WKH %DVLF /RFDO $OLJQPHQW 6HDUFK 7RRO %/$67 >@ DQG IRU PXOWLSOH VHTXHQFH DOLJQPHQWV &OXVWDO Ω >@ 9DULRXV ZD\V WR GLVSOD\ WKH VLPLODULW\ RI VHTXHQFHV H[LVW DQG :HE/RJR >@ LV D ZHOONQRZQ DOWHUQDWLYH WR GLVSOD\ UDZ VHTXHQFHV

 6HTXHQFH 6SDFH ,W LV KDUG WR LPDJLQH WKH VHTXHQFH VSDFH RI SURWHLQV DQG HQ]\PHV EHFDXVH RI LWV H[SRQHQWLDO JURZWK 1 ZLWK 1 UHSUHVHQWLQJ WKH VHTXHQFH OHQJWK  $ EUXWH IRUFH DSSURDFK LH WU\LQJ DOO SRVVLEOH FRPELQDWLRQV  RQ D VHTXHQFH VSDFH RI MXVW  DPLQRDFLGV ZRXOG \LHOG ≈ XQLTXH VHTXHQFHV D QXPEHU FRP SDUDEOH WR WKH HVWLPDWHG QXPEHU RI DWRPV LQ WKH NQRZQ XQLYHUVH  >@ +HQFH LW LV XQIHDVLEOH WR WU\ RXW WKH WRWDO VHTXHQFH VSDFH RI D SURWHLQ ,QVWHDG

 GLUHFWHG HYROXWLRQ H[SHULPHQWV DUH XVHG ZLWK WKH LQKHUHQW ULVN RI JHWWLQJ VWXFN LQ ORFDO PLQLPD > @ +HUH FRPSXWDWLRQDO WRROV DQG VWUXFWXUDO LQIRUPDWLRQ DUH EHVW XVHG LQ FRPELQDWLRQ ZLWK GLUHFWHG HYROXWLRQ DSSURDFKHV %HFDXVH WKH VHTXHQFH VSDFH JURZV H[SRQHQWLDOO\ PXWDJHQHVLV RI PRUH WKDQ RQH UHVLGXHVRI D SURWHLQ EHFRPHV YHU\ ODUJH TXLFNO\ )RU H[DPSOH FRQVLGHU D FDVH LQ ZKLFK D SURWHLQ LV HQJLQHHUHG DQG WZR RU PRUH SRVLWLRQV DUH PXWDWHG VLPXOWDQHRXVO\ :KLOH D VLQJOHSRLQW PXWDWLRQ WR DOO  QDWXUDO DPLQR DFLGV ZRXOG SURGXFH  YDULDQWV WKH QXPEHU RI YDULDQWV LQ D GRXEOH RU WULSOH SRLQWVDWXUDWLRQ ZLOO JURZ H[SRQHQWLDOO\ E\ D IDFWRU RI  DQG OHDG WR  GXDOSRLQW VDWXUDWLRQ [  RU  WULSOH VDWXUDWLRQ  [  [  HQ]\PH YDULDQWV 7KH SODLQWH[WRI DOO WKH VHTXHQFHV JHQHUDWHG ZKHQ VLPXOWDQHRXVO\ SHUPXWLQJ  SRVLWLRQV IURP D  DPLQRDFLG SURWHLQ ZRXOG QHHG QHDUO\  7% RI VWRUDJH DORQH 7KLV JURZWK FDQ EH VOLJKWO\ UHGXFHG LW UHPDLQV H[SRQHQWLDO LI D VXEVHW RI DPLQR DFLGV LV XVHG DV KDV EHHQ VXJJHVWHG DQG DSSOLHG H[SHULPHQWDOO\ >@ 7KH H[DPSOH DERYH ZRXOG VFDOH ZLWK  DPLQR DFLGV WR SURGXFH    PXWDQW VH TXHQFHV RU  *% RI WH[W

 6WUXFWXUDO $QDO\VLV DQG 7RROV  2EWDLQLQJ 3URWHLQ 6WUXFWXUHV $Q HQRUPRXVO\ ODUJH QXPEHU RI FRQIRUPDWLRQV DUH WKHRUHWLFDOO\ DFFHVVLEOHWR HQ]\PHV 1HYHUWKHOHVV WKH\ RIWHQ IROG LQWR RQH ELRORJLFDOO\ DFWLYH FRQIRUPD WLRQ ZLWK D IHZ QRWDEOH H[FHSWLRQV VXFK DV LQWULQVLFDOO\ XQVWUXFWXUHG SURWHLQV >@ 7R SUHGLFW WKH IROG RI D SURWHLQ XVLQJ RQO\ VHTXHQFH LQIRUPDWLRQ UHPDLQV DQ XQVROYHG FKDOOHQJH (YHQ ZLWK YHU\ SRZHUIXO FRPSXWLQJ UHVRXUFHV OLNH )ROGLQJ#+RPH >@ RU $QWRQ  >@ LW UHPDLQV D SUREOHP WR IROG SURWHLQV FRU UHFWO\ LQ VLOLFR > @ 7KLV PHDQV WKDW ZLWKRXW H[SHULPHQWDOO\ REWDLQHG ' VWUXFWXUHV RI SURWHLQV LW LV YHU\ GLIILFXOW WR REWDLQ LQVLJKW LQ WKH GLPHQVLRQDO RUJDQL]DWLRQ RI SURWHLQV 7KHUHIRUH ZKHQHYHU SRVVLEOH D ' VWUXFWXUH LV REWDLQHG H[SHULPHQWDOO\ XVXDOO\ E\ XVLQJ ;UD\ GLIIUDFWLRQ >@ 7KH SURWHLQ LV H[SUHVVHG SXULILHG DQG FU\VWDOOL]HG DQG WKHQ ;UD\ GLIIUDFWLRQ GDWD LV FROOHFWHG ,I DOO SUHYLRXV VWHSV ZHUH VXFFHVVIXO DQG WKH FU\VWDO GLG GLIIUDFW D GHQVLW\ PDS FDQ EH REWDLQHG 7KLV LQ WXUQ DOORZV ' FRRUGLQDWHV WR EH FRPSXWHG HJ XVLQJ &RRW >@  7KH ' GDWD LV VWRUHG LQ YDULRXV GDWDEDVHV 2QH ZHOONQRZQ H[DPSOH LV WKH 3URWHLQ 'DWD %DQN ZZZUFVERUJ >@  ZKHUH  VWUXFWXUHV DUH DYDLO DEOH DV RI -DQXDU\    ,W LV KRZHYHU QRW WULYLDO WR H[SUHVV DQG FU\VWDOOL]H D SURWHLQ DQG HYHQ ZKHQ LW LV SRVVLEOH WKH FU\VWDOV PD\ QRW GLIIUDFW RU PD\ EH RI SRRU TXDOLW\ $V DQ DOWHUQDWLYH WR FU\VWDOOL]DWLRQ WKH SURWHLQ PD\ EH H[SUHVVHG LQ DQ LVRWRSHHQULFKHG PHGLXP DQG VXEMHFWHG WR QXFOHDU PDJQHWLF UHVRQDQFH 105 VSHFWURVFRS\ SURYLGHG LW LV VXIILFLHQWO\ VROXEOH WR \LHOG D JRRG VLJQDO 2WKHU HPHUJLQJ WHFKQRORJLHV VXFK DV VLQJOHPROHFXOH ;UD\ ODVHU GLIIUDFWLRQ

 >@ DQG FU\RHOHFWURQ PLFURVFRS\ FU\R(0 DUH EHLQJ GHYHORSHG 5HFHQWO\ D  c FU\R(0 VWUXFWXUH ZDV SXEOLVKHG >@

 /LPLWDWLRQV RI +LJK5HVROXWLRQ 6WUXFWXUHV ;UD\ VWUXFWXUHV XVXDOO\ ODFN K\GURJHQ DWRPV DQG VR K\GURJHQ LV XVXDOO\ DGGHG GXULQJ SRVWSURFHVVLQJ HJ EHIRUH VWDUWLQJ D PROHFXODU G\QDPLF VLPXODWLRQ  7KH UHDVRQ IRU WKLV LV WKDW WKH HOHFWURQ GHQVLW\ DURXQG K\GURJHQ DWRPV LV ORZ DQG RIWHQ SRODUL]HG WRZDUG KHDY\ DWRPV DQG DV WKH OLJKWHVW HOHPHQW K\GUR JHQ LV PRELOH HYHQ DW ORZ WHPSHUDWXUHV DQG WKH HOHFWURQ GHQVLW\ LV VSUHDG RXW $GGLWLRQDOO\ LW LV LPSRUWDQW WR QRWH WKDW FU\VWDO VWUXFWXUHV DUH RIWHQ REWDLQHG DW D S+ WKDW DOORZV WKH SURWHLQ WR FU\VWDOOL]H ZKLFK LV QRW QHFHVVDULO\ WKH VDPH DV WKH SK\VLRORJLFDO S+ RI WKH KRVW RUJDQLVP 7R DGGUHVV WKH LVVXHV WKDW DULVH IURP SURWRQDWLRQ VHYHUDO WRROV DUH DYDLODEOH LQFOXGLQJ 0RO3URELW\ >@ ZKLFK FDQ EH XVHG WR ILQG WKH PRVW OLNHO\ SURWRQDWLRQ VWDWHV DQG HYHQ VLGH FKDLQ IOLSV6LGH FKDLQ IOLSV RFFXU ZKHQ WKH GHQVLW\ RI D FU\VWDO LV QRW DOORZLQJ WR FOHDUO\ DVVLJQ WKH RULHQWDWLRQ RI VLGH FKDLQV OLNH KLVWLGLQH DUJLQLQH RU JOXWDPLQH 7KHVH FDVHV FDQ EH IRXQG DQG UHVROYHG ZLWK K\GURJHQ ERQG QHWZRUN DQDO\VLV $GGLWLRQDOO\ WRROV OLNH 3523.$ FDQ EH HPSOR\HG WR HVWLPDWH S.D >@

 6WUXFWXUHV RI 3URWHLQ 9DULDQWV 3URWHLQV DUH VXEMHFW WR ERWK QDWXUDO HYROXWLRQ DQG KXPDQ HQJLQHHULQJ )RU H[DPSOH UDQGRP SRLQW PXWDWLRQV FDQ KDSSHQ WR DQ RUJDQLVP LI LW LV H[SRVHG WR LRQL]LQJ UDGLDWLRQ $GGLQJ D UDGLDWLRQ VRXUFH DQG KHQFH DFFHOHUDWLQJ WKH QDWXUDO PXWDWLRQ UDWH LV XVHG WR LPSURYH FURS >@ :LWK WKH VRSKLVWLFDWHG ELRWHFKQRORJLFDO WRROV DYDLODEOH WRGD\ LW LV DOVR SRVVLEOH WR HGLW WKH '1$RIDQ RUJDQLVP DQG GLUHFWO\ DOWHU LW )RU H[DPSOH SRLQW PXWDWLRQV PD\ EH LQWURGXFHG WR DOWHU WKH IXQFWLRQ RI DQ HQ]\PH ([SHULPHQWDOO\ WKLV FDQ EH XVHG WR WHVW ZKHWKHU D UHVLGXH DVVXPHG WR EH FUXFLDO IRU FDWDO\VLV LV LQGHHG FUXFLDO HJ E\ PXWDWLQJ DQ DVSDUWDWH WKDW LV DV VXPHG WR EH WKH QXFOHRSKLOH WR DQ DODQLQH  ,I WKLV HQ]\PH LV ODWHU GHWHUPLQHG WR EH LQDFWLYH FKDQFHV DUH WKDW WKLV VSHFLILF DVSDUWDWH VLGH FKDLQ LV LQGHHG FUX FLDO IRU FDWDO\VLV HVSHFLDOO\ LI WKH VWUXFWXUH RI WKH HQ]\PH GLG QRW FKDQJH ,Q VXFK D VFHQDULR D VLQJOH PXWDWLRQ UHQGHUHG WKH HQ]\PH LQDFWLYH ZKLOH OHDYLQJ LW VWUXFWXUDOO\ LQWDFW ,Q RWKHU FDVHV D PXWDWLRQ PD\ LQWURGXFH D VLJQLILFDQW FKDQJH LQ WKH VWUXFWXUH RI D SURWHLQ DV IRU H[DPSOH ZKHQ 6W(+ ZDV PRGLILHG DQG + ZDV PXWDWHG WR DVSDUDJLQH >@ ,Q PDQ\ RWKHU FDVHV WKH SURWHLQ UHWDLQV LWV ' VWUXFWXUH DV LQ FDVH RI WKH <)PXWDQW >@ )RU VXFK VXEVWL WXWLRQV D PXWDQW VWUXFWXUH PD\ EH REWDLQHG E\ ³PXWDWLQJ´ WKH FRUUHVSRQGLQJ VLGH FKDLQ LQ VLOLFR 7R LQWURGXFH RU FKDQJH SURWHLQ VLGH FKDLQV D QXPEHU RI WRROV KDYH EHFRPH DYDLODEOH )RU H[DPSOH WKH 5DPDFKDQGUDQ SORW JLYHV DQ HVWLPDWH LI WKH DQJOHV

 RI WKH SURWHLQ EDFNERQH DUH LQ WKHLU HQHUJHWLFDOO\ SUHIHUUHG UHJLRQ VHH ILJ   $OVR WKDQNV WR WKH WUHPHQGRXV QXPEHU RI VROYHG ;UD\ VWUXFWXUHV WKDW DUH QRZ



 )LJXUH  5DPDFKDQGUDQ SORW RQ WKH H[DPSOH RI 6W(+ 3'%,' &-3$  7KH SORW YLVXDOL]HV WKH HQHUJHWLFDOO\ IDYRUHG UHJLRQV RI WKH EDFNERQH WRUVLRQV RIψ &&α DQG ϕ &α1  7KLV SORW ZDV FUHDWHG ZLWK 90'  >@ DQG PRGLILHG ZLWK ,QNVFDSH >@

DYDLODEOH 'XQEUDFN HW DO > @ DQG RWKHUV KDYH FUHDWHG URWDPHU OLEUDULHV RI W\SLFDO UHVLGXH RULHQWDWLRQV DQG FRPSXWDWLRQDO PRGHOLQJ DQG YLVXDOL]DWLRQ WRROV VXFK DV &KLPHUD >@ RU 3\02/ >@ RIIHU EXLOWLQ PXWDJHQHVLV WRROV 7KHVH WRROV XVXDOO\ SURYLGH DQ LQWHUIDFH WR URWDPHU OLEUDULHV DQG DOORZ XVHUV WR FKRRVH WKHLU SUHIHUUHG VLGH FKDLQ RULHQWDWLRQ 6RPH VFRULQJ IXQFWLRQ LV XVXDOO\ SURYLGHG DV ZHOO PDNLQJ WKH WRROV HYHQ PRUH XVHU IULHQGO\ HJ 3\02/¶V ³0XWDJHQHVLV :L]DUG´  $ QXPEHU RI SURJUDPV KDYH EHHQ ZULWWHQ WR DXWRPDWH WKH VHOHFWLRQ RI WKH EHVW URWDPHU $ ZHOONQRZQ SURJUDP LV 6&:5/ >@ ,W LV DEOH WR FRQVWUXFW SURWHLQ VLGH FKDLQ RULHQWDWLRQV JLYHQ MXVW WKH EDFNERQH WUDFH ,PSRUWDQWO\ IRU WKLV ZRUN 6&:5/ DOVR VROYHV D FRPELQDWRULDO SUREOHP ZKHQ XVLQJ OLEUDULHV LI D UHVLGXH LV WRR ODUJH DQG QHLJKERULQJ UHVLGXHV QHHG WR EH PRYHG RU LI WZR QHLJKERULQJ UHVLGXHV FODVK LW TXLFNO\ EHFRPH GLIILFXOW WR ILQG WKH EHVW VROX WLRQ 7KDQNV WR D KHXULVWLF 6&:5/ DOZD\V REWDLQV D VROXWLRQ IRU VXFK FDVHV (YHQ LI PDQ\ GHHSO\ QHVWHG DQG LQWHUGHSHQGHQW FRPELQDWLRQV RI VLGH FKDLQ RULHQWDWLRQV H[LVW

 7UDQVLWLRQ 6WDWH 7KHRU\ (DUO\ DGYDQFHV WRZDUGV WKH IXQGDPHQWDO XQGHUVWDQGLQJ RI FKHPLFDO UHDFWLRQV LQ WKH JDV SKDVH GDWH EDFN WR  ZKHQ 6YDQWH $UUKHQLXV LQWURGXFHG KLV IDPRXV HTXDWLRQ

− Ea k = Ae RT 

 :KHUH k = UDWH FRQVWDQW A = SUHH[SRQHQWLDO IDFWRU Ea = DFWLYDWLRQ HQHUJ\ R = JDV FRQVWDQW T = 7HPSHUDWXUH LQ GHJUHHV .HOYLQ 7R UDWLRQDOL]H WKLV ILQGLQJ ZH DVVXPH D UHDFWLRQ VLPLODU WR WKH RQH GHSLFWHG LQ 6FKHPH 

(6 (6‡ (3 6FKHPH 

(TXDWLRQ  FDQ WKHQ EH UHZULWWHQ   E 1 ln(k)=ln(A) − a  R T 7R PDNH XVH RI WKLV UHODWLRQVKLS WKH WHPSHUDWXUH LV SORWWHG DJDLQVW WKH UDWH FRQVWDQW T YV OQ k  VR D OLQHDU HTXDWLRQ FDQ EH ILWWHG DQG ERWK WKH SUH H[SRQHQWLDO IDFWRU DQG (D5 FDQ EH H[WUDFWHG IURP yLQWHUFHSW DQG VORSH UH VSHFWLYHO\

7KH (\ULQJ HTXDWLRQ HT  ZDV IRXQG LQ  LQGHSHQGHQWO\ E\ +HQU\ (\ULQJ >@ DQG (YDQV 3RODQ\L >@ 7KH H[SRQHQW IHDWXUHV LQ FRQWUDVW WR HT  WKH *LEEV DFWLYDWLRQ HQHUJ\ DQG UHSODFHV WKH SUHH[SRQHQWLDO IDFWRU ZLWK kBT /h LI κ  VHH HT  

‡ kBT − ΔG k = e RT  h :LWK WKH VDPH V\PEROV DV LQ WKH $UUKHQLXV HTXDWLRQ DQG DGGLWLRQDOO\ kB %ROW]PDQQ FRQVWDQW h 3ODQFN¶V FRQVWDQW DQG ΔG‡ *LEEV DFWLYDWLRQ HQ HUJ\ 7UDQVLWLRQ VWDWH WKHRU\ 767 DOORZV XV WR FRQYHUW H[SHULPHQWDOO\ REVHUYHG NLQHWLF HQHUJLHV WR DFWLYDWLRQ HQHUJLHV >@ 767 VXJJHVWV WKDW UHDFWLRQV SUR FHHG WKURXJK D PD[LPXP FURVVLQJ IURP UHDFWDQW WR SURGXFW VWDWH 7KLV PD[L PXP LV UHIHUUHG WR DV D WUDQVLWLRQ VWDWH ,W LV WKH KLJKHVW SRLQW LQ HQHUJ\ EHWZHHQ UHDFWDQW DQG SURGXFW 7KDQNV WR WKH ZRUN E\ (\ULQJ LW LV SRVVLEOH WR FDOFXODWH WKH DFWLYDWLRQ HQHUJ\ RI D UHDFWLRQ IURP D PDFURVFRSLFDOO\ REVHUYHG UHDFWLRQ UDWH

‡ kBT − ΔG k = κ e RT  h :LWK WKH VDPH V\PEROV DV WKH $UUKHQLXV HTXDWLRQ DQG DGGLWLRQDOO\ ‡ kB %ROW]PDQQ FRQVWDQW h 3ODQFN¶V FRQVWDQW ΔG *LEEV DFWLYDWLRQ HQHUJ\ DQG κ 7UDQVPLVVLRQ FRHIILFLHQW XVXDOO\ DVVXPHG WR EH 

 0ROHFXODU 6LPXODWLRQV 7ZR IXQGDPHQWDOO\ GLIIHUHQW ZD\V WR SHUIRUP PROHFXODU VLPXODWLRQV H[LVW 2QH LV EDVHG RQ 00 DQG WKH RWKHU RQ 40 00 PHWKRGV UHO\ RQ D VHW RI

 SDUDPHWHUV IRU DOO ERQGHG DQG QRQERQGHG WHUPV XVHG LQ D V\VWHP WR SURGXFH WKH SRWHQWLDO HQHUJ\ RI WKH V\VWHP   U(R)= Ubonded + Unonbonded

7KH PDMRU DGYDQWDJH RI XVLQJ 00 WR PRGHO D PROHFXOH LV WKDW LW LV IDVW HYHQ ZLWK PDQ\ SDUWLFOHV 7KH GLVDGYDQWDJHV RI 00EDVHG DSSURDFKHV DUH WKDW WKH\ DUH ERXQG WR 1HZWRQLDQ PHFKDQLFV XQDEOH WR PRGHO H[FLWHG VWDWHV DQG XVXDOO\ ERXQG WR KDUPRQLF SRWHQWLDOV IRU ERQGHG WHUPV $V D UHVXOW FODVVLFDO 00 DSSURDFKHV FDQQRW VXSSRUW UHDFWLRQV GLUHFWO\ LH QR IRUPDWLRQ RU EUHDNLQJ RI ERQGV  7R IRUP DQG EUHDN ERQGV D PRUH GHWDLOHG GHVFULSWLRQ LV QHHGHG DQG WKH 6FKU|GLQJHU HTXDWLRQ LV VROYHG DSSUR[LPDWHO\

HΨ=EΨ :LWK H EHLQJ WKH +DPLOWRQLDQ RSHUDWRU GHVFULELQJ WKH WRWDO HQHUJ\ RI WKH V\VWHP Ψ UHSUHVHQWV WKH ZDYHIXQFWLRQ DQG E WKH HQHUJ\ YDOXHV RI Ψ7ZR PDLQ DSSURDFKHV DUH XVHG WR VROYH LW WKRVH EDVHG RQ YDOHQFH ERQGV 9% DQG WKRVH WKDW DUH EDVHG RQ PROHFXODU RUELWDOV 02 

 0ROHFXODU 0HFKDQLFDO 6LPXODWLRQV 7KH IRUFH ILHOG EDVHG DSSURDFKHV UHO\ RQ SDUDPHWUL]DWLRQ RI DOO LQWHUDFWLRQV RI PROHFXOHV LQ WKH IRUP RI   ( )= bond( − )2 + angle( − )2+ U R ki ri r0 ki θi θ0 bonds angles dihedral[1 + ( + )]+ ki cos ηiφi δi dihedrals     12  6  σij σij qiqj 4εij − + rij rij εrij i i=j i i=j

 ZKHUH WKH ILUVW WKUHH WHUPV GHVFULEH ERQG DQJOH DQG GLKHGUDO UHVSHFWLYHO\ DQG WKH IRXUWK WHUP GHVFULEHV WKH QRQERQGHG LQWHUDFWLRQV XVXDOO\ VSOLW LQWR YDQ GHU :DDOV DQG &RXORPE WHUPV )RU D JLYHQ FRPSRXQG DOO SDUDPHWHUV bond angle dihedral KDYH WR EH NQRZQ ri ki  θ0 ki  ηi φi δi ki  σij εij qi qj DQG FRQVLVWHQWO\ GHILQHG IRU DOO DWRPV WKDW DUH SDUW RI WKH IRUFH ILHOG VLPXODWLRQ

 0ROHFXODU '\QDPLFV 6LPXODWLRQV $SSURDFKHV EDVHG RQ PROHFXODU G\QDPLFV 0' DLP WR VLPXODWH D PROHFXOH RYHU WLPH EHWZHHQ IHPWRVHFRQGV IV WR PLFURVHFRQGV —V  DQG ZLWK VSHFLDO

 hardware (ANTON) milliseconds (ms) [94], to obtain information of the con- formational landscape of a molecule. Often, molecules have many degrees of freedom, making it very hard or impossible to find the global minimum energy conformation and simulations can get trapped in local minima. Additionally, large molecules have many energetically similar configurations that are all ac- cessible with temperatures around 300K. It is therefore of central importance to sample a molecule’s energetic landscape long enough and with different ini- tial conditions, to obtain sufficient sampling and collect an understanding for the states a molecule visits over time. This approach is routinely used to per- form simulations of protein in solvent and is implemented in many simulation packages (e.g. GROMACS [115, 116], AMBER [117], CHARMM [118] and Q [119]). Depending on the simulated system, code, and hardware used, 1 second compute time translates to 10−14 (single core [120]) to 10−9 seconds (ANTON 2 [94]).

2.5.3 Quantum Mechanical Simulations Quantum mechanical approaches (QM) simulate a molecule entirely, including electrons. Since electrons have much less mass than the nuclei they are almost instantaneously adapting to the movement of the nuclei and this can be used to separate the nuclei from the electrons. This fundamental approximation is called Born-Oppenheimer approximation. The goal of the QM approaches is to calculate the wave function of a mo- lecule, as can be done by solving the Schrödinger equation. The Schrödinger equation has however only an analytical solution for the simplest atoms (e.g. hydrogen atom) and it needs to be solved numerically for other atoms and molecules. Using the Born-Oppenheimer approximation, the Schrödinger equa- tion can be solved for the electrons separately. The Hartree-Fock method ap- proximates further, by calculating the electron interaction with the meanfield of the other electrons, thus neglecting electron-electron correlations. To in- clude electron correlation QM methods based on Møller–Plesset perturbation theory (e.g. MP2), configuration interaction (CI) or coupled cluster (CC) can be employed. These methods are, however, computationally expensive (N4 to N8), growing with the number of electrons in the system. Hence they are un- practical for large systems. Density function theory (DFT) is an approach that can be used for larger systems, which does not aim to approximate the wave- function, but instead the electron density. A shortfall of DFT is the description of exchange-correlation and different functionals should be tested for a system to avoid non-physical DFT artefacts.

29  0XOWLVFDOH 0RGHOLQJ ,Q  WKH 1REHO SUL]H LQ FKHPLVWU\ ZDV DZDUGHG WR .DUSOXV /HZLW DQG :DUVKHO IRU WKH GHYHORSPHQW RI PXOWLVFDOH PRGHOLQJ 7KH LGHD WKH /DXUHDWHV FDPH XS ZLWK LV WKH FRPELQDWLRQ RI GLIIHUHQW OHYHOV RI WKHRU\ RU FRPSOH[LW\ IRU GLIIHUHQW SDUWV RI D PROHFXOH RU D V\VWHP RI PROHFXOHV OLNH D VROYDWHG SUR WHLQ 0RGHUQ 4000 DSSURDFKHV EXLOG RQ WKLV LGHD DQG XVH D IRUFH ILHOG IRU D ODUJH SDUW RI WKH V\VWHP DQG GHQVLW\ IXQFWLRQDO WKHRU\ ')7 RU DQRWKHU 40 DSSURDFK IRU WKH UHDFWLYH SDUW RI WKH V\VWHP ,Q WKH HPSLULFDO YDOHQFH ERQG (9% DSSURDFK ZKLFK ZDV H[WHQVLYHO\ XVHG LQ WKLV ZRUN WKH UHDFWLYH SDUW RI D V\VWHP LV PRGHOHG ZLWK YDOHQFH ERQG $G GLWLRQDOO\ LQ WKLV ZRUN VSKHULFDO UHDFWLRQ ERXQGDULHV DUH XVHG LH VXUIDFH FRQVWUDLQHG DOODWRP VROYHQW 6&$$6 > @  VHH )LJXUH  :LWK (9% WKH UHDFWLYH DWRPV DUH SDUW RI WKH IRUFH ILHOG DQG WKH ERQGV IRUPHG DQGRU EUR NHQ DUH PRGHOOHG ZLWK 0RUVH SRWHQWLDOV LQVWHDG RI KDUPRQLF SRWHQWLDOV 7KLV DSSUR[LPDWLRQ RI UHDFWLYLW\ KDV ERWK DQ LPSDFW RQ WKH DFFXUDF\ RI WKH DFWLYH ERQGV EXW DOVR RQ WKH FRPSXWH WLPH QHHGHG WR VLPXODWH RQH WLPHVWHS 3URYLGHG WKDW WKH LQWULQVLF HQHUJ\ QHHGHG IRU ERQGIRUPDWLRQ DQGRU ERQGEUHDNLQJ GRHV QRW FKDQJH VLJQLILFDQWO\ IRU WKH VDPH UHDFWLRQ LQ YDFXXP ZDWHU RU SURWHLQWKH (9% DOORZV IRU VLJQLILFDQWO\ PRUH VDPSOLQJ RI WKH HQYLURQPHQW ,Q WXUQ WKH HQHUJHWLF FRQWULEXWLRQV IURP WKH HQYLURQPHQW HJ VROYHQW RU SURWHLQ FDQ EH VDPSOHG ORQJHU DQG WKLV FDQ KHOS WR REWDLQ FRQYHUJLQJ HQHUJHWLFV 7KHUH DUH RWKHU ZD\V WR DGG UHDFWLYLW\ WR D IRUFH ILHOG )RU H[DPSOH 5HD[)) LV DQ DSSURDFK WKDW FDQ HQDEOH D IRUFH ILHOG EDVHG DSSURDFK WR ERQG EUHDN LQJ DQG ERQG IRUPLQJ SURFHVVHV &RQVLGHUDEOH SDUDPHWUL]DWLRQ RI WKH UHDFWLYH IRUFH ILHOG LV QHHGHG WR GHVFULEH GLIIHUHQW FODVVHV RI V\VWHPV > @

 )UHH (QHUJ\ 3HUWXUEDWLRQ ,Q PROHFXODU G\QDPLFV LW LV QRW SRVVLEOH WR VDPSOH KLJK HQHUJ\ VWDWHV LQ D SK\VLFDOO\ PHDQLQJIXO ZD\ ,W LV KRZHYHU SRVVLEOH WR GHVFULEH WZR VWDWHV$ DQG % ZLWK D IRUFH ILHOG ,I WKH WZR VWDWHV DUH FORVH DQG VDPSOHG HQRXJK LH WKHUH LV HQRXJK RYHUODS  LW LV SRVVLEOH WR FDOFXODWH WKH WUDQVLWLRQ IURP RQH VWDWH WR DQRWKHU 7KH SURFHGXUH NQRZQ DV IUHH HQHUJ\ SHUWXUEDWLRQ )(3 FDQ EH XVHG EDVHG RQ WKH ZRUN RI 5REHUW =ZDQ]LJ IURP  >@   U − U ΔG = −RT ln exp{− B A }  kBT A 7KLV RQO\ ZRUNV LI WKH RYHUODS EHWZHHQ WKH WZR VWDWHV $ DQG % LV ODUJH HQRXJK LH WKH GLIIHUHQFH EHWZHHQ VWDWH $ DQG % LV VPDOO  ,I QRW HQRXJK VDP SOLQJ LV DYDLODEOH GXH WR ORZ RYHUODS LQWHUPHGLDWH SRWHQWLDOV FDQ EH FUHDWHG XVXDOO\ ZLWK D OLQHDU FRPELQDWLRQ RI VWDWHV $ DQG % DQG D PDSSLQJ SDUDPHWHU θm 

 (A) (B) O O 1 2 O O O O

R R

(C)

1 2

 H12

)LJXUH  $ 6FKHPDWLF LOOXVWUDWLRQ RI WKH VSKHULFDO ERXQGDULHV XVHG LQ WKLV ZRUN 7KH SURWHLQ LV GLVSOD\HG LQ JUH\ FDUWRRQ 7KUHH GLIIHUHQW OD\HUV RI DWRPV DUH GLVWLQJXLVKHG ,Q WKH FHQWHU GHSLFWHG DV ERQG DQG VWLFNV WKH UHDFWLQJ DWRPV ,Q D VHFRQG OD\HU WKH DWRPV DUH XQUHVWUDLQHG DQG PD\ LQ FRQWUDVW WR WKH WKLUG OD\HU PRYH IUHHO\ 7KH WKLUG OD\HU UHVWUDLQV WKH PRELOLW\ RI DWRPV DQG WKH HYHU\WKLQJ RXWVLGH WKH WKLUG OD\HU LV KHDYLO\ UHVWUDLQHG DQG SUDFWLFDOO\ LPPRELOL]HG % 7KH WZR YDOHQFH ERQG VWDWHV Ψ1 DQG Ψ2 IRU WKH ILUVW VWHS RI QXFOHRSKLOLF HSR[LGH K\GURO\VLV & 7KH IUHH HQHUJ\ VXUIDFHV RI WKH WZR YDOHQFH ERQG VWDWHV :LWK λ UHSUHVHQWLQJ WKH UHRUJDQL]DWLRQ HQHUJ\ QHHGHG WR FKDQJH IURP RQH WR WKH RWKHU VXUIDFH

Ueff (θm)=εm =(1− θm)UA + θmUB (0 ≤ θm ≤ 1) 

7KH PDSSLQJ SDUDPHWHU θm LV WKHQ LQFUHPHQWDOO\ FKDQJHG IURP  WR  VR WKDW WKH RYHUODS EHWZHHQ WZR LQWHUPHGLDWH VWDWHV LV ODUJH HQRXJK WR FRQYHUJH $OO WKDW LV OHIW LV WKH FRPELQDWLRQ DV GHVFULEHG EHORZ VHH HT  RI DOO WKH LQWHUPHGLDWH VWHSV ZKLFK UHVXOWV LQ D FKDQJH IURP VWDWH $ WR % >±@

 (PSLULFDO 9DOHQFH %RQG $SSURDFK 7KH (9% DSSURDFK DGGV UHDFWLYLW\ WR IRUFH ILHOG EDVHG VLPXODWLRQV $V LWV QDPH LPSOLHV HPSLULFDO ILWWLQJ RI D VHW RI SDUDPHWHUV WR PDWFK D UHIHUHQFH V\V WHP LV QHHGHG 2QFH WKH SDUDPHWHUV DUH ILWWHG HJ WR WKH UHDFWLRQ RI LQWHUHVW LQ ZDWHU  WKH\ DUH IL[HG DQG GR QRW FKDQJH IRU WKH VDPH UHDFWLRQ LQ RWKHU VRO YHQWV 7KH H[DFW VDPH SDUDPHWHUV DV REWDLQHG E\ WKH UHIHUHQFH UHDFWLRQ DUH WKHQ DSSOLHG WR WKH DFWXDO V\VWHP RI LQWHUHVW HJ WKH VDPH UHDFWLRQ LQ SURWHLQ >±@

 7KH YDOHQFH ERQG 9% WKHRU\ DQG PROHFXODU RUELWDO 02 WKHRU\ ERWK GH VFULEH SKHQRPHQD RQ WKH PROHFXODU OHYHO ,Q WKH 9% DSSURDFK WKH PROHFXOH LV KDQGOHG DV D OLQHDU FRPELQDWLRQ RI YDOHQFH ERQG VWDWHV 8QOLNH 02 WKHRU\ ZKLFK GHVFULEHV WKH ZDYH IXQFWLRQV RI GHORFDOL]HG HOHFWURQV E\ FRPELQLQJ DOO SRVVLEOH RUELWDOV ORRNLQJ IRU WKH SUREDELOLW\ RI ILQGLQJ DQ HOHFWURQ DW JLYHQ SR VLWLRQV LQ VSDFH WKH 9% DFFRXQWV IRU WKH SUREDELOLW\ RI ILQGLQJ D PROHFXOH DW D JLYHQ VWDWH >@ %HFDXVH WKH (9% DSSURDFK XVHV LQH[SHQVLYH 0RUVHSRWHQWLDO HT  WHUPV WR PRGHO UHDFWLYLW\ LH ERQG IRUPLQJ DQG EUHDNLQJ LW LV ZHOO VXLWHG WR DFFRXQW IRU WKH HQHUJHWLF FRQWULEXWLRQV IURP WKH HQYLURQPHQW LH YDFXXP VROYHQW RU SURWHLQ DQG WKH FRPSXWDWLRQ RI WKH FRUUHVSRQGLQJ HQHUJHWLFV LV PXFK IDVWHU WKDQ ZLWK 02EDVHG 4000 FDOFXODWLRQV 7KLV LV EHFDXVH LW LV QRW QHFHVVDU\ WR FRPSXWH WKH ZDYH IXQFWLRQ IRU HYHU\ WLPH VWHS

M = De(exp{−2a(r − r0)}−2exp{−a(r − r0)} 

%XLOGLQJ IURP 0DUFXV 7KHRU\ :DUVKHO :HLVV LQWURGXFHG WKH (9% DS SURDFK LQ  7KH IXQGDPHQWDO LGHD ZDV WR GHWHUPLQH WKH VROYDWLRQ HIIHFWV RI WKH HQYLURQPHQW RQ WKH V\VWHP RI LQWHUHVW >@ th 7KH GLDEDWLF VWDWHV H11 DQG H22 DUH GHVFULEHG DV LQ HT  IRU WKH i i VWDWH ZLWK WKH ILUVW WHUP αgas UHSUHVHQWLQJ WKH JDVSKDVH HQHUJ\ 7KH HDVLHVW H[DPSOH RI DQ (9% FDOFXODWLRQ LV WKH FDOFXODWLRQ LQ WKH JDV SKDVH ZKHUH WKH GL = = i + (R Q) DJRQDO HOHPHQWV Hii εi αgas Uintra ,  DQG ZKHUH WKH VHFRQG WHUP Uintra(R, Q) DFFRXQWV IRU WKH LQWUDPROHFXODU LQWHUDFWLRQV ZLWKLQ WKH VROXWH 7KH RIIGLDJRQDO WHUP LQ HT  PD\ EH LPSOHPHQWHG DV D GHVFULSWLRQ RI i H[SRQHQWLDO FRXSOLQJ IXQFWLRQV DQG WRJHWKHU ZLWK αgas HPSLULFDOO\ ILWWHG WR D UHIHUHQFH HQHUJ\

(−μ(r−r0)) Hij = Ae  :LWK HTV  DQG  WKH +DPLOWRQLDQ PDWUL[ FDQ EH FRQVWUXFWHG DV   H11 H21 H =  H12 H22 7KH FDVH RI D JDV SKDVH FDOFXODWLRQ FDQ EH H[WHQGHG WR DFFRXQW IRU WKH VRO YHQW RU SURWHLQ HIIHFWV E\ H[WHQGLQJ Hii ZLWK HOHPHQWV IURP WKH VROYHQW DQG VROYHQW LQWHUDFWLRQ

= = i + i (R Q)+ i (R, Q, r, q)+ i (r, q) Hii εi αgas Uintra , Uinter Usolvent  ,Q HT  R DQG Q DUH UHSUHVHQWLQJ FRRUGLQDWHV DQG FKDUJHV RI WKH UH DFWLQJ DWRPV VROXWH ZKLOH r DQG q DUH WKH FRRUGLQDWHV DQG FKDUJHV RI WKH VXUURXQGLQJ DWRPV VROYHQW  XVXDOO\ LQ YDFXXP VROYHQW RU SURWHLQ 7KH WHUP i (R, Q) Uintra UHSUHVHQWV WKH LQWUDPROHFXODU LQWHUDFWLRQV RI WKH VROXWH DV DOUHDG\

 i (R, Q, r, q) PHQWLRQHG DERYH DQG Uinter LV WKH LQWHUDFWLRQ EHWZHHQ VROXWH DQG i (r, q) WKH VXUURXQGLQJ VROYHQW )LQDOO\ Usolvent UHSUHVHQWV WKH VROYHQW HQHU JLHV 1RZ WKH DGLDEDWLF JURXQGVWDWH HQHUJ\ DQG LWV FRUUHVSRQGLQJ HLJHQYHFWRU DUH REWDLQHG IURP WKH ORZHVW HLJHQYDOXH E\ VROYLQJ WKH VHFXODU HTXDWLRQ

HC = EgC  )RU WKH WZRVWDWH (9% V\VWHP WKH VROXWLRQ WR WKLV HLJHQYDOXH SUREOHP LV

1 2 2 E = [(ε1 + ε2) − (ε1 − ε2) +4H ] g 2 12  7KH VROXWLRQ RI  LH WKH ORZHVW HLJHQYDOXH LV WKH JURXQG VWDWH HQHUJ\ RI WKH V\VWHP 7KH DFWLYDWLRQ IUHHHQHUJ\ ΔG‡ FDQ WKHQ EH REWDLQHG E\ FKDQJ LQJ WKH V\VWHP IURP RQH VWDWH WR WKH RWKHU $ ³PDSSLQJ SRWHQWLDO³ LQ WKH IRUP RI

εm =(1− θm)ε1 + θmε2 (0 ≤ θm ≤ 1)  FDQ EH XVHG WR FKDQJH WKH V\VWHP JUDGXDOO\ IURP RQH VWDWH WR WKH RWKHU ZKHUH θm LV FKDQJHG LQFUHPHQWDOO\ IURP  WR  7KHQ WKH IUHH HQHUJ\ ΔGm RI FKDQJLQJ θm IURP  WR  FDQ EH REWDLQHG E\ IUHHHQHUJ\ SHUWXUEDWLRQXPEUHOOD VDPSOLQJ )(386 >@ 7R REWDLQ WKH IXOO HQHUJ\ VXUIDFH EHWZHHQ GLIIHUHQW GLDEDWLF VWDWHV HJ IURP VWDWH  WR VWDWH   WKH FKDQJHV LQ εm QHHG WR EH JUDGXDO HQRXJK WR EH DEOH WR FRQVWUXFW WKH IUHHHQHUJ\ IXQFWLRQDO

−1 ΔG(x )=ΔGm − β ln δ(x − x ) × exp{−β[Eg(x) − εm(x)]} εm  $V PHQWLRQHG HDUOLHU ZLWK (9% D IXQGDPHQWDO DSSUR[LPDWLRQ LV WKDW WKH i RIIGLDJRQDOV DQG αgas GR QRW FKDQJH VLJQLILFDQWO\ ZKHQ FKDQJLQJ WKH HQYL URQPHQW RI WKH UHDFWLRQ LH ZKHQ WUDQVIHUULQJ WKH V\VWHP IURP ZDWHU WR SUR WHLQ > @

H12 = (ε1 − Eg)(ε2 − Eg)  ,W LV SRVVLEOH WR HVWLPDWH WKH RULJLQ RI WKH FDWDO\WLF HIIHFW WR WKH (9% UHVXOW E\ DSSUR[LPDWLQJ WKH DFWLYDWLRQ IUHH HQHUJ\ ZLWK WKH +ZDQJ±cTYLVW±:DUVKHO +$: HTXDWLRQ

0 2 ‡ (ΔG + λ) H12(R0) ΔG = W + − H12(x)+ − Γ  4λ (ΔG0 + λ) W  WKH ³ZRUN WHUP´ FRUUHVSRQGV WR WKH IUHH HQHUJ\ QHHGHG WR EULQJ WKH UH 0 DFWDQW SDLU WR WKH LQWHUDFWLRQ GLVWDQFH R0 ΔG UHSUHVHQWV WKH UHDFWLRQ IUHH

 HQHUJ\ DQG λ LV WKH UHRUJDQL]DWLRQ HQHUJ\ H12(x) DQG H12(R0) GHVFULEH WKH DYHUDJH YDOXH RI H12 DW WKH WUDQVLWLRQ DQG UHDFWDQW VWDWH UHVSHFWLYHO\ DQG IL QDOO\ Γ UHSUHVHQWV WKH QXFOHDU TXDQWXPPHFKDQLFDO FRUUHFWLRQ > @ ,W LV SRVVLEOH WR DSSUR[LPDWH WKH UHRUJDQL]DWLRQ HQHUJ\ >@ E\ XVLQJ WKH UHODWLRQVKLS 1 λ = (Δε −Δε ) 2 2 1 

ZKHUH Δε1 DQG Δε2 UHSUHVHQW WKH DYHUDJH GLIIHUHQFH EHWZHHQ ε1 DQG ε2 )RU D VFKHPDWLF RYHUYLHZ UHIHU WR )LJXUH  Water Enzyme

2 1 2 1

wat enz G‡ Eg G‡ Eg

)LJXUH  $Q LOOXVWUDWLRQ RI WKH UHODWLRQVKLS EHWZHHQ WZR GLDEDWLF VWDWHV ε1 DQG ε2 LQ ZDWHU OHIW DQG HQ]\PH ULJKW  %RWK WKH UHRUJDQL]DWLRQ HQHUJ\ λ DQG WKH DFWLYDWLRQ EDUULHU DUH ORZHU LQ WKH HQ]\PDWLF UHDFWLRQ 3XEOLVKHG LQ UHI >@ OLFHQVHG XQGHU && %<

7KLV FRQFOXGHV WKH PHWKRGRORJ\ FKDSWHU DQG QH[W , ZLOO SUHVHQW DUWLFOHV WKDW KDYH EHHQ SXEOLVKHG LQ WKH FRQWH[W RI WKLV WKHVLV 7KH FRPSXWDWLRQDO PRGHOLQJ RI PHWDOV LV D SDUWLFXODUO\ FKDOOHQJLQJ WDVN DQG VR , ZLOO QH[W LQWURGXFH WKH QRQERQGHG RFWDKHGUDO GXPP\ PRGHO

 3. Methodology Advances

3.1 Paper I Force Field Independent Metal Parameters Using a Nonbonded Dummy Model Metals play ubiquitous roles in proteins and enzymes: Nearly 1/3 of the protein structures stored on the PDB contain metal ions [135]. Also, metals are found in all classes of enzymes and for example 44% of all known oxidoreductases contain a metal-center [136]. Clearly accurate metal models are important to model metallo proteins.

Modeling metal ions is challenging because metals are easily ionized and highly reactive. Hence, one would preferably model metals with QM ap- proaches which is increasingly done in QM/MM studies [137–139]. It is how- ever often still problematic to reproduce physical properties accurately with QM methods [140, 141]. Additionally, QM approaches are computationally expensive. This is accented, especially if one wishes to perform free energy calculations, which require significant conformational sampling. Different strategies to model metal ions with classical simulations have been published previously. In Paper I, we have described them broadly as three classes; the non-bonded soft-sphere model, the covalently bonded approach and the non-bonded dummy model approach. The simple non-bonded soft- sphere models are describing metal-ligand interactions trough electrostatic and van der Waals potentials. While both earth-alkali and alkali metals have been modeled successfully with this approach, the model appeared to be inadequate for modeling multinuclear metal centers [142]. An additional challenge was the simultaneous reproduction of both metal - water distances and free ener- gies of solvation. The second approach using covalently bonded approaches is not allowing ligand exchange and/or interconversion between different co- ordination geometries [143] and generally suffers from predefined bonds and challenges with a large number of parameters and additionally it needs repara- metrization for new systems [135]. The final approach is using a dummy model, originally developed by Åqvist and Warshel for the octahedral dummy model of Mn2+ ions [144]. The dummy model approach aims to provide an alternative metal model to deal with short- comings of the other approaches and reproduce experimental data. For the oc- tahedral dummy model a metal ion consists of six δ+ charged dummy-particles

35 DUUDQJHG DURXQG D PHWDO FHQWHU ZKLFK LV LQ WXUQ LV FKDUJHG n6δ VHH )LJXUH   7KH GXPP\ DWRPV DUH NHSW LQ RFWDKHGUDO JHRPHWU\ E\ VWURQJ ERQG DQG DQJOHSDUDPHWHUV EHWZHHQ WKH PHWDO FHQWHU DQG WKH GXPP\DWRPV 7KLV DS SURDFK SUHDOLJQV WKH FKDUJH RI WKH PHWDO LQ RFWDKHGUDO JHRPHWU\ 1RWH WKDW WKH GXPP\ PRGHO FDQ EH XVHG IRU RWKHU FRRUGLQDWLRQ JHRPHWULHV DV GHPRQ VWUDWHG IRU WHWUDKHGUDO ]LQF >@

)LJXUH  7KH RFWDKHGUDO GXPP\ PRGHO $ 7KH GXPP\ PRGHO DV /HZLV VWUXF WXUH ZLWK WKH δ+ SDUWLDO FKDUJHV RQ GXPP\ DWRPV DQG WKH n − 6δ− FKDUJH RQ WKH PHWDOFHQWHU % 7KH RFWDKHGUDO GXPP\ PRGHO LQ D SURWHLQ DFWLYH VLWH 5HSULQWHG ZLWK SHUPLVVLRQ IURP UHI >@ &RS\ULJKW  $PHULFDQ &KHPLFDO 6RFLHW\

,Q WKLV ZRUN ZH WHVWHG SUHYLRXVO\ DYDLODEOH SDUDPHWHUV IRU 0Q=Q0J DQG &D UHILQHG WKHP DQG GHYHORSHG QHZ SDUDPHWHUV IRU 1L&RDQG )H :H SDUDPHWUL]HG WKH GXPP\ PRGHOV WR UHSURGXFH LPSRUWDQW H[SHUL PHQWDO SURSHUWLHV RI PHWDOV WKDW KDG EHHQ SXEOLVKHG QDPHO\ WKH VROYDWLRQ IUHH HQHUJ\ >@ WKH OLJDQGPHWDO GLVWDQFHV >@ DQG DOVR WKH OLJDQG GLVVRFLD WLRQ HQHUJLHV RI 0J IRU WKH JO\R[DODVH , V\VWHP VHH EHORZ >@ :H ZHUH DEOH WR GHPRQVWUDWH WKDW RXU GXPP\PRGHOV UHPDLQ VWDEOH ERWK LQ ZDWHU DQG LQ HQ]\PH DQG ZH VKRZHG WKDW WKH GXPPLHV UHSURGXFH WKH DERYH PHQWLRQHG NH\ H[SHULPHQWDO SURSHUWLHV VLPXOWDQHRXVO\ WKH IUHH HQHUJ\ RI VROYDWLRQDQG LRQZDWHU GLVWDQFHV )LQDOO\ ZH UDQ PROHFXODU G\QDPLF VLPXODWLRQV RI WKH GXPPLHV LQ D SURWHLQ V\VWHP WR GHPRQVWUDWH JHQHUDO DSSOLFDELOLW\ :H XVHG RXU SDUDPHWHUV WR GHPRQVWUDWH WKHLU DSSOLFDELOLW\ LQ JO\R[DODVH , $GGLWLRQDOO\ P\ FRDXWKRUV KDYH HPSOR\HG WKH RFWDKHGUDO GXPP\ PRGHO IRU VLPXODWLRQV RI PHWDO LRQV LQ GLIIHUHQW V\VWHPV >±@ ,Q WKH SDVW WKUHH \HDUV VLQFH 3DSHU , ZDV SXEOLVKHG /LDR HW DO >@ KDYH SXEOLVKHG D &X2+ PRGHO WKDW VXFFHVVIXOO\ UHSURGXFHV WKH -DKQ7HOOHU GLVWRUWLRQ D GLIILFXOW SURS HUW\ WR PRGHO $GGLWLRQDO RFWDKHGUDO GXPP\ PRGHO SDUDPHWHUV KDYH EHHQ SXE OLVKHG E\ -LDQJ HW DO > @ 9HU\ UHFHQWO\ D FRPSUHKHQVLYH UHYLHZ DERXW PHWDO PRGHOV XVLQJ FODVVLFDO PHFKDQLFV KDV EHHQ SXEOLVKHG >@ $OO RI ZKLFK VKRZ WKDW WKH QRQERQGHG RFWDKHGUDO GXPP\ PRGHO LV D YHUVDWLOH DSSURDFK WR PRGHO PHWDO LRQV 7R SDUDPHWUL]H WKH PHWDO LRQV ZH XVHG D )(3 SURWRFRO WR REWDLQ WKH IUHH HQHUJ\ RI VROYDWLRQ

 Δ =Δ FEP +Δ +Δ +Δ std.state Gsolv GM 0→M n+ GBorn Gcav Gcorr  +HUH WKH ILUVW WHUP LV REWDLQHG E\ FKDUJLQJ WKH PHWDO LQ D )(3 SURFHGXUH

n−1 −( − / ) Δ FEP = − Um+1 Um kT GM 0→M n+ RT ln e  m m=1

DV H[SODLQHG LQ VHFWLRQ  ZLWK U DV LQ HT   7KH ΔGBorn WHUP FRUUHFWV IRU WKH HUURU LQWURGXFHG E\ WKH ILQLWH LQWHUDFWLRQ FXWRII UDGLXV XVHG IRU WKH HOHFWURVWDWLF LQWHUDFWLRQV DQG LW FDQ EH HVWLPDWHG ZLWK   2 1 Qi ΔGBorn = −166 1 −  RBorn ε(T )

:KHUH WKH VROXWH FKDUJH LV UHSUHVHQWHG E\ Qi WKH FDYLW\ UDGLXV RI WKH FDYLW\ LV RBorn DQG WKH GLHOHFWULF FRQVWDQW RI WKH PHGLXP DQG WKH VROXWH DUH HPEHGGHG LQ ε(T ) 7KH ODVW WZR WHUPV RI HT  FDQFHO RXW ΔGcav UHIHUV WKH FRVW RI FUHDWLQJ Δ std.state D FDYLW\ LQ WKH VROYHQW DQG Gcorr LV D FRUUHFWLRQ IRU PRYLQJ WKH PHWDO LRQ IURP WKH YDFXXP WR VROXWLRQ

+DYLQJ DFFHVV WR LPSURYHG SDUDPHWHUV DOORZV PRUH DFFXUDWH PRGHOLQJ RI QHZ V\VWHPV ,Q WKH QH[W VHFWLRQ &$'(( D WRRO IRU LQ VLOLFR GLUHFWHG HYROXWLRQ ZLOO EH LQWURGXFHG

  3DSHU ,9 &$'(( &RPSXWHU$LGHG 'LUHFWHG (YROXWLRQ RI (Q]\PHV 2QH JRDO RI P\ WKHVLV ZDV WKH GHYHORSPHQW RI DQ DXWRPDWHG ZD\ WR JHQHUDWH DQG UXQ (9% VLPXODWLRQV WR DOORZ IRU LQ VLOLFR GLUHFWHG SURWHLQ HYROXWLRQ

,Q 3DSHU ,9 ZH LOOXVWUDWHG WKDW UDWLRQDO HQ]\PH GHVLJQ DSSURDFKHV KDYH KDG VLJQLILFDQW LPSDFW DQG UDWLRQDO GHVLJQ DSSURDFKHV DUH VWLOO DSSOLHG WRGD\ 5DWLR QDO GHVLJQ LV KRZHYHU UHO\LQJ RQ DQG OLPLWHG E\ WKH DYDLODELOLW\ RI H[SHULPHQ WDO LQIRUPDWLRQ > @ $SSURDFKHV IRU WKH GLUHFWHG HYROXWLRQ RI HQ]\PHV KDYH VLQFH UHYROXWLRQL]HG WKH GHYHORSPHQW RI ELRFDWDO\VWV 7KLV V\VWHPDWLF DS SURDFK HPSOR\V JHQH GLYHUVLILFDWLRQ DQG UDSLG VFUHHQLQJ WHFKQLTXHV WR TXLFNO\ HYROYH DQ H[LVWLQJ HQ]\PH GHVLJQ 7KLV HQDEOHV DFFHVV WR FDWDO\WLFDOO\ VXSHULRU HQ]\PHV WKDW ZRXOG EH LQDFFHVVLEOH E\ UDWLRQDO GHVLJQ EHFDXVH WKH HIIHFW RIWKH LQWURGXFHG PXWDWLRQV LV VRPHWLPHV XQSUHGLFWDEOH >±@ $GGLWLRQDOO\ GH QRYR HQ]\PH GHVLJQV RIWHQ QHHG UHILQHPHQW :LWK PRGHUQ H[SHULPHQWDO DSSURDFKHV WKURXJKSXWV RI 108 WR 1010 VFUHHQHG YDULDQWV SHU H[SHULPHQW DUH SRVVLEOH >@ 7KLV ODUJH QXPEHU RI FRQVWUXFWV FDQ KRZHYHU DOUHDG\ EH FKDOOHQJHG ZLWK RQO\  DPLQR DFLGV PXWDWHG WR DOO  YDULDWLRQV DW HDFK SRVLWLRQ 208 ≈ 2.6· 1010 VHH DOVR )LJXUH  RQ SDJH   8QIRUWXQDWHO\ GLUHFWHG HYROXWLRQ DSSURDFKHV DUH SURQH WR JHW WUDSSHG LQ ORFDO PLQLPD LQ WKH VHTXHQFH VSDFH ZKLFK FDQ UHQGHU IXUWKHU HYROXWLRQ LPSRVVL EOH > @ ,Q WKHVH FDVHV D FRPELQDWLRQ RI FRPSXWDWLRQDO DQG VWUXFWXUDO DSSURDFKHV FDQ KHOS WR UDWLRQDOL]H WKHVH DQG WR SRVVLEO\ ILQG D ZD\ RXW RI D ORFDO PLQLPXP > @ ,Q DGGLWLRQ FRPSXWDWLRQDO DSSURDFKHV WKDW FRYHU D ZLGH UDQJH RI DSSOLFDWLRQV KDYH EHHQ SXEOLVKHG 7KHVH LQFOXGH WKH GHYHO RSPHQW RI GH QRYR HQ]\PH GHVLJQV ZLWK QHZ SURSHUWLHV KRWVSRW SUHGLFWLRQ WRROV ZLWK PDFKLQHOHDUQLQJ PROHFXODU G\QDPLFDO DQG TXDQWXP PHFKDQLFDO DSSURDFKHV >   @ $OVR DSSURDFKHV WR SHUIRUP LQ VLOLFR GLUHFWHG HYROXWLRQ H[SHULPHQWV KDYH EHHQ SXW IRUZDUG LQ WKH OLWHUDWXUH >@ 0RVW FRP SXWDWLRQDO HQ]\PH GHVLJQ DWWHPSWV UHO\ RQ KLJKOHYHO TXDQWXPPHFKDQLFDODS SURDFKHV DQG DUH FRPSXWDWLRQDOO\ H[SHQVLYH 8QIRUWXQDWHO\ WKLV SUREOHP RQO\ DFFHQWXDWHV ZKHQ VXFK DSSURDFKHV VKRXOG EH XVHG WR VFUHHQ WKRXVDQGV RI HQ ]\PH PXWDQWV 7R FKRRVH FRPSXWDWLRQDOO\ FKHDSHU VHPLHPSLULFDO 4000 DSSURDFKHV ZRXOG DOORZ IRU DGGLWLRQDO VDPSOLQJ DQG VFUHHQLQJ KRZHYHU WKH LQKHUHQWO\ OLPLWHG DFFXUDF\ RI VXFK PHWKRGV LV RIWHQ SUREOHPDWLF >±@ 5HFHQW VWXGLHV GHPRQVWUDWH WKH VSHHG DQG HIILFLHQF\ RI WKH (9% DSSURDFK DV D WRRO IRU FRPSXWDWLRQDO HQ]\PH GHVLJQ >±@ 7KH (9% DSSURDFK KDV WKH GLVWLQFW DGYDQWDJH RI EHLQJ DFFXUDWH DQG IDVW HQRXJK WR DOORZ IRU VXI ILFLHQW VDPSOLQJ $W WKH VDPH WLPH LW ZDV HDUOLHU UHOLDEO\ XVHG IRU GLYHUVH V\VWHPV > ±@ )DYRUDEO\ IRU WKH FRPSXWDWLRQDO GLUHFWHG HYROXWLRQ RI HQ]\PHV WKH (9% DSSURDFK FRPSXWHV UHODWLYH HQHUJHWLFV DQG KHQFH DOORZV HI

 )LJXUH  6LPXODWLRQ IORZFKDUW $IWHU LQLWLDOL]DWLRQ &$'(( ORFDWHV WKH LQLWLDOL]D WLRQ GLUHFWRU\ DQG VHDUFKHV IRU VLPXODWLRQ SDFNHWV VLPSDFNV  7KH\ DUH VXEVHTXHQWO\ GLVWULEXWHG RQ WKH FRPSXWDWLRQDO UHVRXUFHV DQG FRPSXWHG ZLWK 4 7KDQNV WR WKH IDFW WKDW WKH LQGLYLGXDO VLPSDFNV FDQ EH FRPSXWHG LQGHSHQGHQWO\ RI HDFK RWKHU &$'(( VFDOHV SOHDVLQJO\ SDUDOOHO ZLWK PXOWLSURFHVVRU V\VWHPV 3XEOLVKHG LQ UHI >@ OL FHQVHG XQGHU && %<

ILFLHQW FRPSDULVRQ RI GLIIHUHQW HQ]\PH FRQVWUXFWV 7KH (9% LV FRXSOHG ZLWK GHWDLOHG NQRZOHGJH RI WKH UHDFWLRQ PHFKDQLVP DQG FDUHIXO SDUDPHWUL]DWLRQWR UHIHUHQFH V\VWHP

7R DFKLHYH D WRRO IRU LQ VLOLFR GLUHFWHG HYROXWLRQ GLIIHUHQW DVSHFWV QHHG WR EH DGGUHVVHG )LUVW RQFH WKH XVHU KDV GHILQHG PXWDWLRQ VLWHV PXWDJHQHVLV PXVW EH LQWURGXFHG LQ DQ DXWRPDWHG IDVKLRQ ZLWKRXW KXPDQ LQWHUDFWLRQ 6HFRQG WKH VLPXODWLRQV PXVW EH UXQ HIILFLHQWO\ DQG LQ SDUDOOHO DQG WKH ,2 ORDG QHHGV WR EH FRQWUROOHG WR DYRLG ,2 RYHUORDG )LQDOO\ WKH GDWD KDG WR EH DQDO\]HG RQ WKH IO\ DQG WKH UHVXOWV QHHGHG WR EH DFFHVVLEOH GXULQJ WKH UXQ VHH DOVR )LJXUH  $OO WKH ZKLOH FDOFXODWLRQ WLPH PXVW EH NHSW WR D PLQLPXP 7KH VROXWLRQ IRU WKLV LV &$'(( &RPSXWHU$LGHG 'LUHFWHG (YROXWLRQ RI (Q]\PHV LV DEOH WR VROYH SRWHQWLDO FODVKHG E\ DGDSWLQJ URWDPHUV RI VXUURXQGLQJ VLGHFKDLQV

 0XWDWDJHQVLV 7KHUH DUH YDULRXV DSSURDFKHV IRU PRGHOLQJ LQ VLOLFR PXWDJHQHVLV 2QH ZHOO NQRZQ DSSURDFK LV WR XVH URWDPHU OLEUDULHV >   @ %HFDXVH WUD GLWLRQDO URWDPHU OLEUDULHV DUH LQFRQYHQLHQW WR XVH ZLWK DQ DXWRPDWLF SURJUDP

 VXFK DV &$'(( SDUWLFXODUO\ LQ FDVHV ZLWK LQHYLWDEOH FODVKHV DQ DXWRPDWLFVR OXWLRQ KDG WR EH IRXQG :H GHFLGHG WR XVH 6&:5/ DQ DXWRPDWHG WRRO WKDW LV XVHG WR SUHGLFW VLGH FKDLQ RULHQWDWLRQV EDVHG RQ URWDPHU GLVWULEXWLRQV

 (IILFLHQW 3DUDOOHO ([HFXWLRQ ,W LV REYLRXV WKDW KXQGUHGV RU WKRXVDQGV RI PXWDQWV QHHG WR EH UXQ LQ SDUDOOHO 6LQFH WKH LQGLYLGXDO VLPXODWLRQV RI PXWDQWV DUH LQGHSHQGHQW RI RQH DQRWKHU WKLV LV D SOHDVLQJO\ SDUDOOHO VFDOLQJ SUREOHP (YHU\ DGGLWLRQDO PXWDQW VWUXFWXUH FDQ EH UXQ RQ DQ DGGLWLRQDO FRUH $ 3\WKRQ VFULSW WKDW XVHV WKH PSLS\ PRG XOH ZDV LPSOHPHQWHG WR FRRUGLQDWH WKH SDUDOOHO FRPSXWDWLRQ PXWDQWVWUXFWXUHV &$'(( UHDGV D GLUHFWRU\ ZLWK LQSXW ILOHV DQG WKHQ UXQV HDFK VLPSDFN XQWLO D WLPH OLPLW LV UHDFKHG RU DOO WKH VLPSDFNV KDYH EHHQ FRPSXWHG

 $XWRPDWLF $QDO\VLV 8QOLNH LQ ZLOGW\SH VLPXODWLRQV ZKHQ D VLPXODWLRQ LV ILUVW UXQ DQG WKHQ DQD O\]HG WKLV ZRXOG KDYH EHHQ KLJKO\ XQSUDFWLFDO IRU XVDJH ZLWKLQ WKH &$'(( IUDPHZRUN ,W ZDV WKHUHIRUH HYLGHQW WKDW &$'(( ZRXOG DQDO\]H DQG FRPSUHVV UXQV RQ WKH IO\ ZKLOH UXQQLQJ 7R DGG FRPIRUW DQG DFFHVVLELOLW\ ZH GHFLGHG WR VWRUH WKH GDWD LQ DQ 64/LWH GDWDEDVH DQG UHDG WKH GDWD IURP WKH GDWDEDVH IRU WKH XVHU

 6XJJHVWHG :RUNIORZ &$'(( DVVLVWV WKH XVHU E\ SUHSDULQJ DQG UXQQLQJ IUHH HQHUJ\ FDOFXODWLRQV ZLWK LQ VLOLFR JHQHUDWHG HQ]\PH YDULDQWV ,W LV WKHUHIRUH FUXFLDO WKDW D ZHOO SDUDPHWHUL]HG UHIHUHQFH V\VWHP LV VHOHFWHG DV D VWDUWLQJ SRLQW )RU LOOXVWUDWLYH SXUSRVHV ZH SUHVHQW DQ H[DPSOH LQ VLOLFR GLUHFWHG HYROXWLRQ RI WULRVH SKRV SKDWH LVRPHUDVH 7,0  ZKHUH WKH UDWHOLPLWLQJ VWHS RI WKH SURWRQ DEVWUDFWLRQ UHDFWLRQ ZDV PRGHOHG VHH )LJXUH  IRU WKH JHQHUDO ZRUNIORZ ,QLWLDOO\ WKH XVHU VHOHFWV VLWHV IRU SRLQW VDWXUDWLRQ DQG WKLV FDQ IRU H[DPSOH EH DFKLHYHG E\ ILUVW UXQQLQJ DQ DODQLQH VFDQ 5HVXOWV RI WKLV LQLWLDO PXWDJHQHVLV URXQG VHH )LJXUH  ZHUH WKHQ XVHG WR SUHSDUH DQ LQGLYLGXDO VDWXUDWLRQ RIXVHU VHOHFWHG KRWVSRWV VDWXUDWLRQV RQ D VHOHFWLRQ RI UHVLGXHV LQ RXU H[DPSOH WKH WKUHH UHVLGXHV < / DQG 7  )LQDOO\ ZH VHOHFWHG IRU HDFK SRVLWLRQ D VXEVHW RI DPLQR DFLGV DQG GLG D FRPELQDWRULDO VDWXUDWLRQ VHH )LJXUH  )RU WKH IXOO GDWD VHW SOHDVH UHIHU WR 3DSHU ,9 1RWH WKDW WKLV ZRUNIORZ FDQ EH DGDSWHG HDVLO\ WR DGGLWLRQDO GDWD DERXW RI D V\VWHP ,I DQ HQ]\PH LV NQRZQ WR KDYH WZR RU WKUHH LQWHUHVWLQJ UHVLGXHV HJ KRWVSRWV  QR DODQLQH VFDQ RU LQGLYLGXDO VDWXUDWLRQ LV QHHGHG ,QVWHDG D FRPELQDWRULDO VDWXUDWLRQ FDQ EH SHUIRUPHG GLUHFWO\ HLWKHU ZLWK WKH FRPSOHWH  DPLQR DFLGV RU ZLWK FXVWRP DPLQR DFLG OLEUDU\

 )LJXUH  7KH VXJJHVWHG ZRUNIORZ XVLQJ &$'(( $IWHU WKH UHDFWLRQ PHFKDQLVP LV HVWDEOLVKHG DQG WKH UDWH OLPLWLQJ VWHS WKRURXJKO\ SDUDPHWHUL]HG VLWHV KDYH WR EH GHWHU PLQHG IRU H[DPSOH ZLWK DQ DODQLQH VFDQ 1H[W WKH GHWHUPLQHG VLWHV FDQ EH VXEMHFWHG WR LQ VLOLFR VLWHGLUHFWHG HYROXWLRQ 7KLV SURFHVV FDQ EH UHSHDWHG XQWLO WKH XVHU GHFLGHV WR HQG WKH LQ VLOLFR PXWDJHQHVLV 3XEOLVKHG LQ 3DSHU ,9 >@ ILJ   OLFHQVHG XQGHU && %<

,Q WKH QH[W FKDSWHU ZRUN RQ 6W(+ DQ HSR[LGH K\GURODVH WKDW KDV EHHQ VXE MHFW WR H[SHULPHQWDO GLUHFWHG HYROXWLRQ ZLOO EH SUHVHQWHG

 )LJXUH  5HVXOWV RI DQ DODQLQH VFDQ SHUIRUPHG DURXQG WKH DFWLYH VLWH RI WULRVH SKRV SKDWH LVRPHUDVH 7,0  ,QGLFDWHG LQ UHG DUH PXWDWLRQV WKDW ZHUH IRXQG WR KDYHDSDU WLFXODUO\ ODUJH LPSDFW RQ WKH DFWLYDWLRQ EDUULHU FRPSDUHG WR WKH ZLOGW\SH HQ]\PH IDU OHIW  'HULYHG IURP ILJ  RI 3DSHU ,9 >@ OLFHQVHG XQGHU && %<

)LJXUH  5HVXOWV RI FRPELQHG SRLQW PXWDJHQHVLV RQ WKUHH UHVLGXHV < / DQG 7 DURXQG WKH DFWLYH VLWH RI WULRVHSKRVSKDWH LVRPHUDVH 7,0  3XEOLVKHG LQ 3DSHU ,9 >@ ILJ   OLFHQVHG XQGHU && %<

  (9% $SSOLFDWLRQV

 3DSHU ,, ([SDQGLQJ WKH &DWDO\WLF 7ULDG LQ (SR[LGH +\GURODVHV DQG 5HODWHG (Q]\PHV (Q]\PH YDULDQWV RI 6W(+ ZHUH WHVWHG ZLWK WKH VXEVWUDWH 762 WR HVWDEOLVK DQG YDOLGDWH WKH UHDFWLRQ PHFKDQLVP ([SHULPHQWDO DQG FRPSXWDWLRQDO ZRUN KDV EHHQ FRPELQHG 0XWDJHQHVLV YDULDQWV RI WKH SURWHLQ (4 <) <) DQG +1 KDYH EHHQ WHVWHG E\ H[SHULPHQW DQG VLPXODWLRQ DQG NH\ ILQGLQJV DUH DV IROORZV + ZDV IRXQG WR EH FRQVHUYHG LQ VLPLODU VHTXHQFHV DQG ZH VXJJHVW WKDW LW DFWV DV FKDUJHUHOD\ < DQG < DUH FRQILUPHG DV EHLQJ LPSRUWDQW IRU R[\DQLRQ VWDELOL]DWLRQ 7KH FU\VWDO VWUXFWXUH RI +1 KDV UHYHDOHG WKDW WKLV HQ]\PH YDULDQW ODFNV WKH DFWLYHVLWH K\GURO\WLF ZDWHU

7KH ZRUN IRFXVHG RQ WKH JHQHUDO PHFKDQLVP DQG WKH SURWRQDWLRQ RI WKH DF WLYH VLWH UHVLGXHV SULPDULO\ WKH DFWLYHVLWH KLVWLGLQHV + DQG +  :KLOH + ZDV SUHYLRXVO\ VXJJHVWHG DV JHQHUDO EDVH LW ZDV DOVR DUJXHG WKDW LW LV SURWRQDWHG GXULQJ WKH QXFOHRSKLOLF DWWDFN >@ 2XU PHWKRGRORJ\ KHOSHG XV WR GHPRQVWUDWH LQ SDUWLFXODU WKDW + LV DOVR OLNHO\ WR EH FUXFLDO WR WKH UHDF WLRQ PHFKDQLVP :H IRXQG WKDW + LV KLJKO\ FRQVHUYHG DQG DFFRPSDQLHG E\ JOXWDPLF DFLG RU DVSDUWLF DFLG ,Q WKH FDVH RI 6W(+ LW LV JOXWDPDWH ( :H KDYH VKRZQ WKDW ERWK WKH +1 DQG (4 PXWDQWV KDYH VRPH UHVLGXDO DFWLYLW\ DQG WKH GRXEOH PXWDQW (4+1 ZDV IRXQG WR EH LQDFWLYH LQ WKH K\GURO\WLF KDOIUHDFWLRQ ZKLFK VXJJHVWV WKDW ( PD\ DFW DV D EDFNXS EDVH ,Q JHQHUDO DQ\ HSR[LGH FDQ EH RSHQHG DW ERWK RI LWV FDUERQV ,Q FDVH RI 6W(+ ZLWK 762 VXEVWUDWH WKH UHVXOWLQJ GLROV FDQQRW EH GLVWLQJXLVKHG H[SHUL PHQWDOO\ +RZHYHU FRPSXWDWLRQDOO\ WKLV LV SRVVLEOH DQG WKH HQHUJHWLF SURILOHV RI DOO IRXU YDULDQWV WZR HQDQWLRPHUV DQG WZR FDUERQV DUH GHSLFWHG LQ )LJXUH 

 3URWRQDWLRQ RI + DQG + :H IRXQG WKDW DQ DGGLWLRQDO KLVWLGLQH + DGMDFHQW WR WKH VXJJHVWHG JHQHUDO EDVH + ZDV LPSRUWDQW IRU WKH HQHUJHWLFV LQ SDUWLFXODU WKH IUHH HQHUJLHV :KLOH HDUOLHU VWXGLHV RQ HSR[LGH K\GURODVHV KDYH HLWKHU QHJOHFWHG + HQ WLUHO\ >@ RU KDYH NHSW LW QHXWUDO >@ ZH IRXQG WKDW WKH H[SHULPHQWDO HQ HUJHWLFV ZHUH RQO\ UHSURGXFLEOH FRPSXWDWLRQDOO\ ZKHQ + ZDV SURWRQDWHG

 )LJXUH  )UHHHQHUJ\ SURILOHV IRU ZLOGW\SH 6W(+ FDWDO\]HG K\GURO\VLV RI $ 55 762 DQG % 66 762 ,Q FRORU WKH FDOFXODWHG & DWWDFN EOXH & DWWDFN UHG DQG LQ JUH\ WKH H[SHULPHQWDOO\ GHULYHG HQHUJLHV 5HSULQWHG ZLWK SHUPLVVLRQ IURP UHI >@ &RS\ULJKW  $PHULFDQ &KHPLFDO 6RFLHW\

(DUOLHU VWXGLHV SUHGLFWHG WKH S.D ZLWK 3523.$ VXJJHVWLQJ WKDW WKH UHVLGXH HTXLYDOHQW RI + LQ WKH FU\VWDO VWUXFWXUH ZDV SURWRQDWHG >@ :H UHSUR GXFHG WKLV IRU 6W(+ ZLWK 3523.$ EXW ZH DGGLWLRQDOO\ GHWHUPLQHG WKH S.D DW GLIIHUHQW VWDJHV RI WKH UHDFWLRQ DQG ZH IRXQG WKDW WKH S.D RI + GURSSHG VLJ QLILFDQWO\ XSRQ IRUPDWLRQ RI WKH 0LFKDHOLV FRPSOH[ ,PSRUWDQWO\ WKLV GURS LQ WKH S.D ZDV QRW WKH FDVH IRU + (QFRXUDJHG E\ WKHVH ILQGLQJV ZH GLG D VH TXHQFH DOLJQPHQW ZLWK PDQXDOO\ FXUDWHG αβK\GURODVHV XVLQJ WKH (67+(5 GDWDEDVH >@ DQG ZH IRXQG WKDW + LV KLJKO\ FRQVHUYHG 7KLV VWURQJO\ VXJJHVWV WKDW + LV SURWRQDWHG GXULQJ FDWDO\VLV DQG + LV QHXWUDO VHH DOVR SDUW $ RI )LJXUH  

)LJXUH  6WUXFWXUHV LQ FRPSDULVRQ $ 6QDSVKRW RI DQ 0' VLPXODWLRQ ZLWK WKH DFWLYH VLWH ZLWK NH\ UHVLGXHV \HOORZ RI WKH ZLOGW\SH HQ]\PH ZLWK WKH FRYDOHQW LQ WHUPHGLDWH 762 SXUSOH  % 6XSHULPSRVHG FU\VWDO VWUXFWXUHV RI LPSRUWDQW UHVLGXHV RI ZLOGW\SH \HOORZ DQG +1 PXWDQW EOXH  5HSULQWHG ZLWK SHUPLVVLRQ IURP UHI >@ &RS\ULJKW  $PHULFDQ &KHPLFDO 6RFLHW\

  &RQVHUYDWLRQ RI ( DQG + $V GHVFULEHG LQ WKH SUHYLRXV VHFWLRQ ZH IRXQG WKDW + ZDV KLJKO\ FRQVHUYHG LQ HSR[LGH K\GURODVHV DQG αβK\GURODVH VHTXHQFHV :H DOVR IRXQG WKDW LI WKH UHVLGXH HTXLYDOHQW RI + ZDV D KLVWLGLQH WKH HTXLYDOHQW RI ( ZDV HLWKHUD JOXWDPDWH RU DQ DVSDUWDWH 7KLV WRJHWKHU ZLWK 3523.$ S.D SUHGLFWLRQV ZH DUULYHG DW WKH FRQFOXVLRQ WKDW ( DQG + ERWK QHHG WR EH LRQL]HG

 2[\DQLRQ +ROHV ,Q HDUOLHU ZRUN RQ HSR[LGH K\GURODVHV LW ZDV VXJJHVWHG WKDW WZR R[\DQLRQ KROHV SOD\ NH\ UROHV LQ WKH FDWDO\WLF PHFKDQLVP 7KH R[\DQLRQ KROH IRUPHG E\ WKH WZR W\URVLQHV << ZDV EHOLHYHG WR EH LPSRUWDQW IRU WKH ILUVW VWHS RI VWDELOL]LQJ WKH HDUOLHU HSR[LGHR[\JHQ DQG WKH R[\DQLRQ KROH IRUPHG E\ ) DQG : ZDV VXJJHVWHG WR EH LPSRUWDQW IRU WKH EUHDNGRZQ RI WKH LQWHUPHGLDWH DQG IRU VWDELOL]LQJ WKH R[\DQLRQ IRUPHG

 (4 +1 DQG 'RXEOH 0XWDQW ,W ZDV VXJJHVWHG WKDW + LV WKH JHQHUDO EDVH UHTXLUHG IRU DFWLYLW\ ([SHUL PHQWDO PXWDWLRQ RI + WR +1 UHYHDOHG KRZHYHU WKDW WKH HQ]\PH ZDV DEOH WR UHWDLQ VRPH DFWLYLW\ :H VXJJHVWHG WKDW ( FRXOG EH WKH EDFNXS EDVH DQG ZH KDYH VKRZQ H[SHULPHQWDOO\ WKDW WKH (4+1 GRXEOHPXWDQW ZDV LQDFWLYH LQ WKH K\GURO\WLF VWHS 2QH LPSRUWDQW ILQGLQJ ZLWK WKH +1 PXWDQW ZDV WKDW WKH FU\VWDO VWUXFWXUH RI LW UHYHDOHG D UHDUUDQJHPHQW RI WKH DFWLYHVLWH LQ WKH +1 PXWDQW DQG LPSRUWDQWO\ WKH ZDWHU FRQVXPHG LQ WKH FDWDO\WLF F\FOH ZDV QR ORQJHU REVHUYHG LQ WKLV HQ]\PH YDULDQW VHH DOVR )LJXUH  

7KH JHQHUDO UHDFWLRQ PHFKDQLVP ZDV HVWDEOLVKHG ZLWK WKLV SDUW RI WKH ZRUN ,Q WKH QH[W VHFWLRQ ZH LQYHVWLJDWHG WKH HQDQWLRFRQYHUJHQW EHKDYLRU RI 6W(+ ZLWK 62 VXEVWUDWH

  3DSHU ,,, &RQIRUPDWLRQDO 'LYHUVLW\ DQG (QDQWLRFRQYHUJHQFH LQ 3RWDWR (SR[LGH +\GURODVH , (DUOLHU H[SHULPHQWV IRXQG WKDW WKH ZLOGW\SH 6W(+ ZDV GLVSOD\LQJ HQDQWLR FRQYHUJHQFH ZLWK WKH VXEVWUDWH VW\UHQH R[LGH 62 WR \LHOG SULPDULO\ 5  SKHQ\OHWKDQHGLRO $IWHU WKH UHDFWLRQ PHFKDQLVP ZDV HVWDEOLVKHG ZLWK WKH 762 VXEVWUDWH VHH 3DSHU ,,  ZH VHW RXW WR LQYHVWLJDWH WKH HQDQWLRFRQYHUJHQFHRI 6W(+ DQG WKH VXEVWUDWH 62 ZDV VHOHFWHG WR LQYHVWLJDWH HQDQWLRFRQYHUJHQFH

,W ZDV GHPRQVWUDWHG HDUOLHU WKDW 6W(+ LV DEOH WR GR HQDQWLRFRQYHUJHQW FDWDO\VLV RI 62 DQG LQ WKLV DUWLFOH ZH FRPELQHG H[SHULPHQWDO FRPSXWDWLRQDO DQG FU\VWDOORJUDSKLF ZRUN WR XQGHUVWDQG DQG UHHQJLQHHU WKH HQ]\PH WR VKLIW WKH DFWLYLW\ RI WKH HQ]\PH 7KH HQHUJ\ SURILOH LV VKRZQ LQ ILJ  $V FDQ EH VHHQ IURP RXU HQHUJ\ FDOFXODWLRQV ZH FDQ H[SODLQ VWHUHRVHOHFWLYLW\ :H IRXQG WKDW WKH DGGLWLRQDO FRRUGLQDWLRQ PRGHV WKH VXEVWUDWH KDV LQ WKH DFWLYH VLWH KDV D PDMRU LPSDFW RQ WKH HQHUJHWLF SURILOH FRPSDUHG WR 762

)LJXUH  )UHH HQHUJ\ SURILOHV IRU WKH K\GURO\VLV RI ZLOGW\SH 6W(+ DQG 5&% PXWDQW VHH VHFWLRQ  LQ WKH ORZHVW HQHUJ\ ELQGLQJ PRGH IRU HDFK HQDQWLRPHU 5HSULQWHG ZLWK SHUPLVVLRQ IURP UHI >@  3XEOLVKHG E\ 7KH 5R\DO 6RFLHW\ RI &KHP LVWU\

 (QJLQHHUHG 9DULDQW DQG &U\VWDO 6WUXFWXUH $V QRWHG HDUOLHU WKH ZLOGW\SH HQ]\PH ZDV HQDQWLRFRQYHUJHQW WKH 62VXEVWUDWH QDPHO\ FRQYHUWLQJ ERWK 6 DQG 5  HQDQWLRPHU WR 5 GLRO 7KLV PRWLYDWHG H[SHULPHQWDO UHVHDUFK WR LQYHUW WKH VHOHFWLYLW\ VHH )LJXUH  

 $ PXWDQW HQ]\PH ZDV HQJLQHHUHG WR LQYHUW WKH HQDQWLRVHOHFWLYLW\ RI WKH HQ ]\PH 7KLV HQ]\PH YDULDQW WKH TXDGUXSOH PXWDQW 5&% KDV IRXU PXWD WLRQV :/ /< 9. DQG ,9 $V FDQ EH VHHQ IURP )LJXUH  WKH UHVLGXHV DUH ORFDWHG FORVH WR WKH DFWLYH VLWH

)LJXUH  &RPSDULVRQ RI WKH ZLOGW\SH HQ]\PH DQG WKH 5&% YDULDQW 7KH EDFN ERQH WKH FDWDO\WLFDOO\ LPSRUWDQW UHVLGXHV DQG WKH PXWDWHG UHVLGXHV DUH GLVSOD\HG LQ \HOORZ DQG DUH IURP WKH ZLOGW\SH FU\VWDO VWUXFWXUH 3'%,' &-3  'LVSOD\HG LQ SXU SOH DUH WKH VXSHULPSRVHG VXEVWLWXWHG UHVLGXHV RI 5&% IURP WKH FU\VWDO VWUXFWXUH 3'%,' 8)1 

 %LQGLQJ 0RGHV 8QOLNH WKH 762 VXEVWUDWH 62 GRHV QRW ILOO WKH DFWLYH VLWH RI WKH HQ]\PH 7KHUH IRUH WKH HQ]\PH FDQ ELQG WKH VXEVWUDWH LQ GLIIHUHQW PRGHV VHH )LJXUH  2XU FDOFXODWLRQV KDYH VKRZQ WKDW WKH : PXWDWLRQ ZDV YHU\ LPSRUWDQW IRU WKH LQYHUVLRQ RI HQDQWLRVHOHFWLYLW\ DQG DW WKH VDPH WLPH RWKHU PXWDWLRQV KDYH PDGH WKH VLPXODWLRQ RI WKH HQ]\PH PRUH GLIILFXOW WR PRGHO IRU H[DPSOH WKH 9. PXWDWLRQ

 $ %

)LJXUH  7KH 5 VW\UHQH R[LGH VXEVWUDWH VKRZLQJ WKH WZR DGGLWLRQDO ELQGLQJ PRGHV WKH DV\PHWULF 62 KDV RYHU 762 LH 762 KDV RQH RULHQWDWLRQ EHFDXVH IRU 762 $ %  1RWH WKDW : ZDV VXEMHFWHG WR PXWDJHQHVLV LQ 5&% DQG LV OHXFLQH

  &RQFOXVLRQV DQG )XWXUH 3HUVSHFWLYH

,Q VKRUW WKLV ZRUN LV DQ H[DPSOH IRU KRZ FODVVLFDO DSSURDFKHV DOORZ IRU HFR QRPLFDO H[WHQVLYH FRQIRUPDWLRQDO VDPSOLQJ 8VLQJ WKH (9% DSSURDFK KDV HQ DEOHG PH WR REWDLQ D PROHFXODU XQGHUVWDQGLQJ RI FRPSOH[ HQ]\PDWLF V\VWHPV , KDYH DOVR LQFUHDVHG WKH UHDFK RI WKH (9% DSSURDFK WKURXJK WKH LPSOHPHQ WDWLRQ RI &$'(( ZKLFK HQDEOHV HIILFLHQW DQG KLJKO\ SDUDOOHO LQ VLOLFR WHVWLQJ RI KXQGUHGVWRWKRXVDQGV RI LQGLYLGXDO HQ]\PH YDULDQWV 0HWKRGV EDVHG RQ WKH (9% KDYH EHHQ FRPELQHG ZLWK H[SHULPHQWDO ZRUN 8QGHUVWDQGLQJ RI WKH UHDFWLRQ PHFKDQLVP WKH UHJLR DQG WKH HQDQWLRVHOHFWLYLW\ RI 6W(+ KDYH EHHQ DFKLHYHG

1HZ 0HWDO 3DUDPHWHUV ,Q WKLV WKHVLV QHZ PHWDO SDUDPHWHUV ZHUH GHYHORSHG DQG GHPRQVWUDWHG H[SDQG LQJ WKH FDSDELOLWLHV RI FODVVLFDO PROHFXODU PRGHOLQJ 7KHVH SDUDPHWHUV KDYH IRXQG ZLGH DSSOLFDWLRQV DQG KDYH EHHQ XVHG DOVR E\ FRDXWKRUV >±@ 7KH SDUDPHWHUV KDYH DGGLWLRQDOO\ EHHQ H[SDQGHG WR FRYHU WKH -DKQ7HOOHU HI IHFW E\ /LDR HW DO >@

1HZ 0HFKDQLVP 7KH XVDJH RI (9% WR PRGHO UHDFWLRQV DOORZHG WR REWDLQ VXIILFLHQW VDPSOLQJ LQ 6W(+ DQG WR HVWDEOLVK WKH UHDFWLRQ PHFKDQLVP IRU WKLV HQ]\PH LQ FORVH FROODE RUDWLRQ ZLWK H[SHULPHQWDO FRZRUNHUV :LWK (9% ZH KDYH EHHQ DEOH WR XQFRYHU WKH LPSRUWDQW UROH WKH VHFRQG DFWLYH VLWH KLVWLGLQH IRU FDWDO\VLV 3523.$ KDV EHHQ XVHG WR HVWLPDWH WKH S.DV RI WKH LQYROYHG UHVLGXHV DQG VXSSRUWV RXU FODLP $GGLWLRQDOO\ ZH KDYH IRXQG WKDW UHODWHG HQ]\PHV RI WKH Į  ȕ K\GURODVH IDP LO\ KDYH D YHU\ KLJK FRQVHUYDWLRQ RI WKLV DFWLYHVLWH KLVWLGLQH 7KLV H[SDQGV WKH UHOHYDQFH RI RXU PHFKDQLVWLF ILQGLQJV IURP 6W(+ WR DQ HQWLUH IDPLO\ RI HQ]\PHV 0RGHOLQJ WKH K\GURO\VLV RI 762 DOORZHG XV WR H[SORUH PHFKDQLVP ZLWK UHGXFHG FRPSOH[LW\ EHFDXVH OHVV ELQGLQJ PRGHV QHHGHG WR EH FRQVLGHUHG $GGLWLRQDOO\ H[SHULPHQWDO NLQHWLF GDWD ZDV DYDLODEOH IRU WKH ZLOGW\SH HQ]\PH DQG GLIIHUHQW HQ]\PH YDULDQWV IRU WKLV VXEVWUDWH

1HZ 8QGHUVWDQGLQJ RI (QDQWLRFRQYHUJHQFH 7R LQYHVWLJDWH HQDQWLRVHOHFWLYLW\ DQG HQDQWLRFRQYHUJHQFH ZH KDYH PRGHOHG WKH HQ]\PH ZLWK WKH VPDOOHU VW\UHQH R[LGH VXEVWUDWH (DUOLHU H[SHULPHQWDO ZRUN

 KDV VKRZQ WKDW 6W(+ ZDV DEOH WR SHUIRUP HQDQWLRFRQYHUJHQW FDWDO\VLV ZLWK WKLV VXEVWUDWH $GGLWLRQDOO\ DQ H[SHULPHQWDO VWUXFWXUH RI D YDULDQW RI WKH VDPH HQ]\PH KDV EHHQ VROYHG WKDW GLVSOD\HG LQYHUWHG HQDQWLRFRQYHUJHQFH 7KLV ZDV KHQFH DQ LGHDO PRGHO V\VWHP DQG ZH KDYH HPSOR\HG (9% WR LQYHVWLJDWH WKH UHDFWLRQ PHFKDQLVP RI WKLV HQ]\PH 2XU FDOFXODWLRQV KDYH VKRZQ WKDW WKH : PXWDWLRQ ZDV YHU\ LPSRUWDQW IRU WKH LQYHUVLRQ RI HQDQWLRVHOHFWLYLW\ DQG DW WKH VDPH WLPH RWKHU PXWDWLRQV KDYH PDGH WKH VLPXODWLRQ RI WKH HQ]\PH PRUH GLIILFXOW WR PRGHO IRU H[DPSOH WKH 9. PXWDWLRQ

1HZ 6RIWZDUH 7R H[SDQG WKH UHDFK RI FRPSXWDWLRQDO PRGHOLQJ &$'(( D IUDPHZRUN IRU WKH &RPSXWHU$LGHG 'LUHFWHG (YROXWLRQ RI (Q]\PHV KDV EHHQ GHYHORSHG :LWK &$'(( D WRRO LV QRZ DYDLODEOH WR WKH VFLHQWLILF FRPPXQLW\ WKDW JUDQWV HDV\ DF FHVV WRZDUGV LQ VLOLFR HYROXWLRQ RI HQ]\PHV )RU QRZ &$'(( LV LPSOHPHQWHG VXFK WKDW WKH XVHU VXSSOLHV &$'(( ZLWK D UHIHUHQFH HQ]\PH $SSOLFDWLRQ RI &$'(( RQ V\VWHPV WKDW KDYH EHHQ LQYHVWLJDWHG ZLWK (9% HDUOLHU FRXOG EH D WHVW ILHOG WR GR H[SORUDWLRQV ZLWK LQ VLOLFR GLUHFWHG HYROXWLRQ &$'(( LV KRZHYHU QRW OLPLWHG WR LQ VLOLFR HYROXWLRQ DQG FDQ DOWHUQDWLYHO\ EH XVHG WR FRQYHQLHQWO\ FROOHFW D ODUJH DPRXQW RI VDPSOLQJ IRU D VLQJOH VSHFLILF HQ]\PH YDULDQW

3HUVRQDO 3HUVSHFWLYH 7R HQDEOH D ZLGHU VFRSH RI &$'(( , VXJJHVW WR IRFXV RQ VKRUWHQLQJ WKH DY HUDJH VLPXODWLRQ WLPHV PDFKLQH OHDUQLQJ DUWLILFLDO LQWHOOLJHQFH  7KLV FRXOG DOORZ IRU WULSOH SRLQW PXWDWLRQV   YDULDQWV DQG SRVVLEO\ HYHQ TXDGUX SOH SRLQW VDWXUDWLRQ RQ D UHGXFHG VHW RI DPLQR DFLGV HJ   YDULDQWV  )RU WKLV DGGLWLRQDO FRPSXWDWLRQDO UHVRXUFHV ZRXOG EH QHHGHG DQG &$'(( ZLOO KDYH WR SUHSDUH LQSXWV RQ WKH IO\ $GGLWLRQDOO\ VLJQLILFDQWO\ OHVV GDWD PXVW EH UHWDLQHG

)XWXUH 3HUVSHFWLYH 7KLV ZRUN KDV VKRZQ PRGHUQ DSSOLFDWLRQV RI PHWKRGV EDVHG RQ FODVVLFDO PROHF XODU PRGHOLQJ 7KH FRPSXWDWLRQDO PRGHOLQJ RI HQ]\PHV UHPDLQV FKDOOHQJLQJ DQG ZLWK PRGHUQ KDUG DQG VRIWZDUH WKH SRVVLELOLWLHV DUH H[WHQGHG

 6. Populärvetenskaplig Sammanfattning

Modellering av Enzymkatalys och Nya Möjligheter för Datorberäkningar I min avhandling har jag använt datorer för att studera och förklara kemiska fenomen på molekylär nivå. De senaste åren har datorer lett till en våldsam teknisk utveckling såväl i privatlivet som inom vetenskapen, vilken jag dragit nytta av inom min forskning.

Jag har använt en metod för att utnyttja de tillgängliga beräkningsresurserna på det mest effektiva sättet. Detta är möjligt främst då vår metod förenklar det studerade systemet.

”Så enkelt som möjligt, men inte enklare!” - Albert Einstein

Vi använder en klassisk beskrivning av de atomära systemen. Vi simulerar molekyler bestående av atomer och bindingar som kulor och fjädrar, vilken i motsats till kvantmekaniska beskrivningar innebär att elektronkonfiguration- erna inte beräknas exakt. Detta behövs för att beräkna molekyler och reak- tioner så noggrant som möjligt. Kvantmekaniska är dock mycket beräkningsintensiva och därför kan vissa egenskaper hos stora molekyler inte beräknas tillräckligt noggrant eller med tillräckligt många upprepningar (för biologiska system måste olika startkonfigurationer provas och systemet måste ges en viss tid att komma jämvikt). Med den förenklade klassiska beskrivnin- gen har jag forskat på atomer, molekyler och reaktioner. Jag har publicerat fyra arbeten inom följande tre områden:

För det första har jag tillsammans med ett antal kollegor arbetat med simu- lering av metalljoner med klassiska molekylära simuleringar. Vi har tagit fram parametrar med den så kallade Octahedral Dummy Model. Det finns redan sedan tidigare modeller för metalljoner som dock inte varit lämpade att beskriva jonernas egenskaper i enzymer. De parametrar vi publicerat kan beskriva experimentellt bestämda egenskaper som solveringsenergi, koordi- nationsnummer och avstånd mellan en metalljon och det omgivande vattnet. Särskilt viktigt för att kunna beskriva reaktioner är att vår modell kan genomgå ligand-utbyte: Det är möjligt att byta ut till exempel en vattenmolekyl mot en annan bindningspartner. Förutom detta är vår modell också enkel att överföra till andra system. Dessa egenskaper har tidigare modeller inte kunna uppfylla med samma noggrannhet och utan begränsningar.

51 För det andra har jag forskat på ett enzym från potatis. Enzym är biokatalys- atorer som, likt en katalysator i en bil, gör en kemisk reaktion snabbare. Detta potatisenzym kan omvandla så kallade epoxider mycket selektivt. Epoxider är molekyler som har en tre-ring av två kolatomer och en syreatom. Epoxider kan användas som råmaterial till olika finkemikalier och läkemedel. I min första artikel om StEH1, Solanum tuberosum epoxidhydrolas, har jag undersökt i detalj hur enzymet genomför reaktionen. Vi har funnit och publicerat reak- tionsmekanismen, och dessutom funnit att det är mycket sannolikt att en viss histidin aminosyra i enzymet är joniserad, samt att detta histidin också finns på motsvarande plats i närbesläktade enzymer. I tidigare datorstudier av lik- nande system har detta histidin ignorerats eller modellerats felaktigt. Genom mitt nästa steg, och tredje vetenskapliga arbete, har vi kunnat förstå en mycket intressant egenskap hos StEH1 bättre. Denna egenskap kallas enantiokonver- gens. Enkelt förklarat finns det molekyler som är spegelbilder av varandra. I fall man genomför en reaktion och får två spegelvända produkter men endast är intresserad av den ena, så förlorar man en stor del av produkten! Därför använ- der man ofta så kallade enantioselektiva katalysatorer vilka främst omvandlar substratet till den ena produkten (och inte dess spegelbild). StEH1, vårt pota- tisenzym, är en sådan. Men mycket mer spännande för oss var att StEH1 kan omvandla två spegelvända molekyler till en enda produkt, därav benämningen enantiokonvergens. Experiment med StEH1 har visat att selektiviteten till och med kan vändas. Vi har därför undersökt enantioselektiviteten hos StEH1. I vår modell har vi kunnat återskapa fenomenet och bättre kunnat förklara varför StEH1 är enantiokonvergent.

I mitt tredje projekt, och fjärde publikation, har jag utvecklat CADEE, vilket står för Computer-Aided Directed Evolution of Enzymes (ung. datorstödd styrd evolution av enzym). CADEE är ett vetenskapligt verktyg för att mer effektivt och mycket snabbare än tidigare simulera enzymvarianter. Med verktyget är det möjligt att inom några veckor skapa hundra- eller tusentals enzymvarianter och göra förutsägelser om hur de kommer bete sig i experiment. Slutligen har jag visat med ett exempel hur enkelt CADEE är att använda.

Sammanfattningsvis har jag med datorers hjälp studerat enzymer och genom CADEE utvidgat möjligheterna att förstå komplexa enzymatiska system.

52  $FNQRZOHGJPHQWV

, ZDQW WR WKDQN DOO WKH SHRSOH ZKR KDYH EHHQ WKHUH IRU PH :LWKRXW KHOS IURP DOO RI \RX WKLV ZRUN ZRXOG KDYH EHHQ LPSRVVLEOH WR DFKLHYH 

)LUVW , ZRXOG OLNH WR WKDQN P\ VXSHUYLVRU /\QQ .DPHUOLQ :LWKRXW \RXU KHOS \RXU VROLG GHFLVLRQV DQG ERWK \RXU GULYH DQG WUXVW WKLV ZRUN ZRXOG KDYH KDUGO\ EHHQ SRVVLEOH

6HFRQG , ZRXOG OLNH WR WKDQN P\ FRVXSHUYLVRU 0LNDHO :LGHUVWHQ,WZDV LPSRUWDQW WR PH WR KDYH D VWURQJ OLQN WR WKH H[SHULPHQWDO IDFWV DQG IRU VHLQJ WKH SRVLWLYH VLGH RI UHVXOWV

7KLUG , ZRXOG OLNH WR WKDQN P\ FRZRUNHUV )HUQDQGD 3DXO cVD 0LKD $OH[ )DELDQ ,UHN

, WKDQN WKH (XURSHDQ 5HVHDUFK &RXQFLO (5& IRU IXQGLQJ WKLV ZRUN

, DOVR WKDQN WKH 6ZHGLVK 1DWLRQDO ,QIUDVWUXFWXUH IRU &RPSXWLQJ 61,& IRU WKH FRPSXWDWLRQ WLPH JUDQWHG RQ 8330$; +3&1 DQG 16& 6SHFLDO WKDQNV WR 0DWV .URQHQEHUJ 16& IRU WKH VFDOLQJ WHVWV SHUIRUPHG RQ 7ULROLWK

, ZRXOG OLNH WR WKDQN 'DYLG YDQ GHU 6SRHO IRU KHOSIXO IHHGEDFN DW GLIIHUHQW VWDJHV GXULQJ P\ 3K' $QG , ZRXOG OLNH WR WKDQN 3HWHU .DVVRQ IRU IUXLWIXO GLVFXVVLRQV UHJDUGLQJ VWDWLVWLFV DQG $,

, ZRXOG OLNH WR WKDQN ,&0 %0& 8SSVDOD DQG 6ZHGHQ IRU ZHOFRPLQJ PH DQG DOORZLQJ PH WR FRQGXFW WKLV ZRUN , HQMR\HG WKH WLPH LQ WKLV IULHQGO\ DQG VDIH SODFH DQG , DP IDVFLQDWHG E\ WKH ZLOGHUQHVV RQ WKH GRRUVWHS , WKDQN DOO WKH SHRSOH , KDYH PHW KHUH DQG , DP JUDWHIXO WKDW , ZDV DOORZHG WR VSHQG WKLV WLPH RI P\ OLIH LQ WKLV XQXVXDO FRXQWU\

, WKDQN P\ FROOHDJXHV DW WKH .DPHUOLQ /DE )HUQDQGD )RU EHLQJ P\ ³VFLHQWLILF ROGHU VLVWHU´ DQG \RX EHFDPH D JRRG IULHQG 7KDQN \RX IRU WHDFKLQJ PH KRZ \RX WDFNOH FKDOOHQJHV $OH[

 EHHQ D ³IDQWDVWLVFK´ WUDLOEOD]HU 7KDQN \RX HVSHFLDOO\ IRU VHWWOLQJ PH LQ 3DXO 7KDQN \RX IRU ZRUNLQJ ZLWK PH RQ 6W(+ ,FK WKDQN \RX IRU RXU GLVFXVVLRQV WKDW KHOSHG WR LPSURYH RXU ZRUN 0LKD 7KDQN \RX IRU GHYHORSLQJ 4VFULSWV DQG DOORZLQJ PH WR LQWHUIDFH WKHP  $QG WKDQNV IRU DOO WKH IUXLW .ODXGLD ,UHN 7KDQNV IRU WKH ERDUG JDPHV DW \RXU SODFH DQG WKDQNV IRU VKDULQJ %DWWOHVWDU *DODFWLFD D JUHDW JDPH

7KH 327$7,6 7HDP cVD 3DXO 6KHUU\ DQG $ULQD 6RODQXP WXEHURVXP 7KDQN \RX IRU VKDULQJ NQRZOHGJH DVNLQJ LQWHUHVWLQJ TXHVWLRQV DQG VKDULQJ 7KH 4 7HDP ,UHN 3DXO 0LKD $OH[ DQG 0DXULFLR 7KDQNV IRU \RXU FRQ WULEXWLRQV WR WKH FRGH , WKDQN IRU IHHGEDFN IURP )DELDQ DQG 3DXO RQ WKLV ZRUN $OVR , ZRXOG OLNH WR WKDQN 4XLQJKXD DQG 'HQQLV IRU IHHGEDFN RQ SDUWV RI WKLV ZRUN , WKDQN $UYLG IRU KHOS ZLWK WUDQVODWLQJ WKH VDPPDQIDWWQLQJ

7KH IROORZLQJ SHRSOH KDYH FRQWULEXWHG WR WKLV ZRUN LQ D PRUH LQGLUHFW ZD\ , DP WKDQNIXO IRU WKH PRUQLQJ ILNDV ZLWK =D]D DQG IRU SODQQLQJ DQG DFFRXQW DELOLW\ 7KDQNV %URWKHUKRRG IRU KHOSLQJ ZLWK WKH FRQVWDQW HYROYLQJ DQG FRQ VWUXFWLYH ZRUN ZH GLG WRJHWKHU

7$&. $//$ 

  5HIHUHQFHV

>@ 6 5 -DPHV 5 : 'HQQHOO $ 6 *LOEHUW + 7 /HZLV - $ - *RZOHWW 7 ) /\QFK : & 0F*UHZ & 5 3HWHUV * * 3RSH $ % 6WDKO DQG RWKHUV +RPLQLG 8VH RI )LUH LQ WKH /RZHU DQG 0LGGOH 3OHLVWRFHQH $ 5HYLHZ RI WKH (YLGHQFH >DQG &RPPHQWV DQG 5HSOLHV@ &XUUHQW $QWKURSRORJ\   ±  >@ ) %HUQD 3 *ROGEHUJ / . +RUZLW] - %ULQN 6 +ROW 0 %DPIRUG DQG 0 &KD]DQ 0LFURVWUDWLJUDSKLF (YLGHQFH RI LQ 6LWX )LUH LQ WKH $FKHXOHDQ 6WUDWD RI :RQGHUZHUN &DYH 1RUWKHUQ &DSH 3URYLQFH 6RXWK $IULFD 3URFHHGLQJV RI WKH 1DWLRQDO $FDGHP\ RI 6FLHQFHV   (±( 0D\  >@ 0 6DOTXH 3 , %RJXFNL - 3\]HO , 6RENRZLDN7DEDND 5 *U\JLHO 0 6]P\W DQG 5 3 (YHUVKHG (DUOLHVW (YLGHQFH IRU &KHHVH 0DNLQJ LQ WKH 6L[WK 0LOOHQQLXP %F LQ 1RUWKHUQ (XURSH 1DWXUH   ± 'HFHPEHU  >@ 5 3 (YHUVKHG 6 3D\QH $ * 6KHUUDWW 0 6 &RSOH\ - &RROLGJH ' 8UHP.RWVX . .RWVDNLV 0 g]GR÷DQ $ ( g]GR÷DQ 2 1LHXZHQKX\VH 3 0 0 * $NNHUPDQV ' %DLOH\ 55 $QGHHVFX 6 &DPSEHOO 6 )DULG , +RGGHU 1 @ -/ /HJUDV ' 0HUGLQRJOX -0 &RUQXHW DQG ) .DUVW %UHDG %HHU DQG :LQH 6DFFKDURP\FHV &HUHYLVLDH 'LYHUVLW\ 5HIOHFWV +XPDQ +LVWRU\ 0ROHFXODU (FRORJ\   ± 0D\  >@ 3 ( 0F*RYHUQ - =KDQJ - 7DQJ = =KDQJ * 5 +DOO 5 $ 0RUHDX $ 1XxH] ( ' %XWU\P 0 3 5LFKDUGV &V :DQJ DQG RWKHUV )HUPHQWHG %HYHUDJHV RI 3UHDQG 3URWR+LVWRULF &KLQD 3URFHHGLQJV RI WKH 1DWLRQDO $FDGHP\ RI 6FLHQFHV RI WKH 8QLWHG 6WDWHV RI $PHULFD   ±  >@ ) +DEHU 7KH 6\QVWKHVLV RI $PPRQLD IURP LWV (OHPHQWV -XQH  >@ & %RVFK 7KH 'HYHORSPHQW RI WKH &KHPLFDO +LJK 3UHVVXUH 0HWKRG 'XULQJ WKH (VWDEOLVKPHQW RI WKH 1HZ $PPRQLD ,QGXVWU\ 1REHO /HFWXUHV &KHPLVWU\ ±  >@ & %RVFK 3URFHVV RI 3URGXFLQJ $PPRQLD $SULO  >@ 0 $SSO $PPRQLD  3URGXFWLRQ 3URFHVVHV ,Q :LOH\9&+ 9HUODJ *PE+ &R .*D$ HGLWRU 8OOPDQQ¶V (QF\FORSHGLD RI ,QGXVWULDO &KHPLVWU\ :LOH\9&+ 9HUODJ *PE+ &R .*D$ :HLQKHLP *HUPDQ\ 2FWREHU  ,6%1  >@ + 0OOHU 6XOIXULF $FLG DQG 6XOIXU 7ULR[LGH ,Q :LOH\9&+ 9HUODJ *PE+ &R .*D$ HGLWRU 8OOPDQQ¶V (QF\FORSHGLD RI ,QGXVWULDO &KHPLVWU\ :LOH\9&+ 9HUODJ *PE+ &R .*D$ :HLQKHLP *HUPDQ\ -XQH  ,6%1 

 >@ & . 6DYLOH - 0 -DQH\ ( & 0XQGRUII - & 0RRUH 6 7DP : 5 -DUYLV - & &ROEHFN $ .UHEEHU ) - )OHLW] - %UDQGV 3 1 'HYLQH * : +XLVPDQ DQG * - +XJKHV %LRFDWDO\WLF $V\PPHWULF 6\QWKHVLV RI &KLUDO $PLQHV IURP .HWRQHV $SSOLHG WR 6LWDJOLSWLQ 0DQXIDFWXUH 6FLHQFH    -XO\  >@ 5 :ROIHQGHQ DQG 0 - 6QLGHU 7KH 'HSWK RI &KHPLFDO 7LPH DQG WKH 3RZHU RI (Q]\PHV DV &DWDO\VWV $FFRXQWV RI &KHPLFDO 5HVHDUFK   ± 'HFHPEHU  >@ 6 5HEVGDW DQG ' 0D\HU (WK\OHQH 2[LGH ,Q :LOH\9&+ 9HUODJ *PE+ &R .*D$ HGLWRU 8OOPDQQ¶V (QF\FORSHGLD RI ,QGXVWULDO &KHPLVWU\ :LOH\9&+ 9HUODJ *PE+ &R .*D$ :HLQKHLP *HUPDQ\ 0DUFK  ,6%1  >@ + (\ULQJ 7KH $FWLYDWHG &RPSOH[ LQ &KHPLFDO 5HDFWLRQV 7KH -RXUQDO RI &KHPLFDO 3K\VLFV   ± )HEUXDU\  >@ % %RUKDQ $ ' -RQHV ) 3LQRW ' ) *UDQW 0 - .XUWK DQG % ' +DPPRFN 0HFKDQLVP RI 6ROXEOH (SR[LGH +\GURODVH )250$7,21 2) $1 α+<'52;< (67(5(1=<0( ,17(50(',$7( 7+528*+ $VS -RXUQDO RI %LRORJLFDO &KHPLVWU\   ±  >@ ( 3 &DUSHQWHU . %HLV $ ' &DPHURQ DQG 6 ,ZDWD 2YHUFRPLQJ WKH &KDOOHQJHV RI 0HPEUDQH 3URWHLQ &U\VWDOORJUDSK\ &XUUHQW 2SLQLRQ LQ 6WUXFWXUDO %LRORJ\   ± 2FWREHU  >@ + - '\VRQ DQG 3 ( :ULJKW ,QWULQVLFDOO\ 8QVWUXFWXUHG 3URWHLQV DQG 7KHLU )XQFWLRQV 1DWXUH 5HYLHZV 0ROHFXODU &HOO %LRORJ\   ± 0DUFK  >@ % $ $PUHLQ 3 %DXHU ) 'XDUWH c -DQIDON &DUOVVRQ $ 1DZRU\WD 6 / 0RZEUD\ 0 :LGHUVWHQ DQG 6 & / .DPHUOLQ ([SDQGLQJ WKH &DWDO\WLF 7ULDG LQ (SR[LGH +\GURODVHV DQG 5HODWHG (Q]\PHV $&6 &DWDO\VLV    ± 2FWREHU  >@ 4 &XL DQG 0 .DUSOXV 7ULRVHSKRVSKDWH ,VRPHUDVH $ 7KHRUHWLFDO &RPSDULVRQ RI $OWHUQDWLYH 3DWKZD\V -RXUQDO RI WKH $PHULFDQ &KHPLFDO 6RFLHW\   ± 0DUFK  >@ ) 'XDUWH - cTYLVW 1 + :LOOLDPV DQG 6 & / .DPHUOLQ 5HVROYLQJ $SSDUHQW &RQIOLFWV EHWZHHQ 7KHRUHWLFDO DQG ([SHULPHQWDO 0RGHOV RI 3KRVSKDWH 0RQRHVWHU +\GURO\VLV -RXUQDO RI WKH $PHULFDQ &KHPLFDO 6RFLHW\   ± -DQXDU\  >@ ' 5|WKOLVEHUJHU 2 .KHUVRQVN\ $ 0 :ROODFRWW / -LDQJ - 'H&KDQFLH - %HWNHU - / *DOODKHU ( $ $OWKRII $ =DQJKHOOLQL 2 '\P 6 $OEHFN . 1 +RXN ' 6 7DZILN DQG ' %DNHU .HPS (OLPLQDWLRQ &DWDO\VWV E\ &RPSXWDWLRQDO (Q]\PH 'HVLJQ 1DWXUH   ± 0D\  >@ + .ULHV 5 %ORPEHUJ DQG ' +LOYHUW 'H 1RYR (Q]\PHV E\ &RPSXWDWLRQDO 'HVLJQ &XUUHQW 2SLQLRQ LQ &KHPLFDO %LRORJ\   ± $SULO  >@ : - $OEHU\ DQG - 5 .QRZOHV )UHH(QHUJ\ 3URILOH IRU WKH 5HDFWLRQ &DWDO\]HG E\ 7ULRVHSKRVSKDWH ,VRPHUDVH %LRFKHPLVWU\   ±  >@ , $ 5RVH : - )XQJ DQG - 9 :DUPV 3URWRQ 'LIIXVLRQ LQ WKH $FWLYH 6LWH RI 7ULRVHSKRVSKDWH ,VRPHUDVH %LRFKHPLVWU\   ± 

 >@ 6 + .RHQLJ DQG 5 ' %URZQ +FR DV 6XEVWUDWH IRU &DUERQLF $QK\GUDVH LQ WKH 'HK\GUDWLRQ RI +&2 3URFHHGLQJV RI WKH 1DWLRQDO $FDGHP\ RI 6FLHQFHV   ±  >@ 5 $ -HQVHQ (Q]\PH 5HFUXLWPHQW LQ (YROXWLRQ RI 1HZ )XQFWLRQ $QQXDO 5HYLHZV LQ 0LFURELRORJ\   ±  >@ 6 * 3HLVDMRYLFK DQG ' 6 7DZILN 3URWHLQ (QJLQHHUV 7XUQHG (YROXWLRQLVWV 1DWXUH PHWKRGV   ±  >@ - 0 7KRPVRQ ( $ *DXFKHU 0 ) %XUJDQ ' : 'H .HH 7 /L - 3 $ULV DQG 6 $ %HQQHU 5HVXUUHFWLQJ $QFHVWUDO $OFRKRO 'HK\GURJHQDVHV IURP @ 3 &DUERQHOO DQG -/ )DXORQ 0ROHFXODU 6LJQDWXUHV%DVHG 3UHGLFWLRQ RI (Q]\PH 3URPLVFXLW\ %LRLQIRUPDWLFV   ± $XJXVW  >@ ) 'XDUWH % $ $PUHLQ DQG 6 & / .DPHUOLQ 0RGHOLQJ &DWDO\WLF 3URPLVFXLW\ LQ WKH $ONDOLQH 3KRVSKDWDVH 6XSHUIDPLO\ 3K\VLFDO &KHPLVWU\ &KHPLFDO 3K\VLFV     >@ 6 5 0F*XIIHH DQG $ + (OFRFN 'LIIXVLRQ &URZGLQJ DPS 3URWHLQ 6WDELOLW\ LQ D '\QDPLF 0ROHFXODU 0RGHO RI WKH %DFWHULDO &\WRSODVP 3/R6 &RPSXWDWLRQDO %LRORJ\   H 0DUFK  >@ 6 % =LPPHUPDQ DQG 6 2 7UDFK (VWLPDWLRQ RI 0DFURPROHFXOH &RQFHQWUDWLRQV DQG ([FOXGHG 9ROXPH (IIHFWV IRU WKH &\WRSODVP RI (VFKHULFKLD &ROL -RXUQDO RI PROHFXODU ELRORJ\   ±  >@ * : +XLVPDQ DQG 6 - &ROOLHU 2Q WKH 'HYHORSPHQW RI 1HZ %LRFDWDO\WLF 3URFHVVHV IRU 3UDFWLFDO 3KDUPDFHXWLFDO 6\QWKHVLV &XUUHQW 2SLQLRQ LQ &KHPLFDO %LRORJ\   ± $SULO  >@ 7 1DUDQFLF 5 'DYLV - 1LNRGLQRYLF5XQLF DQG . ( 2¶ &RQQRU 5HFHQW 'HYHORSPHQWV LQ %LRFDWDO\VLV %H\RQG WKH /DERUDWRU\ %LRWHFKQRORJ\ /HWWHUV   ± 0D\  >@ - 5 5HLG 7 &RROEHDU - 6 $\HUV DQG . 3 &RROEHDU 7KH $FWLRQ RI &K\PRVLQ RQ N&DVHLQ DQG LWV 0DFURSHSWLGH (IIHFW RI S+ DQG $QDO\VLV RI 3URGXFWV RI 6HFRQGDU\ +\GURO\VLV ,QWHUQDWLRQDO 'DLU\ -RXUQDO    ± $XJXVW  >@ 2 : *RRGLQJ 5 9RODGUL $ %DXWLVWD 7 +RSNLQV * +XLVPDQ 6 -HQQH 6 0D ( & 0XQGRUII 0 0 6DYLOH 6 - 7UXHVGHOO DQG - : :RQJ 'HYHORSPHQW RI D 3UDFWLFDO %LRFDWDO\WLF 3URFHVV IRU 5 0HWK\OSHQWDQRO 2UJDQLF 3URFHVV 5HVHDUFK 'HYHORSPHQW   ± -DQXDU\  >@ ) +DVDQ $ $ 6KDK DQG $ +DPHHG ,QGXVWULDO $SSOLFDWLRQV RI 0LFURELDO /LSDVHV (Q]\PH DQG 0LFURELDO 7HFKQRORJ\   ± -XQH  >@ <+ 3HUFLYDO =KDQJ 0 ( +LPPHO DQG - 5 0LHOHQ] 2XWORRN IRU &HOOXODVH ,PSURYHPHQW 6FUHHQLQJ DQG 6HOHFWLRQ 6WUDWHJLHV %LRWHFKQRORJ\ $GYDQFHV   ± 6HSWHPEHU  >@ 0 0HU] 7 (LVHOH 3 %HUHQGV ' $SSHO 6 5DEH , %ODQN 7 6WUHVVOHU DQG / )LVFKHU )ODYRXU]\PH DQ (Q]\PH 3UHSDUDWLRQ ZLWK ,QGXVWULDO 5HOHYDQFH $XWRPDWHG 1LQH6WHS 3XULILFDWLRQ DQG 3DUWLDO &KDUDFWHUL]DWLRQ RI (LJKW (Q]\PHV -RXUQDO RI $JULFXOWXUDO DQG )RRG &KHPLVWU\   ± -XQH  >@ 0 - 9DQ 'HU 0DDUHO % 9DQ 'HU 9HHQ - & 8LWGHKDDJ + /HHPKXLV DQG / 'LMNKXL]HQ 3URSHUWLHV DQG $SSOLFDWLRQV RI 6WDUFK&RQYHUWLQJ (Q]\PHVRI

 WKH α$P\ODVH )DPLO\ -RXUQDO RI ELRWHFKQRORJ\   ±  >@ $ 6FKPLG - 6 'RUGLFN % +DXHU $ .LHQHU 0 :XEEROWV DQG % :LWKROW ,QGXVWULDO %LRFDWDO\VLV 7RGD\ DQG 7RPRUURZ 1DWXUH   ±  >@ 8 7 %RUQVFKHXHU * : +XLVPDQ 5 - .D]ODXVNDV 6 /XW] - & 0RRUH DQG . 5RELQV (QJLQHHULQJ WKH 7KLUG :DYH RI %LRFDWDO\VLV 1DWXUH   ± 0D\  >@ $ 6 %RPPDULXV %LRFDWDO\VLV $ 6WDWXV 5HSRUW $QQXDO 5HYLHZ RI &KHPLFDO DQG %LRPROHFXODU (QJLQHHULQJ   ± -XO\  >@ - 'DPERUVN\ DQG - %UH]RYVN\ &RPSXWDWLRQDO 7RROV IRU 'HVLJQLQJ DQG (QJLQHHULQJ (Q]\PHV &XUUHQW 2SLQLRQ LQ &KHPLFDO %LRORJ\ ± $SULO  >@ * .LVV 1 dHOHELgOoP 5 0RUHWWL ' %DNHU DQG . 1 +RXN &RPSXWDWLRQDO (Q]\PH 'HVLJQ $QJHZDQGWH &KHPLH ,QWHUQDWLRQDO (GLWLRQ  ± 0D\  >@ 0 3 )UXVKLFKHYD 0 - 0LOOV 3 6FKRSI 0 . 6LQJK 5 % 3UDVDG DQG $ :DUVKHO &RPSXWHU $LGHG (Q]\PH 'HVLJQ DQG &DWDO\WLF &RQFHSWV &XUUHQW 2SLQLRQ LQ &KHPLFDO %LRORJ\ ± $XJXVW  >@ . ĝZLGHUHN , 7XxyQ 9 0ROLQHU DQG - %HUWUDQ &RPSXWDWLRQDO 6WUDWHJLHV IRU WKH 'HVLJQ RI 1HZ (Q]\PDWLF )XQFWLRQV $UFKLYHV RI %LRFKHPLVWU\ DQG %LRSK\VLFV ± 6HSWHPEHU  >@ ;' .RQJ 4 0D - =KRX %% =HQJ DQG -+ ;X $ 6PDUW /LEUDU\ RI (SR[LGH +\GURODVH 9DULDQWV DQG WKH 7RS +LWV IRU 6\QWKHVLV RI 6 β%ORFNHU 3UHFXUVRUV $QJHZDQGWH &KHPLH ,QWHUQDWLRQDO (GLWLRQ   ± -XQH  >@ / 7 /DXJKOLQ +) 7]HQJ 6 /LQ DQG 5 1 $UPVWURQJ 0HFKDQLVP RI 0LFURVRPDO (SR[LGH +\GURODVH 6HPLIXQFWLRQDO 6LWH6SHFLILF 0XWDQWV $IIHFWLQJ WKH $ON\ODWLRQ +DOI5HDFWLRQ %LRFKHPLVWU\   ±  >@ 5 5LQN DQG ' % -DQVVHQ .LQHWLF 0HFKDQLVP RI WKH (QDQWLRVHOHFWLYH &RQYHUVLRQ RI 6W\UHQH 2[LGH E\ (SR[LGH +\GURODVH IURP $JUREDFWHULXP UDGLREDFWHU $' %LRFKHPLVWU\   ± 'HFHPEHU  >@ 7 :DWDEH DQG 6 .DQHKLUD 6ROXELOL]DWLRQ RI (SR[LGH +\GURODVH IURP /LYHU 0LFURVRPHV &KHPLFDO 3KDUPDFHXWLFDO %XOOHWLQ   ±  >@ 0 , 0RQWHUGH 0 /RPEDUG $ $UFKHODV $ &URQLQ 0 $UDQG DQG 5 )XUVWRVV (Q]\PDWLF 7UDQVIRUPDWLRQV 3DUW  (QDQWLRFRQYHUJHQW %LRK\GURO\VLV RI 6W\UHQH 2[LGH 'HULYDWLYHV &DWDO\VHG E\ WKH 6RODQXP 7XEHURVXP (SR[LGH +\GURODVH 7HWUDKHGURQ $V\PPHWU\   ± 6HSWHPEHU  >@ 0 .RWLN $ $UFKHODV DQG 5 :RKOJHPXWK (SR[LGH +\GURODVHV DQG WKHLU $SSOLFDWLRQ LQ 2UJDQLF 6\QWKHVLV &XUUHQW 2UJDQLF &KHPLVWU\    ± )HEUXDU\  >@ $ $UFKHODV DQG 5 )XUVWRVV 6\QWKHWLF $SSOLFDWLRQV RI (SR[LGH +\GURODVHV &XUUHQW RSLQLRQ LQ FKHPLFDO ELRORJ\   ±  >@ 5 6 &DKQ & ,QJROG DQG 9 3UHORJ 6SHFLILFDWLRQ RI 0ROHFXODU &KLUDOLW\ $QJHZDQGWH &KHPLH ,QWHUQDWLRQDO (GLWLRQ LQ (QJOLVK   ± 

 >@ &6 &KHQ :5 6KLHK 3+ /X 6 +DUULPDQ DQG &< &KHQ 0HWDEROLF 6WHUHRLVRPHULF ,QYHUVLRQ RI ,EXSURIHQ LQ 0DPPDOV %LRFKLPLFD HW %LRSK\VLFD $FWD %%$ 3URWHLQ 6WUXFWXUH DQG 0ROHFXODU (Q]\PRORJ\   ±  >@ 5 *UHHQH DQG + ( $ )DUUDQ 13KWKDO\O *OXWDPLF ,PLGH %ULWLVK 0HGLFDO -RXUQDO   ± 1RYHPEHU  >@ - + .LP DQG $ 5 6FLDOOL 7KDOLGRPLGH 7KH 7UDJHG\ RI %LUWK 'HIHFWV DQG WKH (IIHFWLYH 7UHDWPHQW RI 'LVHDVH 7R[LFRORJLFDO 6FLHQFHV   ± -XO\  >@ * %ODVFKNH + 3 .UDIW . )LFNHQWVFKHU DQG ) .|KOHU &KURPDWRJUDSKLF VHSDUDWLRQ RI UDFHPLF WKDOLGRPLGH DQG WHUDWRJHQLF DFWLYLW\ RI LWV HQDQWLRPHUV DXWKRU¶V WUDQVO  $U]QHLPLWWHO)RUVFKXQJ   ±  >@ , $JUDQDW + &DQHU DQG - &DOGZHOO 3XWWLQJ &KLUDOLW\ WR :RUN 7KH 6WUDWHJ\ RI &KLUDO 6ZLWFKHV 1DWXUH 5HYLHZV 'UXJ 'LVFRYHU\   ± 2FWREHU  >@ 6 . 7HR . ( 5HV]WDN 0 $ 6FKHIIOHU . $ .RRN - % =HOGLV ' , 6WLUOLQJ DQG 6 ' 7KRPDV 7KDOLGRPLGH LQ WKH 7UHDWPHQW RI /HSURV\ 0LFUREHV DQG ,QIHFWLRQ   ±  >@ $ . 6WHZDUW +RZ 7KDOLGRPLGH :RUNV $JDLQVW &DQFHU 6FLHQFH     -DQXDU\  >@ & 0RULVVHDX DQG % ' +DPPRFN (SR[LGH +\GURODVHV 0HFKDQLVPV ,QKLELWRU 'HVLJQV DQG %LRORJLFDO 5ROHV $QQXDO 5HYLHZ RI 3KDUPDFRORJ\ DQG 7R[LFRORJ\   ± )HEUXDU\  >@ 6 / 0RZEUD\ / 7 (OIVWU|P . 0 $KOJUHQ & ( $QGHUVVRQ DQG 0 :LGHUVWHQ ;UD\ 6WUXFWXUH RI 3RWDWR (SR[LGH +\GURODVH 6KHGV /LJKW RQ 6XEVWUDWH 6SHFLILFLW\ LQ 3ODQW (Q]\PHV 3URWHLQ 6FLHQFH   ± -XO\  >@ / 7 (OIVWU|P DQG 0 :LGHUVWHQ &DWDO\VLV RI 3RWDWR (SR[LGH +\GURODVH 67(+ %LRFKHPLFDO -RXUQDO   ± 6HSWHPEHU  >@ ' /LQGEHUJ $ *RJROO DQG 0 :LGHUVWHQ 6XEVWUDWH'HSHQGHQW +\VWHUHWLF %HKDYLRU LQ 6W(+&DWDO\]HG +\GURO\VLV RI 6W\UHQH 2[LGH 'HULYDWLYHV (Q]\PH&DWDO\]HG 6W\UHQH 2[LGH +\GURO\VLV )(%6 -RXUQDO    ± 'HFHPEHU  >@ 0 / %HQGHU DQG % : 7XUQTXHVW *HQHUDO %DVLF &DWDO\VLV RI (VWHU +\GURO\VLV DQG ,WV 5HODWLRQVKLS WR (Q]\PDWLF +\GURO\VLV -RXUQDO RI WKH $PHULFDQ &KHPLFDO 6RFLHW\   ± $SULO  >@ c - &DUOVVRQ 3 %DXHU + 0D DQG 0 :LGHUVWHQ 2EWDLQLQJ 2SWLFDO 3XULW\ IRU 3URGXFW 'LROV LQ (Q]\PH&DWDO\]HG (SR[LGH +\GURO\VLV &RQWULEXWLRQV IURP &KDQJHV LQ ERWK (QDQWLR DQG 5HJLRVHOHFWLYLW\ %LRFKHPLVWU\    ± 6HSWHPEHU  >@ 5 6 0XOOLNHQ (OHFWURQLF 3RSXODWLRQ $QDO\VLV RQ /&$2±02 0ROHFXODU :DYH )XQFWLRQV , 7KH -RXUQDO RI &KHPLFDO 3K\VLFV   ± 2FWREHU  >@ * ( 6FXVHULD 7 3 +DPLOWRQ DQG + ) 6FKDHIHU $Q $VVHVVPHQW IRU WKH )XOO &RXSOHG &OXVWHU 0HWKRG ,QFOXGLQJ $OO 6LQJOH 'RXEOH DQG 7ULSOH ([FLWDWLRQV 7KH 'LDWRPLF 0ROHFXOHV /LK /L %K /LI & %HR &Q %I 1R DQG ) 7KH -RXUQDO RI &KHPLFDO 3K\VLFV   ± -DQXDU\ 

 >@ . +DOG ) 3DZáRZVNL 3 -¡UJHQVHQ DQG & +lWWLJ &DOFXODWLRQ RI )UHTXHQF\'HSHQGHQW 3RODUL]DELOLWLHV 8VLQJ WKH $SSUR[LPDWH &RXSOHG&OXVWHU 7ULSOHV 0RGHO &F 7KH -RXUQDO RI &KHPLFDO 3K\VLFV   ± -DQXDU\  >@ ) 6KLPRMR 5 . .DOLD $ 1DNDQR DQG 3 9DVKLVKWD /LQHDU6FDOLQJ 'HQVLW\)XQFWLRQDO7KHRU\ &DOFXODWLRQV RI (OHFWURQLF 6WUXFWXUH %DVHG RQ 5HDO6SDFH *ULGV 'HVLJQ $QDO\VLV DQG 6FDODELOLW\ 7HVW RI 3DUDOOHO $OJRULWKPV &RPSXWHU 3K\VLFV &RPPXQLFDWLRQV   ± 1RYHPEHU  >@ 0 *LOODQ ' %RZOHU $ 7RUUDOED DQG 7 0L\D]DNL 2UGHU1 )LUVW3ULQFLSOHV &DOFXODWLRQV ZLWK WKH &RQTXHVW &RGH &RPSXWHU 3K\VLFV &RPPXQLFDWLRQV   ± -XO\  >@ 0 $ULWD ' 5 %RZOHU DQG 7 0L\D]DNL 6WDEOH DQG (IILFLHQW /LQHDU 6FDOLQJ )LUVW3ULQFLSOHV 0ROHFXODU '\QDPLFV IRU  $WRPV -RXUQDO RI &KHPLFDO 7KHRU\ DQG &RPSXWDWLRQ   ± 'HFHPEHU  >@ ' 5 %RZOHU DQG 7 0L\D]DNL &DOFXODWLRQV IRU 0LOOLRQV RI $WRPV ZLWK 'HQVLW\ )XQFWLRQDO 7KHRU\ /LQHDU 6FDOLQJ 6KRZV ,WV 3RWHQWLDO -RXUQDO RI 3K\VLFV &RQGHQVHG 0DWWHU    )HEUXDU\  >@ : .RKQ 'HQVLW\ )XQFWLRQDO 7KHRU\ IRU 6\VWHPV RI 9HU\ 0DQ\ $WRPV ,QWHUQDWLRQDO -RXUQDO RI 4XDQWXP &KHPLVWU\   ±  >@ ' 5 %RZOHU DQG 7 0L\D]DNL 2 Q 0HWKRGV LQ (OHFWURQLF 6WUXFWXUH &DOFXODWLRQV 5HSRUWV RQ 3URJUHVV LQ 3K\VLFV    0DUFK  >@ 4 &XL DQG 0 (OVWQHU 'HQVLW\ )XQFWLRQDO 7LJKW %LQGLQJ 9DOXHV RI 6HPL(PSLULFDO 0HWKRGV LQ DQ $E ,QLWLR (UD 3K\VLFDO &KHPLVWU\ &KHPLFDO 3K\VLFV     >@ 9 0OêQVNê 3 %DQiã - âSRQHU 0 : YDQ GHU .DPS $ - 0XOKROODQG DQG 0 2W\HSND &RPSDULVRQ RI DE ,QLWLR ')7 DQG 6HPLHPSLULFDO 4000 $SSURDFKHV IRU 'HVFULSWLRQ RI &DWDO\WLF 0HFKDQLVP RI +DLUSLQ 5LER]\PH -RXUQDO RI &KHPLFDO 7KHRU\ DQG &RPSXWDWLRQ   ± $SULO  >@ : 7KLHO 6HPLHPSLULFDO 4XDQWXP&KHPLFDO 0HWKRGV 6HPLHPSLULFDO 4XDQWXP&KHPLFDO 0HWKRGV :LOH\ ,QWHUGLVFLSOLQDU\ 5HYLHZV &RPSXWDWLRQDO 0ROHFXODU 6FLHQFH   ± 0DUFK  >@ . + +RSPDQQ DQG ) +LPR 7KHRUHWLFDO 6WXG\ RI WKH )XOO 5HDFWLRQ 0HFKDQLVP RI +XPDQ 6ROXEOH (SR[LGH +\GURODVH &KHPLVWU\  $ (XURSHDQ -RXUQDO   ± 6HSWHPEHU  >@ 0 ( 6 /LQG DQG ) +LPR 4XDQWXP &KHPLFDO 0RGHOLQJ RI (QDQWLRFRQYHUJHQF\ LQ 6ROXEOH (SR[LGH +\GURODVH $&6 &DWDO\VLV    ± 'HFHPEHU  >@ $ & 7 YDQ 'XLQ 6 'DVJXSWD ) /RUDQW DQG : $ *RGGDUG 5HD[)) $ 5HDFWLYH )RUFH )LHOG IRU +\GURFDUERQV 7KH -RXUQDO RI 3K\VLFDO &KHPLVWU\ $   ± 2FWREHU  >@ / 0LFKDHOLV DQG 0 / 0HQWHQ 'LH .LQHWLN GHU ,QYHUWLQZLUNXQJ %LRFKHP ]     >@ 6 ) $OWVFKXO : *LVK : 0LOOHU ( : 0\HUV DQG ' - /LSPDQ %DVLF /RFDO $OLJQPHQW 6HDUFK 7RRO -RXUQDO RI PROHFXODU ELRORJ\   ± 

 >@ ) 6LHYHUV $ :LOP ' 'LQHHQ 7 - *LEVRQ . .DUSOXV : /L 5 /RSH] + 0F:LOOLDP 0 5HPPHUW - 6RGLQJ - ' 7KRPSVRQ DQG ' * +LJJLQV )DVW 6FDODEOH *HQHUDWLRQ RI +LJK4XDOLW\ 3URWHLQ 0XOWLSOH 6HTXHQFH $OLJQPHQWV 8VLQJ &OXVWDO 2PHJD 0ROHFXODU 6\VWHPV %LRORJ\   ± $SULO  >@ * ( &URRNV * +RQ -0 &KDQGRQLD DQG 6 ( %UHQQHU :HE/RJR $ 6HTXHQFH /RJR *HQHUDWRU *HQRPH UHVHDUFK   ±  >@ * - :KLWURZ 7KH 0DVV RI WKH 8QLYHUVH 1DWXUH ±  >@ 3 $ 5RPHUR DQG ) + $UQROG ([SORULQJ 3URWHLQ )LWQHVV /DQGVFDSHV E\ 'LUHFWHG (YROXWLRQ 1DWXUH 5HYLHZV 0ROHFXODU &HOO %LRORJ\   ± 'HFHPEHU  >@ < *XPXO\D - 6DQFKLV DQG 0 7 5HHW] 0DQ\ 3DWKZD\V LQ /DERUDWRU\ (YROXWLRQ &DQ /HDG WR ,PSURYHG (Q]\PHV +RZ WR (VFDSH IURP /RFDO 0LQLPD &KHP%LR&KHP   ± 0D\  >@ 0 7 5HHW] ' .DKDNHDZ DQG 5 /RKPHU $GGUHVVLQJ WKH 1XPEHUV 3UREOHP LQ 'LUHFWHG (YROXWLRQ &KHP%LR&KHP   ± -XO\  >@ 0 6KLUWV DQG 9 6 3DQGH 6FUHHQ 6DYHUV RI WKH :RUOG 8QLWH 6FLHQFH    'HFHPEHU  >@ ' ( 6KDZ - *URVVPDQ - $ %DQN % %DWVRQ - $ %XWWV - & &KDR 0 0 'HQHURII 5 2 'URU $ (YHQ & + )HQWRQ $ )RUWH - *DJOLDUGR * *LOO % *UHVNDPS & 5 +R ' - ,HUDUGL / ,VHURYLFK - 6 .XVNLQ 5 + /DUVRQ 7 /D\PDQ /6 /HH $ . /HUHU & /L ' .LOOHEUHZ . 0 0DFNHQ]LH 6 <+ 0RN 0 $ 0RUDHV 5 0XHOOHU / - 1RFLROR - / 3HWLFRODV 7 4XDQ ' 5DPRW - . 6DOPRQ ' 3 6FDUSD]]D 8 % 6FKDIHU 1 6LGGLTXH & : 6Q\GHU - 6SHQJOHU 3 7 3 7DQJ 0 7KHREDOG + 7RPD % 7RZOHV % 9LWDOH 6 & :DQJ DQG & @ . /LQGRUII/DUVHQ 6 3LDQD 5 2 'URU DQG ' ( 6KDZ +RZ )DVW)ROGLQJ 3URWHLQV )ROG 6FLHQFH    2FWREHU  >@ 3 / )UHGGROLQR & % +DUULVRQ < /LX DQG . 6FKXOWHQ &KDOOHQJHV LQ 3URWHLQ)ROGLQJ 6LPXODWLRQV 1DWXUH 3K\VLFV   ± 2FWREHU  >@ - & .(1'5(: * %2'2 + 0 ',17=,6 5 * 3$55,6+ + :<&.2)) DQG ' & 3+,//,36 $ 7KUHH'LPHQVLRQDO 0RGHO RI WKH 0\RJORELQ 0ROHFXOH 2EWDLQHG E\ ;UD\ $QDO\VLV 1DWXUH    ± 0DUFK  >@ 3 (PVOH\ DQG . &RZWDQ &RRW 0RGHO%XLOGLQJ 7RROV IRU 0ROHFXODU *UDSKLFV $FWD &U\VWDOORJUDSKLFD 6HFWLRQ ' %LRORJLFDO &U\VWDOORJUDSK\  ± 'HFHPEHU  >@ + 0 %HUPDQ - :HVWEURRN = )HQJ * *LOOLODQG 7 1 %KDW + :HLVVLJ , 1 6KLQG\DORY DQG 3 ( %RXUQH 7KH 3URWHLQ 'DWD %DQN 1XFOHLF $FLGV 5HVHDUFK   ± -DQXDU\  >@ 0 ) +DQWNH ' +DVVH ) 5 1 & 0DLD 7 (NHEHUJ . -RKQ 0 6YHQGD 1 ' /RK $ 9 0DUWLQ 1 7LPQHDQX ' 6 ' /DUVVRQ * YDQ GHU 6FKRW * + &DUOVVRQ 0 ,QJHOPDQ - $QGUHDVVRQ ' :HVWSKDO 0 /LDQJ ) 6WHOODWR ' 3 'H3RQWH 5 +DUWPDQQ 1 .LPPHO 5 $ .LULDQ 0 0

 6HLEHUW . 0KOLJ 6 6FKRUE . )HUJXVRQ & %RVWHGW 6 &DUURQ - ' %R]HN ' 5ROOHV $ 5XGHQNR 6 (SS + 1 &KDSPDQ $ %DUW\ - +DMGX DQG , $QGHUVVRQ +LJK7KURXJKSXW ,PDJLQJ RI +HWHURJHQHRXV &HOO 2UJDQHOOHV ZLWK DQ ;UD\ /DVHU 1DWXUH 3KRWRQLFV   ± 1RYHPEHU  >@ $ 0HUN $ %DUWHVDJKL 6 %DQHUMHH 9 )DOFRQLHUL 3 5DR 0 , 'DYLV 5 3UDJDQL 0 % %R[HU / $ (DUO - / 0LOQH DQG 6 6XEUDPDQLDP %UHDNLQJ &U\R(0 5HVROXWLRQ %DUULHUV WR )DFLOLWDWH 'UXJ 'LVFRYHU\ &HOO   ± -XQH  >@ 9 % &KHQ : % $UHQGDOO - - +HDGG ' $ .HHG\ 5 0 ,PPRUPLQR * - .DSUDO / : 0XUUD\ - 6 5LFKDUGVRQ DQG ' & 5LFKDUGVRQ 0RO3URELW\ $OO$WRP 6WUXFWXUH 9DOLGDWLRQ IRU 0DFURPROHFXODU &U\VWDOORJUDSK\ $FWD &U\VWDOORJUDSKLFD 6HFWLRQ ' %LRORJLFDO &U\VWDOORJUDSK\   ± -DQXDU\  >@ 0 + 0 2OVVRQ & 5 6¡QGHUJDDUG 0 5RVWNRZVNL DQG - + -HQVHQ 3523.$ &RQVLVWHQW 7UHDWPHQW RI ,QWHUQDO DQG 6XUIDFH 5HVLGXHV LQ (PSLULFDO S . D 3UHGLFWLRQV -RXUQDO RI &KHPLFDO 7KHRU\ DQG &RPSXWDWLRQ  ± )HEUXDU\  >@ % 6 $KORRZDOLD 0 0DOXV]\QVNL DQG . 1LFKWHUOHLQ *OREDO ,PSDFW RI 0XWDWLRQ'HULYHG 9DULHWLHV (XSK\WLFD   ±  >@ $ 7KRPDHXV $ 1DZRU\WD 6 / 0RZEUD\ DQG 0 :LGHUVWHQ 5HPRYDO RI 'LVWDO 3URWHLQ:DWHU +\GURJHQ %RQGV LQ D 3ODQW (SR[LGH +\GURODVH ,QFUHDVHV &DWDO\WLF 7XUQRYHU EXW 'HFUHDVHV 7KHUPRVWDELOLW\ 3URWHLQ 6FLHQFH    ± -XO\  >@ : +XPSKUH\ $ 'DONH DQG . 6FKXOWHQ 90' 9LVXDO 0ROHFXODU '\QDPLFV -RXUQDO RI PROHFXODU JUDSKLFV   ±  >@ 7 *RXOG % +DUULQJWRQ 1 +XUVW DQG 0HQ7D/JX< ,QNVFDSH >@ 5 / 'XQEUDFN 5RWDPHU /LEUDULHV LQ WKH VW &HQWXU\ &XUUHQW 2SLQLRQ LQ 6WUXFWXUDO %LRORJ\    ±   >@ 0 9 6KDSRYDORY DQG 5 / 'XQEUDFN $ 6PRRWKHG %DFNERQH'HSHQGHQW 5RWDPHU /LEUDU\ IRU 3URWHLQV 'HULYHG IURP $GDSWLYH .HUQHO 'HQVLW\ (VWLPDWHV DQG 5HJUHVVLRQV 6WUXFWXUH   ± -XQH  >@ ( ) 3HWWHUVHQ 7 ' *RGGDUG & & +XDQJ * 6 &RXFK ' 0 *UHHQEODWW ( & 0HQJ DQG 7 ( )HUULQ 8&6) &KLPHUD  D 9LVXDOL]DWLRQ 6\VWHP IRU ([SORUDWRU\ 5HVHDUFK DQG $QDO\VLV -RXUQDO RI &RPSXWDWLRQDO &KHPLVWU\  ± 2FWREHU  >@ 6FKU|GLQJHU //& 7KH 2SHQ6RXUFH 3\02/ 0ROHFXODU *UDSKLFV 6\VWHP 9HUVLRQ [ 1RYHPEHU  >@ * * .ULYRY 0 9 6KDSRYDORY DQG 5 / 'XQEUDFN ,PSURYHG 3UHGLFWLRQ RI 3URWHLQ 6LGH&KDLQ &RQIRUPDWLRQV ZLWK 6&:5/ 3URWHLQV 6WUXFWXUH )XQFWLRQ DQG %LRLQIRUPDWLFV   ± 'HFHPEHU  >@ 0 * (YDQV DQG 0 3RODQ\L 6RPH $SSOLFDWLRQV RI WKH 7UDQVLWLRQ 6WDWH 0HWKRG WR WKH &DOFXODWLRQ RI 5HDFWLRQ 9HORFLWLHV (VSHFLDOO\ LQ 6ROXWLRQ 7UDQVDFWLRQV RI WKH )DUDGD\ 6RFLHW\ ±  >@ . - /DLGOHU DQG 0 & .LQJ 'HYHORSPHQW RI 7UDQVLWLRQ6WDWH 7KHRU\ 7KH -RXUQDO RI 3K\VLFDO &KHPLVWU\   ± -XO\  >@ 0 - $EUDKDP 7 0XUWROD 5 6FKXO] 6 3iOO - & 6PLWK % +HVV DQG ( /LQGDKO *520$&6 +LJK 3HUIRUPDQFH 0ROHFXODU 6LPXODWLRQV 7KURXJK

 0XOWL/HYHO 3DUDOOHOLVP IURP /DSWRSV WR 6XSHUFRPSXWHUV 6RIWZDUH;  ± 6HSWHPEHU  >@ % +HVV & .XW]QHU ' YDQ GHU 6SRHO DQG ( /LQGDKO *520$&6  $OJRULWKPV IRU +LJKO\ (IILFLHQW /RDG%DODQFHG DQG 6FDODEOH 0ROHFXODU 6LPXODWLRQ -RXUQDO RI &KHPLFDO 7KHRU\ DQG &RPSXWDWLRQ   ± 0DUFK  >@ '$ &DVH 50 %HW] : %RWHOOR6PLWK '6 &HUXWWL 7( &KHDWKDP ,,, 7$ 'DUGHQ 5( 'XNH 7- *LHVH + *RKONH $: *RHW] 1 +RPH\HU 6 ,]DGL 3 -DQRZVNL - .DXV $ .RYDOHQNR 76 /HH 6 /H*UDQG 3 /L & /LQ 7 /XFKNR 5 /XR % 0DGHM ' 0HUPHOVWHLQ .0 0HU] * 0RQDUG + 1JX\HQ +7 1JX\HQ , 2PHO\DQ $ 2QXIULHY '5 5RH $ 5RLWEHUJ & 6DJXL &/ 6LPPHUOLQJ - 6ZDLOV 5& :DONHU - :DQJ 50 :ROI ; :X / ;LDR '0 @ % 5 %URRNV & / %URRNV $ ' 0DFNHUHOO / 1LOVVRQ 5 - 3HWUHOOD % 5RX[ < :RQ * $UFKRQWLV & %DUWHOV 6 %RUHVFK $ &DIOLVFK / &DYHV 4 &XL $ 5 'LQQHU 0 )HLJ 6 )LVFKHU - *DR 0 +RGRVFHN : ,P . .XF]HUD 7 /D]DULGLV - 0D 9 2YFKLQQLNRY ( 3DFL 5 : 3DVWRU & % 3RVW - = 3X 0 6FKDHIHU % 7LGRU 5 0 9HQDEOH + / :RRGFRFN ; :X : @ - 0DUHOLXV . .ROPRGLQ , )HLHUEHUJ DQG - cTYLVW 4 $ 0ROHFXODU '\QDPLFV 3URJUDP IRU )UHH (QHUJ\ &DOFXODWLRQV DQG (PSLULFDO 9DOHQFH %RQG 6LPXODWLRQV LQ %LRPROHFXODU 6\VWHPV -RXUQDO RI 0ROHFXODU *UDSKLFV DQG 0RGHOOLQJ   ±  >@ % $ $PUHLQ ) 6WHIIHQ0XQVEHUJ , 6]HOHU 0 3XUJ < .XONDUQL DQG 6 & / .DPHUOLQ &$'(( &RPSXWHU$LGHG 'LUHFWHG (YROXWLRQ RI (Q]\PHV ,8&U-   ± -DQXDU\  >@ * .LQJ DQG $ :DUVKHO $ 6XUIDFH &RQVWUDLQHG $OO$WRP 6ROYHQW 0RGHO IRU (IIHFWLYH 6LPXODWLRQV RI 3RODU 6ROXWLRQV 7KH -RXUQDO RI &KHPLFDO 3K\VLFV  ± 6HSWHPEHU  >@ 7 3 6HQIWOH 6 +RQJ 0 0 ,VODP 6 % .\ODVD < =KHQJ < . 6KLQ & -XQNHUPHLHU 5 (QJHO+HUEHUW 0 - -DQLN + 0 $NWXOJD 7 9HUVWUDHOHQ $ *UDPD DQG $ & 7 YDQ 'XLQ 7KH 5HD[)) UHDFWLYH IRUFHILHOG GHYHORSPHQW DSSOLFDWLRQV DQG IXWXUH GLUHFWLRQV 1SM &RPSXWDWLRQDO 0DWHULDOV  0DUFK  >@ 5 : =ZDQ]LJ +LJK7HPSHUDWXUH (TXDWLRQ RI 6WDWH E\ D 3HUWXUEDWLRQ 0HWKRG , 1RQSRODU *DVHV 7KH -RXUQDO RI &KHPLFDO 3K\VLFV    ± $XJXVW  >@ : / -RUJHQVHQ )UHH (QHUJ\ &DOFXODWLRQV $ %UHDNWKURXJK IRU 0RGHOLQJ 2UJDQLF &KHPLVWU\ LQ 6ROXWLRQ $FFRXQWV RI &KHPLFDO 5HVHDUFK    ±  >@ 3 .ROOPDQ )UHH (QHUJ\ &DOFXODWLRQV $SSOLFDWLRQV WR &KHPLFDO DQG %LRFKHPLFDO 3KHQRPHQD &KHPLFDO UHYLHZV   ±  >@ : ) YDQ *XQVWHUHQ DQG + - & %HUHQGVHQ &RPSXWHU 6LPXODWLRQ RI 0ROHFXODU '\QDPLFV 0HWKRGRORJ\ $SSOLFDWLRQV DQG 3HUVSHFWLYHV LQ &KHPLVWU\ $QJHZDQGWH &KHPLH ,QWHUQDWLRQDO (GLWLRQ LQ (QJOLVK   

 ±  >@ $ :DUVKHO DQG 5 0 :HLVV $Q (PSLULFDO 9DOHQFH %RQG $SSURDFK IRU &RPSDULQJ 5HDFWLRQV LQ 6ROXWLRQV DQG LQ (Q]\PHV -RXUQDO RI WKH $PHULFDQ &KHPLFDO 6RFLHW\   ±  >@ $ :DUVKHO &RPSXWHU 0RGHOLQJ RI &KHPLFDO 5HDFWLRQV LQ (Q]\PHV DQG 6ROXWLRQV :LOH\ 1RYHPEHU  ,6%1  *RRJOH%RRNV,' V+Z$$$$0$$- >@ ( 5RVWD 0 .OlKQ DQG $ :DUVKHO 7RZDUGV $FFXUDWH $E ,QLWLR 4000 &DOFXODWLRQV RI )UHH(QHUJ\ 3URILOHV RI (Q]\PDWLF 5HDFWLRQV 7KH -RXUQDO RI 3K\VLFDO &KHPLVWU\ %   ± )HEUXDU\  >@ $ 6KXUNL ( 'HUDW $ %DUUR]R DQG 6 & / .DPHUOLQ +RZ 9DOHQFH %RQG 7KHRU\ &DQ +HOS @ 0 + 0 2OVVRQ * +RQJ DQG $ :DUVKHO )UR]HQ 'HQVLW\ )XQFWLRQDO )UHH (QHUJ\ 6LPXODWLRQV RI 5HGR[ 3URWHLQV &RPSXWDWLRQDO 6WXGLHV RI WKH 5HGXFWLRQ 3RWHQWLDO RI 3ODVWRF\DQLQ DQG 5XVWLF\DQLQ -RXUQDO RI WKH $PHULFDQ &KHPLFDO 6RFLHW\   ± $SULO  >@ < ;LDQJ DQG $ :DUVKHO 4XDQWLI\LQJ )UHH (QHUJ\ 3URILOHV RI 3URWRQ 7UDQVIHU 5HDFWLRQV LQ 6ROXWLRQ DQG 3URWHLQV E\ 8VLQJ D 'LDEDWLF )')7 0DSSLQJ 7KH -RXUQDO RI 3K\VLFDO &KHPLVWU\ %   ± -DQXDU\  >@ + /LX DQG $ :DUVKHO 7KH &DWDO\WLF (IIHFW RI 'LK\GURIRODWH 5HGXFWDVH DQG ,WV 0XWDQWV ,V 'HWHUPLQHG E\ 5HRUJDQL]DWLRQ (QHUJLHV %LRFKHPLVWU\    ± 0D\  >@ 6 & / .DPHUOLQ DQG $ :DUVKHO 7KH (9% DV D TXDQWLWDWLYH WRRO IRU IRUPXODWLQJ VLPXODWLRQV DQG DQDO\]LQJ ELRORJLFDO DQG FKHPLFDO UHDFWLRQV )DUDGD\ 'LVFXVVLRQV   ±  >@ 0 % 3HWHUV < @ & $QGUHLQL , %HUWLQL * &DYDOODUR * / +ROOLGD\ DQG - 0 7KRUQWRQ 0HWDO ,RQV LQ %LRORJLFDO &DWDO\VLV )URP (Q]\PH 'DWDEDVHV WR *HQHUDO 3ULQFLSOHV -%,& -RXUQDO RI %LRORJLFDO ,QRUJDQLF &KHPLVWU\    ± 1RYHPEHU  >@ 9 *XDOODU DQG ) + :DOOUDSS 4000 0HWKRGV /RRNLQJ ,QVLGH +HPH 3URWHLQV %LRFKHPLVW\ %LRSK\VLFDO &KHPLVWU\   ± -XQH  >@ ; /L 6 $ +D\LN DQG . 0 0HU] 4000 ;UD\ 5HILQHPHQW RI =LQF 0HWDOORHQ]\PHV -RXUQDO RI ,QRUJDQLF %LRFKHPLVWU\   ± 0D\  >@ 3 ( 0 6LHJEDKQ 7KH 3HUIRUPDQFH RI +\EULG ')7 IRU 0HFKDQLVPV ,QYROYLQJ 7UDQVLWLRQ 0HWDO &RPSOH[HV LQ (Q]\PHV -%,& -RXUQDO RI %LRORJLFDO ,QRUJDQLF &KHPLVWU\   ± 6HSWHPEHU  >@ - 1 +DUYH\ 2Q WKH $FFXUDF\ RI 'HQVLW\ )XQFWLRQDO 7KHRU\ LQ 7UDQVLWLRQ 0HWDO &KHPLVWU\ $QQXDO 5HSRUWV 6HFWLRQ ´&´ 3K\VLFDO &KHPLVWU\    >@ 0 6ZDUW 6SLQ 6WDWHV RI %LR ,QRUJDQLF 6\VWHPV 6XFFHVVHV DQG 3LWIDOOV ,QWHUQDWLRQDO -RXUQDO RI 4XDQWXP &KHPLVWU\   ± -DQXDU\ 

 >@ 3 2HOVFKODHJHU 0 .ODKQ : $ %HDUG 6 + :LOVRQ DQG $ :DUVKHO 0DJQHVLXPFDWLRQLF 'XPP\ $WRP 0ROHFXOHV (QKDQFH 5HSUHVHQWDWLRQ RI '1$ 3RO\PHUDVH β LQ 0ROHFXODU '\QDPLFV 6LPXODWLRQV ,PSURYHG $FFXUDF\ LQ 6WXGLHV RI 6WUXFWXUDO )HDWXUHV DQG 0XWDWLRQDO (IIHFWV -RXUQDO RI 0ROHFXODU %LRORJ\   ± )HEUXDU\  >@ ) /LQ DQG 5 :DQJ 6\VWHPDWLF 'HULYDWLRQ RI $0%(5 )RUFH )LHOG 3DUDPHWHUV $SSOLFDEOH WR =LQF&RQWDLQLQJ 6\VWHPV -RXUQDO RI &KHPLFDO 7KHRU\ DQG &RPSXWDWLRQ   ± -XQH  >@ - cTYLVW DQG $ :DUVKHO )UHH (QHUJ\ 5HODWLRQVKLSV LQ 0HWDOORHQ]\PH&DWDO\]HG 5HDFWLRQV &DOFXODWLRQV RI WKH (IIHFWV RI 0HWDO ,RQ 6XEVWLWXWLRQV LQ 6WDSK\ORFRFFDO 1XFOHDVH -RXUQDO RI WKH $PHULFDQ &KHPLFDO 6RFLHW\   ±  >@ <3 3DQJ . 8 1 ;X - (/ <$=$/ DQG ) * 3UHQGHUJDVW 6XFFHVVIXO 0ROHFXODU '\QDPLFV 6LPXODWLRQ RI WKH =LQF%RXQG )DUQHV\OWUDQVIHUDVH 8VLQJ WKH &DWLRQLF 'XPP\ $WRP $SSURDFK 3URWHLQ 6FLHQFH   ±  >@ ) 'XDUWH 3 %DXHU $ %DUUR]R % $ $PUHLQ 0 3XUJ - cTYLVW DQG 6&/ .DPHUOLQ )RUFH )LHOG ,QGHSHQGHQW 0HWDO 3DUDPHWHUV 8VLQJ D 1RQERQGHG 'XPP\ 0RGHO 7KH -RXUQDO RI 3K\VLFDO &KHPLVWU\ %   ± $SULO  >@ 5 0 1R\HV 7KHUPRG\QDPLFV RI ,RQ +\GUDWLRQ DV D 0HDVXUH RI (IIHFWLYH 'LHOHFWULF 3URSHUWLHV RI :DWHU -RXUQDO RI WKH $PHULFDQ &KHPLFDO 6RFLHW\  ± )HEUXDU\  >@ , 3HUVVRQ +\GUDWHG 0HWDO ,RQV LQ $TXHRXV 6ROXWLRQ +RZ 5HJXODU $UH 7KHLU 6WUXFWXUHV" 3XUH DQG $SSOLHG &KHPLVWU\    -DQXDU\  >@ 6 6HOOLQ DQG % 0DQQHUYLN 0HWDO GLVVRFLDWLRQ FRQVWDQWV IRU JO\R[DODVH , UHFRQVWLWXWHG ZLWK =Q &R 0Q DQG 0J -RXUQDO RI %LRORJLFDO &KHPLVWU\   ± 6HSWHPEHU  >@ ' %ODKD1HOVRQ ' 0 .UJHU . 6]HOHU 0 %HQ'DYLG DQG 6 & / .DPHUOLQ $FWLYH 6LWH +\GURSKRELFLW\ DQG WKH &RQYHUJHQW (YROXWLRQ RI 3DUDR[RQDVH $FWLYLW\ LQ 6WUXFWXUDOO\ 'LYHUJHQW (Q]\PHV 7KH &DVH RI 6HUXP 3DUDR[RQDVH  -RXUQDO RI WKH $PHULFDQ &KHPLFDO 6RFLHW\   ± -DQXDU\  >@ 0 3XUJ $ 3DELV ) %DLHU 1 7RNXULNL & -DFNVRQ DQG 6 & / .DPHUOLQ 3URELQJ WKH 0HFKDQLVPV IRU WKH 6HOHFWLYLW\ DQG 3URPLVFXLW\ RI 0HWK\O 3DUDWKLRQ +\GURODVH 3KLORVRSKLFDO 7UDQVDFWLRQV RI WKH 5R\DO 6RFLHW\ $ 0DWKHPDWLFDO 3K\VLFDO DQG (QJLQHHULQJ 6FLHQFHV    1RYHPEHU  >@ $ %DUUR]R ) 'XDUWH 3 %DXHU $ 7 3 &DUYDOKR DQG 6 & / .DPHUOLQ &RRSHUDWLYH (OHFWURVWDWLF ,QWHUDFWLRQV 'ULYH )XQFWLRQDO (YROXWLRQ LQ WKH $ONDOLQH 3KRVSKDWDVH 6XSHUIDPLO\ -RXUQDO RI WKH $PHULFDQ &KHPLFDO 6RFLHW\   ± -XO\  >@ & :DOOLQ < 6 .XONDUQL $ $EHOHLQ - -DUYHW 4 /LDR % 6WURGHO / 2OVVRQ - /XR - 3 $EUDKDPV 6 % 6KROWV 3 0 5RRV 6 & .DPHUOLQ $ *UlVOXQG DQG 6 . :lUPOlQGHU &KDUDFWHUL]DWLRQ RI 0Q ,, ,RQ %LQGLQJ WR WKH $P\ORLGβ 3HSWLGH LQ $O]KHLPHU¶V 'LVHDVH -RXUQDO RI 7UDFH (OHPHQWV LQ 0HGLFLQH DQG %LRORJ\ ± 'HFHPEHU 

 >@ 4 /LDR 6 & / .DPHUOLQ DQG % 6WURGHO 'HYHORSPHQW DQG $SSOLFDWLRQ RI D 1RQERQGHG &X  0RGHO 7KDW ,QFOXGHV WKH -DKQ±7HOOHU (IIHFW 7KH -RXUQDO RI 3K\VLFDO &KHPLVWU\ /HWWHUV   ± -XO\  >@ < -LDQJ + =KDQJ : )HQJ DQG 7 7DQ 5HILQHG 'XPP\ $WRP 0RGHO RI 0J E\ 6LPSOH 3DUDPHWHU 6FUHHQLQJ 6WUDWHJ\ ZLWK 5HYLVHG ([SHULPHQWDO 6ROYDWLRQ )UHH (QHUJ\ -RXUQDO RI &KHPLFDO ,QIRUPDWLRQ DQG 0RGHOLQJ  ± 'HFHPEHU  >@ < -LDQJ + =KDQJ DQG 7 7DQ 5DWLRQDO 'HVLJQ RI 0HWKRGRORJ\,QGHSHQGHQW 0HWDO 3DUDPHWHUV 8VLQJ D 1RQERQGHG 'XPP\ 0RGHO -RXUQDO RI &KHPLFDO 7KHRU\ DQG &RPSXWDWLRQ   ± -XO\  >@ 3 /L DQG . 0 0HU] 0HWDO ,RQ 0RGHOLQJ 8VLQJ &ODVVLFDO 0HFKDQLFV &KHPLFDO 5HYLHZV -DQXDU\  >@ . 6WHLQHU DQG + 6FKZDE 5HFHQW $GYDQFHV LQ 5DWLRQDO $SSURDFKHV IRU (Q]\PH (QJLQHHULQJ &RPSXWDWLRQDO DQG 6WUXFWXUDO %LRWHFKQRORJ\ -RXUQDO  ± 6HSWHPEHU  >@ 0 . 7LZDUL 5 6LQJK 5 . 6LQJK ,: .LP DQG -. /HH &RPSXWDWLRQDO $SSURDFKHV IRU 5DWLRQDO 'HVLJQ RI 3URWHLQV ZLWK 1RYHO )XQFWLRQDOLWLHV &RPSXWDWLRQDO DQG 6WUXFWXUDO %LRWHFKQRORJ\ -RXUQDO   ± 6HSWHPEHU  >@ & -lFNHO 3 .DVW DQG ' +LOYHUW 3URWHLQ 'HVLJQ E\ 'LUHFWHG (YROXWLRQ $QQXDO 5HYLHZ RI %LRSK\VLFV   ± -XQH  >@ $ &XUULQ 1 6ZDLQVWRQ 3 - 'D\ DQG ' % .HOO 6\QWKHWLF %LRORJ\ IRU WKH 'LUHFWHG (YROXWLRQ RI 3URWHLQ %LRFDWDO\VWV 1DYLJDWLQJ 6HTXHQFH 6SDFH ,QWHOOLJHQWO\ &KHP 6RF 5HY   ±  >@ ) + $UQROG DQG $ $ 9RONRY 'LUHFWHG (YROXWLRQ RI %LRFDWDO\VWV &XUUHQW 2SLQLRQ LQ &KHPLFDO %LRORJ\   ± )HEUXDU\  >@ 0 6 3DFNHU DQG ' 5 /LX 0HWKRGV IRU WKH 'LUHFWHG (YROXWLRQ RI 3URWHLQV 1DWXUH 5HYLHZV *HQHWLFV   ± -XQH  >@ 5 - )R[ 6 & 'DYLV ( & 0XQGRUII / 0 1HZPDQ 9 *DYULORYLF 6 . 0D / 0 &KXQJ & &KLQJ 6 7DP 6 0XOH\ - *UDWH - *UXEHU - & :KLWPDQ 5 $ 6KHOGRQ DQG * : +XLVPDQ ,PSURYLQJ &DWDO\WLF )XQFWLRQ E\ 3UR6$5'ULYHQ (Q]\PH (YROXWLRQ 1DWXUH %LRWHFKQRORJ\   ± 0DUFK  >@ - %HQGO - 6WRXUDF ( 6HEHVWRYD 2 9DYUD 0 0XVLO - %UH]RYVN\ DQG - 'DPERUVN\ +RW6SRW :L]DUG  $XWRPDWHG 'HVLJQ RI 6LWH6SHFLILF 0XWDWLRQV DQG 6PDUW /LEUDULHV LQ 3URWHLQ (QJLQHHULQJ 1XFOHLF $FLGV 5HVHDUFK  : :±: -XO\  >@ 5 9HUPD 8 6FKZDQHEHUJ DQG ' 5RFFDWDQR &RPSXWHU$LGHG 3URWHLQ 'LUHFWHG (YROXWLRQ $ 5HYLHZ RI :HE 6HUYHUV 'DWDEDVHV DQG 2WKHU &RPSXWDWLRQDO 7RROV IRU 3URWHLQ (QJLQHHULQJ &RPSXWDWLRQDO DQG 6WUXFWXUDO %LRWHFKQRORJ\ -RXUQDO   ± 6HSWHPEHU  >@ 0 3 )UXVKLFKHYD - &DR = 7 &KX DQG $ :DUVKHO ([SORULQJ &KDOOHQJHV LQ 5DWLRQDO (Q]\PH 'HVLJQ E\ 6LPXODWLQJ WKH &DWDO\VLV LQ $UWLILFLDO .HPS (OLPLQDVH 3URFHHGLQJV RI WKH 1DWLRQDO $FDGHP\ RI 6FLHQFHV    ±  >@ 0 )X[UHLWHU &RPSXWDWLRQDO $SSURDFKHV WR 3URWHLQ '\QDPLFV )URP 4XDQWXP WR &RDUVH*UDLQHG 0HWKRGV &5& 3UHVV 'HFHPEHU  ,6%1

  *RRJOH%RRNV,' %G(T%J$$4%$- >@ 0 5RFD $ 9DUGL.LOVKWDLQ DQG $ :DUVKHO 7RZDUG $FFXUDWH 6FUHHQLQJ LQ &RPSXWHU$LGHG (Q]\PH 'HVLJQ %LRFKHPLVWU\   ± $SULO  >@ $ :DUVKHO 3 . 6KDUPD 0 .DWR < ;LDQJ + /LX DQG 0 + 0 2OVVRQ (OHFWURVWDWLF %DVLV IRU (Q]\PH &DWDO\VLV &KHPLFDO 5HYLHZV    ± $XJXVW  >@ 6 & / .DPHUOLQ 3 . 6KDUPD = 7 &KX DQG $ :DUVKHO .HWRVWHURLG ,VRPHUDVH 3URYLGHV )XUWKHU 6XSSRUW IRU WKH ,GHD 7KDW (Q]\PHV :RUN E\ (OHFWURVWDWLF 3UHRUJDQL]DWLRQ 3URFHHGLQJV RI WKH 1DWLRQDO $FDGHP\ RI 6FLHQFHV   ± 0DUFK  >@ - /XR % YDQ /RR DQG 6 & .DPHUOLQ &DWDO\WLF 3URPLVFXLW\ LQ 3VHXGRPRQDV $HUXJLQRVD $U\OVXOIDWDVH DV DQ ([DPSOH RI &KHPLVWU\'ULYHQ 3URWHLQ (YROXWLRQ )(%6 /HWWHUV   ± -XQH  >@ $ %DUUR]R ) 'XDUWH 3 %DXHU $ 7 3 &DUYDOKR DQG 6 & / .DPHUOLQ &RRSHUDWLYH (OHFWURVWDWLF ,QWHUDFWLRQV 'ULYH )XQFWLRQDO (YROXWLRQ LQ WKH $ONDOLQH 3KRVSKDWDVH 6XSHUIDPLO\ -RXUQDO RI WKH $PHULFDQ &KHPLFDO 6RFLHW\   ± -XO\  >@ 3 %DXHU c - &DUOVVRQ % $ $PUHLQ ' 'REULW]VFK 0 :LGHUVWHQ DQG 6 & / .DPHUOLQ &RQIRUPDWLRQDO 'LYHUVLW\ DQG (QDQWLRFRQYHUJHQFH LQ 3RWDWR (SR[LGH +\GURODVH  2UJ %LRPRO &KHP   ±  >@ 6 & /RYHOO - 0 :RUG - 6 5LFKDUGVRQ DQG ' & 5LFKDUGVRQ 7KH 3HQXOWLPDWH 5RWDPHU /LEUDU\ 3URWHLQV 6WUXFWXUH )XQFWLRQ DQG %LRLQIRUPDWLFV   ±  >@ $ ' 6FRXUDV DQG 9 'DJJHWW 7KH '\QDPHRPLFV 5RWDPHU /LEUDU\ $PLQR $FLG 6LGH &KDLQ &RQIRUPDWLRQV DQG '\QDPLFV IURP &RPSUHKHQVLYH 0ROHFXODU '\QDPLFV 6LPXODWLRQV LQ :DWHU 3URWHLQ 6FLHQFH   ± )HEUXDU\  >@ 5 /RQVGDOH 6 +R\OH ' 7 *UH\ / 5LGGHU DQG $ - 0XOKROODQG 'HWHUPLQDQWV RI 5HDFWLYLW\ DQG 6HOHFWLYLW\ LQ 6ROXEOH (SR[LGH +\GURODVH IURP 4XDQWXP 0HFKDQLFV0ROHFXODU 0HFKDQLFV 0RGHOLQJ %LRFKHPLVWU\    ± )HEUXDU\  >@ 1 /HQIDQW 7 +RWHOLHU ( 9HOOXHW < %RXUQH 3 0DUFKRW DQG $ &KDWRQQHW (67+(5 WKH 'DWDEDVH RI WKH DOSKDEHWD+\GURODVH )ROG 6XSHUIDPLO\ RI 3URWHLQV 7RROV WR ([SORUH 'LYHUVLW\ RI )XQFWLRQV 1XFOHLF $FLGV 5HVHDUFK ' '±' -DQXDU\ 

 Acta Universitatis Upsaliensis Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Science and Technology 1484 Editor: The Dean of the Faculty of Science and Technology

A doctoral dissertation from the Faculty of Science and Technology, Uppsala University, is usually a summary of a number of papers. A few copies of the complete dissertation are kept at major Swedish research libraries, while the summary alone is distributed internationally through the series Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Science and Technology. (Prior to January, 2005, the series was published under the title “Comprehensive Summaries of Uppsala Dissertations from the Faculty of Science and Technology”.)

ACTA UNIVERSITATIS UPSALIENSIS Distribution: publications.uu.se UPPSALA urn:nbn:se:uu:diva-314686 2017