bioRxiv preprint doi: https://doi.org/10.1101/2020.07.23.218917; this version posted July 24, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Protein sequence design by explicit energy landscape optimization 1,2,# 1,2,# 1,2,5 6 1,2 1,2 Christoffer Norn , Basile I. M. Wicky , David Juergens , Sirui Liu , David Kim , Brian Koepnick , Ivan 1,2 7 1,2,4,* 3,6,* Anishchenko , Foldit Players , David Baker , Sergey Ovchinnikov 1 Department of Biochemistry, University of Washington, Seattle, WA 98105 2 Institute for Protein Design, University of Washington, Seattle, WA 98105 3 John Harvard Distinguished Science Fellowship Program, Harvard University, Cambridge, MA 02138 4 Howard Hughes Medical Institute, University of Washington, Seattle, WA 98105 5 Molecular Engineering and Sciences Institute, University of Washington, Seattle, WA 98105 6 FAS Division of Science, Harvard University, Cambridge, MA 02138 7 A list of participants will appear in the supplementary information #These authors contributed equally. *Co-corresponding authors:
[email protected],
[email protected] Abstract The protein design problem is to identify an amino acid sequence which folds to a desired structure. Given Anfinsen’s thermodynamic hypothesis of folding, this can be recast as finding an amino acid sequence for which the lowest