<<

BLUE WATERS ANNUAL REPORT 2019

DI

ATOMISTIC SIMULATION OF A PROTOCELL Figure 1: Depicted is a primordial -membrane GA structure that compartmentalizes primitive cells, Allocation: Illinois/994 Knh often referred to as protocells, from their PI: Emad Tajkhorshid1 environment. Cells are encapsulated TN by semipermeable membranes that 1University of Illinois at Urbana–Champaign regulate the flux of material into and out of the in a concerted manner regulated by a diversity of . EXECUTIVE SUMMARY METHODS & CODES Computational has been critical in achieving a molec- The MD simulations were performed with NAMD [9], a high- ular understanding of the dynamics and of proteins, nu- ly parallelized, GPU-accelerated, publicly available MD program cleic acids, and biological membranes. Most research in this field with demonstrated scalability to hundreds of thousands of proces- has been limited to single-protein characterization owing to lim- sors for both single- and multiple-replica simulations. All- its in computational power and analytical methods. However, to MD simulations rely on the accurate integration of the equations truly understand the mechanics of biological systems, proteins of motion for all of the system. The total potential ener- need to be studied in the broader context of a cell. This project gy of the system was described by the CHARMM36m force field aims to spearhead a new generation of re- [10,11]. Periodic boundary conditions were used to avoid surface search that seeks insight from molecular dynamics (MD) simu- effects at the simulated system’s boundary, allowing the efficient lations to study cellular-scale processes in atomic detail. The im- computation of nontruncated electrostatic interactions by the mensely complex nature of a cell belies the elementary physics fast Fourier transform-based particle-mesh Ewald method [12]. that govern it, and by using MD simulations a rigorous connec- tion can be made between the underlying physical laws and the RESULTS & IMPACT emergent phenomena they generate. By pioneering new, large The research team designed, developed, and tested a protocol molecular simulations, researchers can begin to gain insight into for the construction of spherical cellular envelopes of any lip- and to quantify mesoscale biological phenomena. id/protein composition, which can be divided into three stages. The first step in the designed workflow was to use experimen- RESEARCH CHALLENGE tally derived or user-specified cell diameters to directly deter- The living cell represents a spatially complex and highly regu- mine the surface area of the inner and outer leaflet of the cell en- lated arrangement of whose coordinated motions and velope and to accurately calculate the number of needed activities underlie all the processes that allow it to grow, repro- to construct both leaflets. The membrane shell was then gener- duce, and carry out a wide spectrum of cellular functions. Recent ated using the Fibonacci sphere algorithm [13] to approximate- advances in experimental techniques have targeted the identifi- ly solve the Tammes problem [14,15] for the thousands of lipids cation of the positions of subcellular , ribosomes, and required to create a micrometer-scaled membranous protocell. such as proteins, messenger RNA, and DNA at Next, each protein was assigned a unique position on the spher- high resolution [1]. This has in turn fed a growing demand for ical surface. In order to do this, coarse-bead approximations of next-generation computational tools that allow researchers to each protein were generated using a hierarchical clustering algo- construct realistic, cellular-scale structural models at a range of rithm. These were then simulated using a force field that properly resolutions; to resolve molecular functional states; and to simu- oriented and localized them to the intended membrane surface. late their stochastic, time-dependent behavior. Lastly, the membrane and the proteins were merged, which in- Biological membranes constitute the most fundamental com- volved removal of requisite lipids from both the inner- and out- ponent of cellular and subcellular structures, defining the active er-membrane leaflets to ensure conservation of surface area and brane and proteins were merged together and solvated, and the only be achieved on a petascale computing platform such as Blue boundaries not only between the cell and its environment but avoid clashes between and protein molecules. final system was used for MD simulations. Waters. The GPU accelerators on Blue Waters confer a significant also between different compartments within the cell. Diverse in The number of lipids that each protein displaced was careful- The simulations performed on Blue Waters for this project in- boost to simulation speed (two to three times) versus the non- form, structure, and lipid composition, they are highly hetero- ly derived from conventional (smaller) protein-embedded lipid formed areas of refinement for the research team’s assembly pro- accelerated CPU-only nodes. NAMD has been extensively test- geneous environments containing lipids and proteins that act in bilayer simulations and dictated how many lipids were removed tocol, particularly for the removal of lipids during the incorpora- ed and optimized for Blue Waters, making use of the high-speed concert to regulate the flow of information and materials into and from the spherical membrane. Lipids were hierarchically elim- tion of proteins and solvation procedures. The final simulations node interconnect and showing sustained petascale performance out of the cell [2,3]. Modeling membranes at cellular scales is an inated based on their degree of impingement into the intended indicated the constructed protocell systems are stable and that the across hundreds, even thousands, of nodes. The team’s bench- emerging challenge that remains to be addressed and, therefore, protein volume and any remaining potential protein–lipid clash- assembly protocols are adequate. The project presents the most marks demonstrated an efficient scaling performance (greater is one of the critical steps in modeling the cell. The research team es were resolved via a brief grid-steered simulation of the mem- carefully crafted and stable protocell at full atomic detail (Fig. 1). than 80%) while using up to 2,048 Blue Waters GPU (XK7) nodes has a long tradition in modeling and simulating membranes and brane that forces any offending lipids from the protein space. This for a one-billion-atom system. membrane proteins, particularly emphasizing the importance procedure ensured that the exact number of lipids matching the WHY BLUE WATERS of including the natural environment in the simulations [4–8]. cross-sectional area of the protein (calculated independently for For this project, the researchers performed MD simulations with the inner and outer leaflets) were removed. Finally, the mem- NAMD of systems measuring one billion atoms in size, which can

308 309