Interactive Visualization of Molecular Dynamics Simulation Data
Total Page:16
File Type:pdf, Size:1020Kb
Copyright: The author, Naif Ghazi Alharbi, 2020. Interactive Visualization of Molecular Dynamics Simulation Data Naif Ghazi Alharbi Submitted to Swansea University in fulfilment of the requirements for the Degree of Doctor of Philosophy Department of Computer Science Swansea University April 26, 2020 Declaration This work has not been previously accepted in substance for any degree and is not being con- currently submitted in candidature for any degree. Signed ................. ..................... (candidate) Date ............................................................ Statement 1 This thesis is the result of my own investigations, except where otherwise stated. Other sources are acknowledged by footnotes giving explicit references. A bibliography is appended. Signed ........................ ................ (candidate) Date ............................................................ Statement 2 I hereby give my consent for my thesis, if accepted, to be available for photocopying and for inter-library loan, and for the title and summary to be made available to outside organisations. Signed .................. .................... (candidate) Date ............................................................ Abstract Molecular Dynamics Simulations (MD) plays an essential role in the field of computational biology. The simulations produce extensive high-dimensional, spatio-temporal data describ- ing the motion of atoms and molecules. A central challenge in the field is the extraction and visualization of useful behavioral patterns from these simulations. Throughout this thesis, I collaborated with a computational biologist who works on Molecular Dynamics (MD) Simu- lation data. For the sake of exploration, I was provided with a large and complex membrane simulation. I contributed solutions to his data challenges by developing a set of novel visual- ization tools to help him get a better understanding of his simulation data. I employed both scientific and information visualization, and applied concepts of abstraction and dimensions projection in the proposed solutions. The first solution enables the user to interactively fil- ter and highlight dynamic and complex trajectory constituted by motions of molecules. The molecular dynamic trajectories are identified based on path length, edge length, curvature, and normalized curvature, and their combinations. The tool exploits new interactive visual- ization techniques and provides a combination of 2D-3D path rendering in a dual dimension representation to highlight differences arising from the 2D projection on a plane. The sec- ond solution introduces a novel abstract interaction space for Protein-Lipid interaction. The proposed solution addresses the challenge of visualizing complex, time-dependent interactions between protein and lipid molecules. It also proposes a fast GPU-based implementation that maps lipid-constituents involved in the interaction onto the abstract protein interaction space. I also introduced two abstract level-of-detail (LoD) representations with six levels of detail for lipid molecules and protein interaction. Finally, I proposed a novel framework consisting of four linked views: A time-dependent 3D view, a novel hybrid view, a clustering timeline, and a details-on-demand window. The framework exploits abstraction and projection to enable the user to study the molecular interaction and the behavior of the protein-protein interaction iii and clusters. I introduced a selection of visual designs to convey the behavior of protein-lipid interaction and protein-protein interaction through a unified coordinate system. Abstraction is used to present proteins in hybrid 2D space, and a projected tiled space is used to present both Protein-Lipid Interaction (PLI) and Protein-Protein Interaction (PPI) at the particle level in a heat-map style visual design. Glyphs are used to represent PPI at the molecular level. I coupled visually separable visual designs in a unified coordinate space. The result lets the user study both PLI and PPI separately, or together in a unified visual analysis framework. Acknowledgements Undertaking this Ph.D. has been a truly life-changing experience for me, and it would not have been possible without the support and guidance that I received from many people. Firstly, I would like to thank The Almighty God for His help and guidance, and for choosing me to be supervised by Dr. Robert S. Laramee. I believe that being supervised by Dr. Laramee is one of the bounties of Almighty God. I would like to give a very big thank-you to my supervisor Dr. Laramee for all the support and encouragement he gave me during my Ph.D. He has been so generous with his time and efforts—supporting above and beyond what is typical of a Ph.D. supervisor. Without his guidance and constant feedback, this Ph.D. would not have been achievable. I would also like to thank my second supervisor Dr. Adeline Paiement, who was always ready to answer my questions and provide me with advice regarding my research topic. Many thanks also to Dr. Matthieu Chavent for his support and guidance throughout our projects. I would also like to thank Mohammed Alharbi, Xavier Martinez, Alexander Rose, and Marc Baaden for their help and valuable feedback. A special thanks to Michael Krone for his continuous feedback. I would like to thank the Ministry of Education of Saudi Arabia and the Saudi Cultural Bureau in London for their financial support. The community of Ph.D. students in our research group has been such a valuable support network. In particular, I would also like to thank Dr. Sean Walton, Dr. Richard Roberts, Liam McNabb, Dylan Rees, Dave Greten, and Rhodri Fabbro who have proofread my work throughout the project. An additional thanks to Dr. Joss Whittle for his advice and guidance on GPU techniques. Lastly, I would like to express my deepest gratitude to my family for their endless support throughout my life. I am incredibly thankful for my mother, wife, and children for supporting me spiritually throughout writing this thesis and being supportive in all that I do. v List of Publications This thesis is based on the following publications: 1. Naif Alharbi, Robert S Laramee, and Matthieu Chavant, MolePathFinder: Interactive Multi-Dimensional Path Filtering of Molecular Dynamics Simulation Data, The Com- puter Graphics and Visual Computing (CGVC) Conference 2016, pages 9-16, 15-16 September 2016, Bournemouth University, UK 2. Naif Alharbi, Mohammed Alharbi, Xavier Martinez, Michael Krone, Alexander Rose, Marc Baaden, Robert S. Laramee, Matthieu Chavent, Molecular Visualization of Com- putational Biology Data: A Survey of Surveys in EuroVis Short Papers 2017 , pages 133-137 3. Naif Alharbi, Micheal Krone, Matthieu Chavant, Robert S Laramee, VAPLI: Novel Vi- sual Abstraction for Protein-Lipid Interactions, IEEE SciVis Short Papers, IEEE VIS 2018, 21-26 October 2018, Berlin, Germany 4. Naif Alharbi, Micheal Krone, Matthieu Chavant, Robert S Laramee, LoD LPI: Level of Detail for Visualizing Time-Dependent, Protein-Lipid Interaction, International Con- ference on Information Visualization and Applications (IVAPP) 2019, 25-27 February 2019, Prague, Czech Republic 5. Naif Alharbi, Micheal Krone, Matthieu Chavant, Robert S Laramee, Hybrid Visualiza- tion of Protein-Lipid and Protein-Protein Interaction, the Eurographics Workshop on Visual Computing in Biology and Medicine (VCBM) 2019, 4-6 September 2019, Brno, Czech Republic Supplementary Material Each chapter includes a supplementary material. I highly recommend that the reader views these videos before reading their corresponding chapter—they are designed to provide an overview of the software features and demonstrate their utility. These videos and the source code are available online here: Chapter Material type URL Chapter 2 Video https://github.com/NaifAlhar6i/Chapter2 Chapter 3 Video https://github.com/NaifAlhar6i/Chapter3 Chapter 4 Video https://github.com/NaifAlhar6i/Chapter4 Chapter 5 Video https://github.com/NaifAlhar6i/Chapter5 Chapter 6 Video https://github.com/NaifAlhar6i/Chapter6 Chapter 7 Source Code https://github.com/NaifAlhar6i/Chapter7 Table of Contents Table of Contents 1 List of Tables 4 List of Figures 5 1 Introduction and Motivation 15 1.1 Molecular Dynamics Simulation ......................... 18 1.2 Interactive Visualization ............................. 19 1.3 Thesis Pipeline .................................. 20 1.4 Thesis Overview and Contributions ........................ 21 2 Literature Review 28 2.1 Introduction and Motivation ........................... 29 2.2 Survey of Molecular Dynamics Visualization Surveys . 30 2.2.1 From Text to Information: Meta-analysis of the Reviews . 33 2.3 Survey of Molecular Dynamics Visualization Articles . 36 2.3.1 Challenges ................................ 37 2.3.2 Survey Scope ............................... 38 2.3.3 Literature Search Methodology ..................... 38 2.3.4 Molecular Dynamic Visualization .................... 39 2.3.5 Visualization of Protein Interaction and Clusters . 58 2.4 Conclusion .................................... 63 3 Interactive Multi-Dimensional Path Filtering of Molecular Dynamics Simula- tion Data 64 1 Table of Contents 3.1 Introduction and Motivation ........................... 65 3.2 Background .................................... 66 3.3 Visualization of MD Data with MolPathFinder . 67 3.3.1 Visualization Techniques and Data Representation . 69 3.4 Experimental Results ............................... 77 3.5 Conclusion ...................................