Reproducible Model Development in the Cardiac Electrophysiology Web Lab

Reproducible Model Development in the Cardiac Electrophysiology Web Lab

bioRxiv preprint doi: https://doi.org/10.1101/257683; this version posted January 31, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. Reproducible model development in the Cardiac Electrophysiology Web Lab Aidan C. Dalya, Michael Clerxa, Kylie A. Beattiea, Jonathan Cooperb, David J. Gavaghana,∗, Gary R. Miramsc,∗ aComputational Biology, Department of Computer Science, University of Oxford, Oxford, UK bResearch IT Services, University College London, London, UK cCentre for Mathematical Medicine & Biology, School of Mathematical Sciences, University of Nottingham, Nottingham, UK Abstract The modelling of the electrophysiology of cardiac cells is one of the most mature areas of systems biology. This extended concentration of research effort brings with it new challenges, foremost among which is that of choosing which of these models is most suitable for addressing a particular scientific question. In a previous paper, we presented our initial work in developing an online resource for the characterisation and comparison of electrophysiological cell models in a wide range of experimental scenarios. In that work, we described how we had developed a novel protocol language that allowed us to separate the details of the mathematical model (the majority of cardiac cell models take the form of ordinary differential equations) from the experimental protocol being simulated. We developed a fully-open online repository (which we termed the Cardiac Electrophysiology Web Lab) which allows users to store and compare the results of applying the same experimental protocol to competing models. In the current paper we describe the most recent and planned extensions of this work, focused on supporting the process of model building from experimental data. We outline the necessary work to develop a machine-readable language to describe the process of inferring parameters from wet lab datasets, and illustrate our approach through a detailed example of fitting a model of the hERG channel using experimental data. We conclude by discussing the future challenges in making further progress in this domain towards our goal of facilitating a fully reproducible approach to the development of cardiac cell models. ∗Corresponding authors: Email addresses: [email protected] (David J. Gavaghan), [email protected] (Gary R. Mirams) Preprint submitted to PBMB January 31, 2018 bioRxiv preprint doi: https://doi.org/10.1101/257683; this version posted January 31, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 1 1. Introduction 2 The problem of reproducibility in science is becoming well known and of great concern to re- 3 searchers, funders and publishers. We define reproduction as \independent re-implementation of 4 the essential aspects of a carefully described experiment," which we contrast with replication, 5 defined as \re-running a simulation in exactly the same way" (Howe, 2012; Cooper et al., 2015b). 6 Computational studies are immune from many of the statistical traps (Ioannidis, 2005) that tra- 7 ditionally lead to problems in reproducing studies. Yet, when asked \what factors contribute to 8 irreproducible research?" 45% of scientists replied that \methods and code being unavailable" 9 always/often contributed, and 40% replied that \raw data availability" always/often contributed. 10 For each of these factors, these figures rose to 80% for \sometimes" contributing to irreproducible 11 research (Baker, 2016). Our field of cardiac electrophysiology modelling is not immune to these 12 factors. This paper will discuss the issue of reproducible model development, which is distinct 13 from reproducing a given study undertaken with a model: reproducible model development im- 14 plies much more than simply knowing whether a computational modelling study can be repeated 15 or reimplemented to arrive at the same results. Reproducible model development is a require- 16 ment for new science too | all of the following uses of a previously published model will be 17 performed better if we know how a model was developed: (i) using an existing model within a 18 new simulation study/conditions, (ii) fitting an existing model to new datasets for new experi- 19 mental situations (e.g. cell types, species, temperatures); and (iii) extending an existing model 20 to explore new experimental phenomena. We will elaborate on why these applications require a 21 well documented and reproducible model development process below. 22 We focus this article on development of mathematical cardiac electrophysiology models. The first 23 model of the cardiac action potential was created by Denis Noble (Noble, 1960, 1962) for Purkinje 24 fibres, based on the Hodgkin and Huxley (1952) paradigm. The field of cardiac cell modelling 25 has blossomed into one of the most popular and mature areas of systems biology research. In 26 the ensuing decades, the scope and number of these models has increased dramatically, spurred 27 on by a desire to recreate and understand electrophysiological phenomena across a range of 28 species, cell types, and experimental conditions (Noble and Rudy, 2001; Noble et al., 2012). 29 This research has been undertaken by a diverse global research community, with members of 30 disparate institutions seeking to make improvements to existing model formulations through 31 iterative studies in modelling and experimentation. Despite the availability of community-focused 32 tools, we believe there remain problems with the ways modelling studies are reported that limit 33 the usefulness, reproducibility and re-usability of our models. These problems certainly exist in 34 many other domains, and many of the ideas in this manuscript should be directly transferable to 35 other differential equation-based biological models. Cardiac modelling is a good area to focus on 36 first because of its maturity, its standardisation in terms of a modular Hodgkin & Huxley-derived 37 approach, and its importance in scientific and clinical applications. 38 There are at least three aspects to replicability and reproducibility in computational models that 39 we would like to distinguish between: 40 1. Models | in order to facilitate cooperation among this physically dispersed research com- 41 munity, a number of tools have been developed to aid in the representation and exchange of 42 models. Provision of software code that states equations and parameter values in an unam- 2 bioRxiv preprint doi: https://doi.org/10.1101/257683; this version posted January 31, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 43 biguous format is an excellent and welcome step in model reporting. While providing code 44 allows replication, as a reporting practice on its own it limits our ability to apply models 45 to new systems without substantial alterations (Drummond, 2009), and reproducibility is 46 a more useful aim. So more generically and usefully, reproducibility is provided by model 47 markup languages such as CellML/SBML (Lloyd et al., 2004; Garny et al., 2008; Hucka 48 et al., 2003) and model repositories such as the Physiome Model Repository or BioModels 49 Database (Yu et al., 2011; Chelliah et al., 2015) that provide public and curated reference 50 versions of models. These repository versions of models can be used to auto-generate code 51 in many different programming languages that can provide reproduction. Open-source 52 software libraries such as Chaste (Mirams et al., 2013; Cooper et al., 2015a), OpenCOR 53 (Garny and Hunter, 2015), Myokit (Clerx et al., 2016), and COPASI (S¨utterlinet al., 2012) 54 have been developed to perform simulations on models specified in these formats. 55 2. Protocols | it is helpful to maintain a separation between the notions of a `model', the 56 underlying mathematical equations thought to represent the system, and a `protocol', the 57 manner in which the system is interrogated/stimulated and the results of that interrogation 58 are recorded. An unambiguous protocol describes how models are used to run certain 59 simulated experiments, for instance to generate figures or lead to conclusions in scientific 60 papers. This information can be provided by sharing simulation codes; or, again, more 61 reproducibly by providing detailed machine-readable instructions on how to run simulations 62 (such as those provided by SED-ML (Waltemath et al., 2011) or our Web Lab protocol 63 descriptions; more on these later). 64 3. Model development | reporting on models at present generally takes the form of their 65 final parameterised equations. Documentation on how models were built (in terms of 66 choosing equations to use, fitting their parameters to experimental data, and validation 67 against other experimental data) is usually insufficient to recreate models accurately, or 68 even missing entirely. Only with this information can you decide whether a model is going 69 to be useful for your setting, or whether you are breaking any assumptions involved in 70 creating it. In this paper we propose that we need a reproducible definition of model 71 development. 72 The academic cardiac modelling community's current standard is that the first point above | 73 replicable models | should generally always be met, at least for action potential models: nearly 74 all studies now provide code for replication, and most also provide CellML for reproducibility. 75 These efforts are certainly to be applauded and it has taken a large amount of effort, primarily 76 from the CellML team, to get to this point. Some authors provide their simulation codes to 77 enable replication of protocols and simulated experiments (e.g.

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    24 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us