Beg 2021 CSE Using Jupyter.Pdf (3.511Mb)

Total Page:16

File Type:pdf, Size:1020Kb

Beg 2021 CSE Using Jupyter.Pdf (3.511Mb) This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/MCSE.2021.3052101, Computing in Science & Engineering Using Jupyter for reproducible scientific workflows Marijan Beg Faculty of Engineering and Physical Sciences, University of Southampton, University Road, SO17 1BJ Southampton, United Kingdom Juliette Belin Logilab, 104 Boulevard Auguste Blanqui, 75013 Paris, France Thomas Kluyver European XFEL GmbH, Holzkoppel 4, 22869 Schenefeld, Germany Alexander Konovalov School of Computer Science, University of St Andrews, Jack Cole Building, North Haugh, KY16 9SX St Andrews, United Kingdom Min Ragan-Kelley Simula Research Laboratory, Martin Linges vei 25, 1364 Fornebu, Norway Nicolas M. Thiery´ Laboratoire de Recherche en Informatique, Universite´ Paris-Saclay, CNRS, 91405 Orsay, France Hans Fangohr Max Planck Institute for the Structure and Dynamics of Matter, Luruper Chaussee 149, 22761 Hamburg, Germany European XFEL GmbH, Holzkoppel 4, 22869 Schenefeld, Germany Faculty of Engineering and Physical Sciences, University of Southampton, University Road, SO17 1BJ Southampton, United Kingdom Abstract—Literate computing has emerged as an important tool for computational studies and open science, with growing folklore of best practices. In this work, we report two case studies – one in computational magnetism and another in computational mathematics – where a dedicated software was exposed into the Jupyter environment. This enabled interactive and batch computational exploration of data, simulations, data analysis, and workflow documentation and outcome in Jupyter notebooks. In the first study, Ubermag drives existing computational micromagnetics software through a domain-specific language embedded in Python. In the second study, a dedicated Jupyter kernel interfaces with the GAP system for computational discrete algebra and its dedicated programming language. In light of these case studies, we discuss the benefits of this approach, including progress towards more reproducible and re-usable research results and outputs, notably through the use of infrastructure such as JupyterHub and Binder. INTRODUCTION conclusions. For a publication to be scientifically Research usually results in a publication that valid, it must present the methodology rigorously, presents and shares the obtained findings and so that readers can follow the “recipe” and Published by the IEEE Computer Society © IEEE 1 1521-9615 (c) 2020 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information. Authorized licensed use limited to: ST ANDREWS UNIVERSITY. Downloaded on February 04,2021 at 14:51:24 UTC from IEEE Xplore. Restrictions apply. This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/MCSE.2021.3052101, Computing in Science & Engineering reproduce the results. If this criterion is met, In the last decade, literate computing has the publication is considered reproducible. Re- emerged as an important tool for computational producible publications are more easily re-usable studies and open science, with an ever-growing and thus provide a significant opportunity to make set of best practices. In this paper, we review and (often tax-payer funded) research more impactful. expand some of these best practices in the context However, the reproducibility of computational of two case studies: computational magnetism work is usually hindered not only by a lack of and mathematics. This is based on the experience data or meta-data but also by a lack of details on of enabling and applying Jupyter environments the procedure and tools used: in these fields as a part of the OpenDreamKit (https://opendreamkit.org/) project. 1) The source code of the software used is not To be able to run computational studies from available. the Jupyter environment, it is necessary to either 2) Information on the computing environment, have the simulation and/or analysis code exposed such as the hardware, operating system, to a general-purpose programming language sup- supporting libraries, and (if required) code ported by Jupyter, or have a dedicated Jupyter compilation details is not revealed. kernel for the computational libraries. Although 3) The exact procedure which led to the results the main topic of this work is the overview of reported in the publication is not shared. features and capabilities of the Jupyter environ- This should include the set of parameters ment for reproducible workflows, we begin by used, the simulation and data analysis pro- discussing how a computational library can be cedure, and any additional data cleaning, exposed to Jupyter as a necessary prerequisite. processing, and visualization. Ideally, these are shared as open-source code and analy- sis scripts used to perform the simulation Prerequisite: Exposing computational libraries to the Jupyter environment and to read, analyze, and visualize the re- Computational studies often use existing com- sulting data. This way, the entire process putational (legacy) tools. These could be executa- can be repeated by re-running simulation bles called from the command line or libraries and/or analysis scripts. A human-readable that are used within a programming language. For document detailing the computational steps the approach suggested here, these computational taken, despite being “better-than-nothing”, tools need to be accessible to scientists from is still insufficient to ensure reproducibility, a general-purpose programming language sup- and keeping a detailed log of all steps taken ported by Jupyter (such as Python). For some do- during a computational study is often impos- mains, such as pure mathematics research, there sible. are domain-specific languages with enough power Reproducibility is a challenging question and to be used directly as the programming language spans a range of different topics. In this work, we in notebooks (e.g., Singular and GAP). In other focus on one of them. We describe the features areas, exposing computational tools to a general- and capabilities of the Jupyter environment that, purpose programming language is the key to inte- in our view, make it a highly productive environ- grating them into researchers’ custom code. A key ment for computational science and mathematics, benefit of making computational tools available while facilitating reproducibility. in a general-purpose programming language is The topic of bitwise reproducibility is outside that the computation can be driven flexibly using the scope of this work: even with the same the control structures provided by that language. hardware and same software, it may be diffi- For example, a simulation can conveniently be cult to reproduce computational results to be repeated with a range of parameters through a for- bitwise identical. This can originate from the non- loop, rather than having to change a configuration associativity of floating-point operations com- file for each value and trigger execution of the bined with parallel execution or from compiler simulation manually. optimizations. Bitwise reproducibility is not al- Making the computational capability acces- ways required to be achieved. sible from a general-purpose programming lan- 2 1521-9615 (c) 2020 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information. Authorized licensed use limited to: ST ANDREWS UNIVERSITY. Downloaded on February 04,2021 at 14:51:24 UTC from IEEE Xplore. Restrictions apply. This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/MCSE.2021.3052101, Computing in Science & Engineering guage supported through a Jupyter kernel such and many more) are supported through other as Python may be trivial – for example, if the Jupyter Notebook kernels. In this work, we sug- required code is already a Python library. When gest using a Jupyter research environment from the computational functionality is locked into an which computational studies can be driven and executable, one can create an interface layer so conducted efficiently. In this section, we discuss that functionality can be accessed via a Python the benefits of using the Jupyter environment for function or class [1]: input parameters will then reproducible scientific workflows. be translated into configuration files, the exe- cutable called, outputs retrieved, and finally, the 1. One study – one document results returned. The notebook allows us to carry out an entire If the computational tool uses a programming study within a single notebook and provides a language that Jupyter does not support, another complete and executable record of the process. It possibility is to implement a Jupyter kernel for is possible to put the interpretation of the results that language so that the computational library into the same document, immediately below the can be exposed to the Jupyter environment (as graphical, tabular or text-based output that needs done for GAP and SageMath for example). to be described. The “one study – one document” Over time, scientific
Recommended publications
  • Data Quality Management in Large-Scale Cyber-Physical Systems
    Data Quality Management in Large-Scale Cyber-Physical Systems Ahmed Abdulhasan Alwan School of Architecture, Computing and Engineering University of East London A thesis presented for the degree of Doctor of Philosophy July 19, 2021 Abstract Cyber-Physical Systems (CPSs) are cross-domain, multi-model, advance informa- tion systems that play a significant role in many large-scale infrastructure sectors of smart cities public services such as traffic control, smart transportation control, and environmental and noise monitoring systems. Such systems, typically, involve a substantial number of sensor nodes and other devices that stream and exchange data in real-time and usually are deployed in uncontrolled, broad environments. Thus, unexpected measurements may occur due to several internal and external factors, including noise, communication errors, and hardware failures, which may compromise these systems quality of data and raise serious concerns related to safety, reliability, performance, and security. In all cases, these unexpected measurements need to be carefully interpreted and managed based on domain knowledge and computational models. Therefore, in this research, data quality challenges were investigated, and a com- prehensive, proof of concept, data quality management system was developed to tackle unaddressed data quality challenges in large-scale CPSs. The data quality management system was designed to address data quality challenges associated with detecting: sensor nodes measurement errors, sensor nodes hardware failures, and mismatches in sensor nodes spatial and temporal contextual attributes. De- tecting sensor nodes measurement errors associated with the primary data quality dimensions of accuracy, timeliness, completeness, and consistency in large-scale CPSs were investigated using predictive and anomaly analysis models via utilising statistical and machine-learning techniques.
    [Show full text]
  • PDF Download
    Table of Contents About ICSEC 2019 ................................................................................................................... I Message from President of Prince of Songkla University .................................................. III Message from Rector of University North, Croatia ............................................................ IV Message from General Chair ................................................................................................ VI Keynote Speakers ................................................................................................................. VII Organizing Committee .......................................................................................................... XI Conference Venue ............................................................................................................... XIII Program at a Glance ........................................................................................................... XVI Technical Sessions Main Track ............................................................................................................................. 1 Special Session on Advanced Digital Media ....................................................................... 33 Special Session on Future SDN: Security, Virtualization, Systems and Architectures ....... 35 Author Index .......................................................................................................................... 36 List of Reviewers
    [Show full text]
  • Cyphyhouse: a Programming, Simulation, and Deployment Toolchain for Heterogeneous Distributed Coordination
    2020 IEEE International Conference on Robotics and Automation (ICRA) 31 May - 31 August, 2020. Paris, France CyPhyHouse: A programming, simulation, and deployment toolchain for heterogeneous distributed coordination Ritwika Ghosh1, Joao P. Jansch-Porto2, Chiao Hsieh1, Amelia Gosse2, Minghao Jiang3, Hebron Taylor2, Peter Du3, Sayan Mitra3, Geir Dullerud2 Abstract— Programming languages, libraries, and develop- With the aim of simplifying application development for ment tools have transformed the application development distributed and heterogeneous systems, we introduce CyPhy- processes for mobile computing and machine learning. This House1—an open source software toolchain for program- paper introduces CyPhyHouse—a toolchain that aims to provide similar programming, debugging, and deployment benefits for ming, simulating, and deploying mobile robotic applications. distributed mobile robotic applications. Users can develop In this work, we target distributed coordination applica- hardware-agnostic, distributed applications using the high-level, tions such as collaborative mapping [8], surveillance, de- event driven Koord programming language, without requiring livery, formation-flight, etc. with aerial drones and ground expertise in controller design or distributed network protocols. vehicles. We believe that for these applications, low-level The modular, platform-independent middleware of CyPhyHouse implements these functionalities using standard algorithms for motion control for the individual robots is standard but path planning (RRT), control (MPC), mutual exclusion, etc. tedious, and coordination across distributed (and possibly A high-fidelity, scalable, multi-threaded simulator for Koord heterogeneous) robots is particularly difficult and error- applications is developed to simulate the same application code prone. This motivates the two key abstractions provided for dozens of heterogeneous agents. The same compiled code by CyPhyHouse: (a) portability of high-level coordination can also be deployed on heterogeneous mobile platforms.
    [Show full text]
  • Behavioral Types in Programming Languages
    Behavioral Types in Programming Languages Behavioral Types in Programming Languages iv Davide Ancona, DIBRIS, Università di Genova, Italy Viviana Bono, Dipartimento di Informatica, Università di Torino, Italy Mario Bravetti, Università di Bologna, Italy / INRIA, France Joana Campos, LaSIGE, Faculdade de Ciências, Universidade de Lisboa, Portugal Giuseppe Castagna, CNRS, IRIF, Univ Paris Diderot, Sorbonne Paris Cité, Paris, France Pierre-Malo Deniélou, Royal Holloway, University of London, UK Simon J. Gay, School of Computing Science, University of Glasgow, UK Nils Gesbert, Université Grenoble Alpes, France Elena Giachino, Università di Bologna, Italy / INRIA, France Raymond Hu, Department of Computing, Imperial College London, UK Einar Broch Johnsen, Institutt for informatikk, Universitetet i Oslo, Norway Francisco Martins, LaSIGE, Faculdade de Ciências, Universidade de Lisboa, Portugal Viviana Mascardi, DIBRIS, Università di Genova, Italy Fabrizio Montesi, University of Southern Denmark Rumyana Neykova, Department of Computing, Imperial College London, UK Nicholas Ng, Department of Computing, Imperial College London, UK Luca Padovani, Dipartimento di Informatica, Università di Torino, Italy Vasco T. Vasconcelos, LaSIGE, Faculdade de Ciências, Universidade de Lisboa, Portugal Nobuko Yoshida, Department of Computing, Imperial College London, UK Boston — Delft Foundations and Trends R in Programming Languages Published, sold and distributed by: now Publishers Inc. PO Box 1024 Hanover, MA 02339 United States Tel. +1-781-985-4510 www.nowpublishers.com [email protected] Outside North America: now Publishers Inc. PO Box 179 2600 AD Delft The Netherlands Tel. +31-6-51115274 The preferred citation for this publication is D. Ancona et al.. Behavioral Types in Programming Languages. Foundations and Trends R in Programming Languages, vol. 3, no.
    [Show full text]
  • Insight MFR By
    Manufacturers, Publishers and Suppliers by Product Category 11/6/2017 10/100 Hubs & Switches ASCEND COMMUNICATIONS CIS SECURE COMPUTING INC DIGIUM GEAR HEAD 1 TRIPPLITE ASUS Cisco Press D‐LINK SYSTEMS GEFEN 1VISION SOFTWARE ATEN TECHNOLOGY CISCO SYSTEMS DUALCOMM TECHNOLOGY, INC. GEIST 3COM ATLAS SOUND CLEAR CUBE DYCONN GEOVISION INC. 4XEM CORP. ATLONA CLEARSOUNDS DYNEX PRODUCTS GIGAFAST 8E6 TECHNOLOGIES ATTO TECHNOLOGY CNET TECHNOLOGY EATON GIGAMON SYSTEMS LLC AAXEON TECHNOLOGIES LLC. AUDIOCODES, INC. CODE GREEN NETWORKS E‐CORPORATEGIFTS.COM, INC. GLOBAL MARKETING ACCELL AUDIOVOX CODI INC EDGECORE GOLDENRAM ACCELLION AVAYA COMMAND COMMUNICATIONS EDITSHARE LLC GREAT BAY SOFTWARE INC. ACER AMERICA AVENVIEW CORP COMMUNICATION DEVICES INC. EMC GRIFFIN TECHNOLOGY ACTI CORPORATION AVOCENT COMNET ENDACE USA H3C Technology ADAPTEC AVOCENT‐EMERSON COMPELLENT ENGENIUS HALL RESEARCH ADC KENTROX AVTECH CORPORATION COMPREHENSIVE CABLE ENTERASYS NETWORKS HAVIS SHIELD ADC TELECOMMUNICATIONS AXIOM MEMORY COMPU‐CALL, INC EPIPHAN SYSTEMS HAWKING TECHNOLOGY ADDERTECHNOLOGY AXIS COMMUNICATIONS COMPUTER LAB EQUINOX SYSTEMS HERITAGE TRAVELWARE ADD‐ON COMPUTER PERIPHERALS AZIO CORPORATION COMPUTERLINKS ETHERNET DIRECT HEWLETT PACKARD ENTERPRISE ADDON STORE B & B ELECTRONICS COMTROL ETHERWAN HIKVISION DIGITAL TECHNOLOGY CO. LT ADESSO BELDEN CONNECTGEAR EVANS CONSOLES HITACHI ADTRAN BELKIN COMPONENTS CONNECTPRO EVGA.COM HITACHI DATA SYSTEMS ADVANTECH AUTOMATION CORP. BIDUL & CO CONSTANT TECHNOLOGIES INC Exablaze HOO TOO INC AEROHIVE NETWORKS BLACK BOX COOL GEAR EXACQ TECHNOLOGIES INC HP AJA VIDEO SYSTEMS BLACKMAGIC DESIGN USA CP TECHNOLOGIES EXFO INC HP INC ALCATEL BLADE NETWORK TECHNOLOGIES CPS EXTREME NETWORKS HUAWEI ALCATEL LUCENT BLONDER TONGUE LABORATORIES CREATIVE LABS EXTRON HUAWEI SYMANTEC TECHNOLOGIES ALLIED TELESIS BLUE COAT SYSTEMS CRESTRON ELECTRONICS F5 NETWORKS IBM ALLOY COMPUTER PRODUCTS LLC BOSCH SECURITY CTC UNION TECHNOLOGIES CO FELLOWES ICOMTECH INC ALTINEX, INC.
    [Show full text]
  • Products Newsletter Emergiing Trends, Cuttiing-Edge Technollogiies, and Research Breakthroughs Iin Power Ellectroniics
    Products Newsletter Emergiing Trends, Cuttiing-Edge Technollogiies, and Research Breakthroughs iin Power Ellectroniics ​ ​ ​ November 20, 2020 | Issue 7 IEEE Power Electronics Magazine To empower next-generation engineers with hands-on skills in control and power electronics, Qing- Chang Zhong, Yeqin Wang, Yiting Dong, Beibei Ren, and Mohammad Amin have developed a versatile experimental tool that lowers the barriers to go real from simulations to experiments for various power electronic systems. Their Smart Grid Research and Educational Kit, which is a reconfigurable, open-source, multifunctional power electronic converter with the capability of directly downloading codes from MATLAB/Simulink. Besides minimizing the time, cost, and efforts needed to develop hardware systems, it removes the burden of coding. Read more in the September 2020 issue of IEEE Power Electronics Magazine! IEEE Transactions on Power Electronics (TPEL) The December 2020 Issue presents 86 papers with the latest research in power electronics! December Highlighted Papers: On Beat Frequency Oscillation of Two-Stage Wireless Power Receivers Kerui Li, Siew-Chong Tan, and Ron Shu Yuen Hui Novel iGSE-C Loss Modeling of X7R Ceramic Capacitors David Menzi, Dominik Bortis, Grayson Zulauf, Morris Heller, and Johann W. Kolar December Papers with Active/Multimedia Content Analysis and Design of the LLC LED Driver Based on State-Space Representation Direct Time- Domain Solution by Maikel Menke, João P. Duranti, Leandro Roggia, Fábio E. Bisogno, Rodrigo V. Tambara, Álysson R. Seidel provides a PowerPoint Presentation elucidating each step development of the proposed design procedure during the design of the LLC LED driver. In addition, the authors offer three Wolfram Mathematica notebook script files.
    [Show full text]
  • Homomorphic Encryption: Working and Analytical Assessment DGHV, Helib, Paillier, FHEW and HE in Cloud Security
    Master of Science in Computer Science February 2017 Homomorphic Encryption: Working and Analytical Assessment DGHV, HElib, Paillier, FHEW and HE in cloud security Srinivas Divya Papisetty Faculty of Computing Blekinge Institute of Technology SE-371 79 Karlskrona Sweden This thesis is submitted to the Faculty of Computing at Blekinge Institute of Technology in partial fulfillment of the requirements for the degree of Master of Science in Computer Science. The thesis is equivalent to 20 weeks of full time studies. Contact Information: Author(s): Srinivas Divya Papisetty E-mail: [email protected] University advisor: Dr. Emiliano Casalicchio Dept. of Computer Science & Engineering Faculty of Computing Internet : www.bth.se Blekinge Institute of Technology Phone : +46 455 38 50 00 SE-371 79 Karlskrona, Sweden Fax : +46 455 38 50 57 i i ABSTRACT Context. Secrecy has kept researchers spanning over centuries engaged in the creation of data protection techniques. With the growing rate of data breach and intervention of adversaries in confidential data storage and communication, efficient data protection has found to be a challenge. Homomorphic encryption is one such data protection technique in the cryptographic domain which can perform arbitrary computations on the enciphered data without disclosing the original plaintext or message. The first working fully homomorphic encryption scheme was proposed in the year 2009 and since then there has been a tremendous increase in the development of homomorphic encryption schemes such that they can be applied to a wide range of data services that demand security. All homomorphic encryption schemes can be categorized as partially homomorphic (PHE), somewhat homomorphic (SHE), leveled homomorphic (LHE), and fully homomorphic encryption (FHE).
    [Show full text]
  • How Machine Learning Has Been Applied in Software Engineering?
    How Machine Learning Has Been Applied in Software Engineering? Olimar Teixeira Borges a, Julia Colleoni Couto b, Duncan Dubugras A. Ruiz c and Rafael Prikladnicki1 d School of Technology, PUCRS, Porto Alegre, Brazil Keywords: Software Engineering, Machine Learning, Mapping Study. Abstract: Machine Learning (ML) environments are composed of a set of techniques and tools, which can help in solving problems in a diversity of areas, including Software Engineering (SE). However, due to a large number of possible configurations, it is a challenge to select the ML environment to be used for a specific SE domain issue. Helping software engineers choose the most suitable ML environment according to their needs would be very helpful. For instance, it is possible to automate software tests using ML models, where the model learns software behavior and predicts possible problems in the code. In this paper, we present a mapping study that categorizes the ML techniques and tools reported as useful to solve SE domain issues. We found that the most used algorithm is Na¨ıve Bayes and that WEKA is the tool most SE researchers use to perform ML experiments related to SE. We also identified that most papers use ML to solve problems related to SE quality. We propose a categorization of the ML techniques and tools that are applied in SE problem solving, linking with the Software Engineering Body of Knowledge (SWEBOK) knowledge areas. 1 INTRODUCTION automated to reduce human effort and project cost. However, to automate these tasks using ML, we need Machine Learning (ML) is an Artificial Intelligence to have the SE project’s data to explore.
    [Show full text]
  • Failure of the Point Blinding Countermeasure Against Fault Attack in Pairing-Based Cryptography Nadia El Mrabet, Emmanuel Fouotsa
    Failure of the Point Blinding Countermeasure Against Fault Attack in Pairing-Based Cryptography Nadia El Mrabet, Emmanuel Fouotsa To cite this version: Nadia El Mrabet, Emmanuel Fouotsa. Failure of the Point Blinding Countermeasure Against Fault Attack in Pairing-Based Cryptography. 2015, 10.1007/978-3-319-18681-8_21. hal-01197148 HAL Id: hal-01197148 https://hal.archives-ouvertes.fr/hal-01197148 Submitted on 11 Sep 2015 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Failure of the Point Blinding Countermeasure against Fault Attack in Pairing-Based Cryptography ? Nadia EL MRABET1 and Emmanuel FOUOTSA2 (1) LIASD − Universit´eParis 8 -France - [email protected] (1) SAS - CMP Gardanne - France - [email protected] (2) Dep of Mathematics, Higher Teacher's Training College, University of Bamenda - Cameroun (2) LMNO − Universit´ede Caen - France - [email protected] Abstract. Pairings are mathematical tools that have been proven to be very useful in the con- struction of many cryptographic protocols. Some of these protocols are suitable for implementation on power constrained devices such as smart cards or smartphone which are subject to side channel attacks. In this paper, we analyse the efficiency of the point blinding countermeasure in pairing based cryptography against side channel attacks.
    [Show full text]
  • Comparison of Three Common Statistical Programs Available to Washington State County Assessors: SAS, SPSS and NCSS
    Washington State Department of Revenue Comparison of Three Common Statistical Programs Available to Washington State County Assessors: SAS, SPSS and NCSS February 2008 Abstract: This summary compares three common statistical software packages available to county assessors in Washington State. This includes SAS, SPSS and NCSS. The majority of the summary is formatted in tables which allow the reader to more easily make comparisons. Information was collected from various sources and in some cases includes opinions on software performance, features and their strengths and weaknesses. This summary was written for Department of Revenue employees and county assessors to compare statistical software packages used by county assessors and as a result should not be used as a general comparison of the software packages. Information not included in this summary includes the support infrastructure, some detailed features (that are not described in this summary) and types of users. Contents General software information.............................................................................................. 3 Statistics and Procedures Components................................................................................ 8 Cost Estimates ................................................................................................................... 10 Other Statistics Software ................................................................................................... 13 General software information Information in this section was
    [Show full text]
  • STATA January 1993 TECHNICAL STB-11 BULLETIN a Publication to Promote Communication Among Stata Users
    STATA January 1993 TECHNICAL STB-11 BULLETIN A publication to promote communication among Stata users Editor Associate Editors Joseph Hilbe J. Theodore Anagnoson, Cal. State Univ., LA Department of Sociology Richard DeLeon, San Francisco State Univ. Arizona State University Paul Geiger, USC School of Medicine Tempe, Arizona 85287 Lawrence C. Hamilton, Univ. of New Hampshire 602-860-1446 FAX Stewart West, Baylor College of Medicine [email protected] EMAIL Subscriptions are available from Stata Corporation, email [email protected], telephone 979-696-4600 or 800-STATAPC, fax 979-696-4601. Current subscription prices are posted at www.stata.com/bookstore/stb.html. Previous Issues are available individually from StataCorp. See www.stata.com/bookstore/stbj.html for details. Submissions to the STB, including submissions to the supporting files (programs, datasets, and help files), are on a nonexclusive, free-use basis. In particular, the author grants to StataCorp the nonexclusive right to copyright and distribute the material in accordance with the Copyright Statement below. The author also grants to StataCorp the right to freely use the ideas, including communication of the ideas to other parties, even if the material is never published in the STB. Submissions should be addressed to the Editor. Submission guidelines can be obtained from either the editor or StataCorp. Copyright Statement. The Stata Technical Bulletin (STB) and the contents of the supporting files (programs, datasets, and help files) are copyright c by StataCorp. The contents of the supporting files (programs, datasets, and help files), may be copied or reproduced by any means whatsoever, in whole or in part, as long as any copy or reproduction includes attribution to both (1) the author and (2) the STB.
    [Show full text]
  • Reproducible Research in Computational Harmonic Analysis
    R EPRODUCIBLE RESEARCH Reproducible Research in Computational Harmonic Analysis Scientific computation is emerging as absolutely central to the scientific method, but the prevalence of very relaxed practices is leading to a credibility crisis. Reproducible computational research, in which all details of computations—code and data—are made conveniently available to others, is a necessary response to this crisis. The authors review their approach to reproducible research and describe how it has evolved over time, discussing the arguments for and against working reproducibly. assive computation is transform- academic scientist likely has more in common ing science, as researchers from with a large corporation’s information technol- numerous fields launch ambitious ogy manager than with a philosophy or English projects involving large-scale com- professor at the same university. Mputations. Emblems of our age include A rapid transition is now under way—visible particularly over the past two decades—that will • data mining for subtle patterns in vast data- finish with computation as absolutely central to bases; and scientific enterprise. However, the transition is • massive simulations of a physical system’s com- very far from completion. In fact, we believe that plete evolution repeated numerous times, as the dominant mode of scientific computing has al- simulation parameters vary systematically. ready brought us to a state of crisis. The prevalence of very relaxed attitudes about communicating ex- The traditional image of the scientist as a soli- perimental details and validating results is causing tary person working in a laboratory with beakers a large and growing credibility gap. It’s impossible and test tubes is long obsolete.
    [Show full text]