Research Cloud Program Accelerating Science and Innovation

Total Page:16

File Type:pdf, Size:1020Kb

Research Cloud Program Accelerating Science and Innovation AWS Research Cloud Program Accelerating Science and Innovation Researcher’s Handbook (Version 2.0, Released 2017-10-01) Edited by Kevin Jorissen and Brendan Bouffler with contributions by the Research & Technical Computing Group Amazon Web Services © 2017, Amazon Web Services, Inc. or its affiliates. All rights reserved. Notices This document is provided for informational purposes only. It represents AWS’s current product offerings and practices as of the date of issue of this document, which are subject to change without notice. Users are responsible for making their own independent assessment of the information in this document and any use of AWS’s products or services, each of which is provided “as is” without warranty of any kind, whether express or implied. This document does not create any warranties, representations, contractual commitments, conditions or assurances from AWS, its affiliates, suppliers or licensors. The responsibilities and liabilities of AWS to its customers are controlled by AWS agreements. This handbook is not part of, and does not modify any agreement between AWS and its customers. This handbook may include a set of suggested solutions but should not be construed as a legally-binding offer from AWS. For current prices of AWS services, please refer to the AWS website at www.aws.amazon.com. AWS Research Cloud Program Accelerating Science and Innovation FOREWORD When I was asked to write the foreword for a book that explains how easy it is to use cloud computing for research, I jumped at the opportunity. Technology should never be an obstacle to researchers and their ability to gain new insights into our world. For the past 20 years, I’ve run several research departments at the Fraunhofer Institute for Industrial Mathematics (ITWM) as well as leading our IT department for more than 10 of those years. I always felt it obvious to look for solutions that make the life of a researcher developing and using simulation software easier - and to transform IT to support this. In 2001, I wrote a proposal called “I-LAB” (Internet-Lab), taking inspiration from the MOSAIC project that led to the first web browser. “The browser was the basis for the initial success of the internet, and with I-LAB, the internet should become a ´problem solving environment´, first for researchers, then for large companies and finally for everybody”, I wrote at the time. However, we understood then that the protocols and concepts of Grid Computing had been too complex to become a real success. At Fraunhofer, we started to use web services intensively to automate fault-tolerant execution, provisioning and large-scale parallelization. We started to develop what is today known as BeeGFS - a parallel file system – and we developed industry- grade applications based on asynchronous communication protocols. After Amazon Web Services launched Amazon S3 and Amazon EC2, we were ready to go, and within a few weeks, I ran several large parallel finance applications in the cloud directly from my laptop. I was even able to do this in real time – in fact, live during lectures I was giving on the subject! Since then AWS has become the most flexible and versatile web services company and has set the standards for cloud computing. This book is an easy to read, step-by-step guide to AWS and its services. It describes the application ecosystem that is helping researchers around the globe to provision software components and set up their research IT infrastructure with just a few mouse clicks. Today I head up the HPC Department at Fraunhofer ITWM where we develop applications that rely on highly specialized HPC platforms. We now use the cloud to run many of these applications at scale and today cloud and HPC coexist happily because for many problems, the limits of a supercomputer are where we begin our work. In most universities and research institutes, it is considered a waste of valuable resources to run early stage developments, simple farming jobs or weakly coupled applications on highly specialized systems when there is a readily available cloud nearby that can scale up to whatever extent your science requires. I’m still fascinated by the ongoing revolution that’s come with the availability of cloud infrastructures and what this means for research and startups. For those like me that are interested in this broader view, I still recommend picking up Nicolas Carr´s book from 2008, “The Big Switch”, where he describes the changes the use of cloud infrastructures will bring. Along with machine intelligence these topics will have a large impact on science and society. I recommend this current document to get answers on why all this is important to you as a researcher, as well as practical knowledge on how to get started. This is a book written not for the IT guy, but for the researcher who’s seeking a better way to get their work done. Dr. Franz-Josef Pfreundt Division Director, Fraunhofer Institute for Industrial Mathematics Kaiserslautern, 8 December 2016 Researcher’s Handbook – v2.0 – 2017-10-01 Page i AWS Research Cloud Program Accelerating Science and Innovation Contents 1 INTRODUCTION TO THE AWS RESEARCHER’S HANDBOOK ...................................... 1 1.1 WHAT IS CLOUD COMPUTING? ...................................................................................... 2 1.2 WHAT IS THE AWS CLOUD? .......................................................................................... 2 1.3 WHY AWS FOR SCIENCE AND RESEARCH? .................................................................... 6 1.4 AWS COMPUTE AND STORAGE ....................................................................................11 1.5 AWS MARKETPLACE ...................................................................................................14 1.6 SCIENCE AS A SERVICE................................................................................................15 1.7 SECURITY ...................................................................................................................16 1.8 DATA PRIVACY ............................................................................................................17 1.9 THE HANDBOOK AND YOU ............................................................................................17 2 SETTING UP YOUR AWS ACCOUNT ..............................................................................19 2.1 CREATING AN AWS ACCOUNT AND REGISTERING FOR RCP BENEFITS ...........................19 2.2 CREATING A NEW AWS ACCOUNT THROUGH A PARTNER PORTAL (INTERNET2, ARCUS, COMPAREX, SPARKLE) ......................................................................................................20 2.3 GETTING A NEW AWS ACCOUNT DIRECTLY WITH AWS ..................................................25 2.4 THE ROOT ACCOUNT ...................................................................................................27 2.5 CREATING IAM LOGINS ................................................................................................28 2.6 AWS ORGANIZATIONS .................................................................................................36 2.7 SETTING UP SSH ACCESS KEYS ..................................................................................36 3 BUDGETING .....................................................................................................................41 3.1 SETTING BUDGETS IN YOUR AWS ACCOUNT .................................................................41 3.2 OPTIONAL: THE BUDGET SAFETY SWITCH .....................................................................46 3.3 AMAZON EC2 LAUNCH LIMITS ......................................................................................56 3.4 NEXT STEPS ...............................................................................................................57 4 WORKING WITH DATA ....................................................................................................58 4.1 RANGE OF STORAGE TYPES .........................................................................................58 4.2 STORING YOUR RESEARCH DATA .................................................................................59 4.3 SHARING THE DATA IN YOUR S3 BUCKET. ......................................................................64 4.4 GLOBAL DATA EGRESS WAIVER FOR RESEARCH ...........................................................65 4.5 VERY LARGE DATA TRANSFERS TO S3...........................................................................65 4.6 DATA STREAM INGESTION ............................................................................................66 4.7 OPEN DATA MEANS MORE SCIENTIFIC IMPACT ...............................................................66 5 WORKING WITH SENSITIVE AND CONTROLLED-ACCESS DATA ..............................71 5.1 THE SHARED RESPONSIBILITY SECURITY MODEL ...........................................................71 5.2 MEETING SECURITY AND COMPLIANCE REQUIREMENTS WITH AWS QUICK STARTS ..........73 6 WORKING WITH COMPUTE ............................................................................................75 6.1 THE AMAZON ELASTIC COMPUTE CLOUD (AMAZON EC2) ..............................................75 6.2 AMAZON EC2 COMPUTE INSTANCE TYPES ....................................................................76 6.3 HOW ARE EC2 COMPUTE INSTANCES PRICED? ..............................................................77 6.4 OTHER COMPUTE SERVICES .........................................................................................82
Recommended publications
  • GAO-17-14 Accessible Version, OPEN INNOVATION: Practices to Engage
    United States Government Accountability Office Report to Congr essional Committees October 2016 OPEN INNOVATION Practices to Engage Citizens and Effectively Implement Federal Initiatives Accessible Version GAO-17-14 October 2016 OPEN INNOVATION Practices to Engage Citizens and Effectively Implement Federal Initiatives Highlights of GAO-17-14, a report to congressional committees Why GAO Did This Study What GAO Found To address the complex and Open innovation involves using various tools and approaches to harness the crosscutting challenges facing the ideas, expertise, and resources of those outside an organization to address an federal government, agencies need to issue or achieve specific goals. GAO found that federal agencies have frequently effectively engage and collaborate with used five open innovation strategies to collaborate with citizens and external those in the private, nonprofit, and stakeholders, and encourage their participation in agency initiatives. academic sectors, other levels of government, and citizens. Agencies Descriptions of Open Innovation Strategies Used by Federal Agencies are increasingly using open innovation strategies for these purposes. The GPRA Modernization Act of 2010 (GPRAMA) requires federal agencies to identify strategies and resources they will use to achieve their goals. GPRAMA also requires GAO to periodically review how implementation of its requirements is affecting agency performance. This report identifies and illustrates practices that help agencies effectively implement open innovation strategies, and how the use of those strategies has affected agency performance and opportunities for citizen engagement. GAO identified seven practices that agencies can use to effectively implement To identify these practices, GAO initiatives that involve the use of these strategies: analyzed relevant federal guidance and academic literature, and · Select the strategy appropriate for the purpose of engaging the public and interviewed open innovation experts.
    [Show full text]
  • An Open Ecosystem Engagement Strategy Through the Lens of Global Food Safety [Version 1; Peer Review: 2 Approved]
    F1000Research 2015, 4:129 Last updated: 27 SEP 2021 OPINION ARTICLE An open ecosystem engagement strategy through the lens of global food safety [version 1; peer review: 2 approved] Paul Stacey1, Garin Fons2, Theresa M Bernardo3 1Creative Commons, Mountain View, CA, 94042, USA 2Center for Open Educational Resources and Language Learning (COERLL), University of Texas at Austin, Austin, TX, 78712, USA 3College of Veterinary Medicine, Michigan State University, East Lansing, MI, 48824, USA v1 First published: 27 May 2015, 4:129 Open Peer Review https://doi.org/10.12688/f1000research.6123.1 Latest published: 27 May 2015, 4:129 https://doi.org/10.12688/f1000research.6123.1 Reviewer Status Invited Reviewers Abstract The Global Food Safety Partnership (GFSP) is a public/private 1 2 partnership established through the World Bank to improve food safety systems through a globally coordinated and locally-driven version 1 approach. This concept paper aims to establish a framework to help 27 May 2015 report report GFSP fully leverage the potential of open models. 1. Ted Hanss Jr, University of Michigan, Ann In preparing this paper the authors spoke to many different GFSP stakeholders who asked questions about open models such as: Arbor, USA • what is it? 2. Nikos Manouselis, Agro-Know, Athens, • what’s in it for me? Greece • why use an open rather than a proprietary model? Any reports and responses or comments on the • how will open models generate equivalent or greater article can be found at the end of the article. sustainable revenue streams compared to the current “traditional” approaches? This last question came up many times with assertions that traditional service providers need to see opportunity for equivalent or greater revenue dollars before they will buy-in.
    [Show full text]
  • OPEN INNOVATION Strategies the Federal Government Can Use to Gather Ideas and Expertise from the Public an OVERVIEW of GAO-17-14
    OPEN INNOVATION Strategies the federal government can use to gather ideas and expertise from the public AN OVERVIEW OF GAO-17-14 What is open innovation? How can they be the most effective? Open innovation uses activities and technologies to engage We have identified the following practices that facilitate the effective with citizens, organizations, and experts, and to harness the implementation of open innovation initiatives, and can serve as a ideas, expertise, and resources they offer. framework for evaluating projects. Federal agencies need to engage and collaborate DESIGN with all sectors of society. Online technologies What have enhanced their ability is the Knowing the primary purpose or purposes of an initiative can help agencies to make these connections. primary choose the most appropriate strategy purpose? or combination of strategies. What can it achieve? Agencies can use open innovation strategies to achieve Do we have what Consider agency’s capability to implement a strategy, including leadership one or more of the following high-level purposes: we need to support, and the availability of: make it Collect information and Enhance agency capacity. work? legal authority resources capacity perspectives. Build or expand community. Develop and test new ideas, Increase public awareness. solutions, or products. What will Set goals and This can help agencies ensure that their success determine how open innovation initiatives are What strategies are agencies using? look like? success will be clearly directed at meeting measured. agency needs and goals. Agencies have used one or more of these strategies: Crowdsourcing and Citizen Science Identifying and engaging external stake- It can also help provide additional: Who can holders and partner organizations can Agencies submit an open call for voluntary assistance from a large group of individ- help? staff advice uals to complete defined tasks, or assist with specific science-related tasks, which increase an agency’s can include collecting and analyzing data and interpreting and reporting results.
    [Show full text]
  • Exploring Precision FDA, an Online Platform for Crowdsourcing Genomics Jordan Paradise Loyola University Chicago, School of Law, [email protected]
    Loyola University Chicago, School of Law LAW eCommons Faculty Publications & Other Works 2018 Exploring Precision FDA, an Online Platform for Crowdsourcing Genomics Jordan Paradise Loyola University Chicago, School of Law, [email protected] Follow this and additional works at: https://lawecommons.luc.edu/facpubs Part of the Administrative Law Commons, and the Food and Drug Law Commons Recommended Citation Jordan Paradise, Exploring Precision FDA, an Online Platform for Crowdsourcing Genomics, 58 Jurimetrics J. 267-282 (2018) This Article is brought to you for free and open access by LAW eCommons. It has been accepted for inclusion in Faculty Publications & Other Works by an authorized administrator of LAW eCommons. For more information, please contact [email protected]. EXPLORING PRECISION FDA, AN ONLINE PLATFORM FOR CROWDSOURCING GENOMICS Jordan Paradise* ABSTRACT: The U.S. Food and Drug Administration has created an online platform for the next generation sequencing community, enabling users to evaluate biomarker in- formation and share resources. This article examines this online platform and offers sev- eral observations about potential legal and regulatory implications. CITATION: Jordan Paradise, Exploring PrecisionFDA, an Online Platform for CrowdsourcingGenomics, 58 JuRIMETRIcS J. 267-282 (2018). Precision medicine is here, with rapid advancements in the technologies, tools, and life-saving products entering the market for the treatment of serious and life-threatening disease. In May 2017, the United States Food and Drug Ad- ministration (FDA) approved the first cancer treatment for solid tumors based on a genetic biomarker rather than the tissue of origin.1 One month later, the agency approved a companion diagnostic panel that uses next generation se- quencing (NGS) to simultaneously screen a genetic sample for 23 cancer genes, three of which have FDA-approved therapies for non-small cell lung cancer.2 *Georgia Reithal Professor of Law, Loyola University Chicago School of Law.
    [Show full text]
  • The Future Role of HPC in Medical Product Decision Making
    The Future Role of HPC in Medical Product Decision Making LLNL-TR-690736 This document was prepared as an account of work sponsored by an agency of the United States government. Neither the United States government nor Lawrence Livermore National Security, LLC, nor any of their employees makes any warranty, expressed or implied, or assumes any legal liability or responsibility for the accuracy, completeness, or usefulness of any information, apparatus, product, or process disclosed, or represents that its use would not infringe privately owned rights. Reference herein to any specific commercial product, process, or service by trade name, trademark, manufacturer, or otherwise does not necessarily constitute or imply its endorsement, recommendation, or favoring by the United States government or Lawrence Livermore National Security, LLC. The views and opinions of authors expressed herein do not necessarily state or reflect those of the United States government or Lawrence Livermore National Security, LLC, and shall not be used for advertising or product endorsement purposes. This work performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344. 1 Summary Report on a Workshop on the Future Role of High Performance Computing in Medical Product Decision Making Abstract On September 10, 2015, Lawrence Livermore National Laboratory’s Center for Global Security Research1 and High Performance Computing Innovation Center2 hosted a workshop in Silver Spring, Maryland, on the “Future Role of High Performance Computing (HPC) in Medical Product Decision Making.” Stakeholders from academia, industry, national laboratories and government agencies attended and discussed the role of HPC in making regulatory decisions.
    [Show full text]
  • US Department of Health and Human Services Open
    U.S. Department of Health and Human Services Open Government Plan Version 4.0 September 2016 1 TABLE OF CONTENTS PAGE EXECUTIVE SUMMARY 1 OVERVIEW OF PROGRESS FROM VERSION 3.0 OF THE HHS OPEN GOVERNMENT PLAN ............................................................................................. 4 2 OVERVIEW OF HOW VERSION 4.0 OF THE OPEN GOVERNMENT PLAN WAS DEVELOPED ............................................................................................................. 5 3 TRANSPARENCY 3.1 ADMINISTRATIVE APPROACHES TO ENHANCE AVAILABILITY OF DATA.6 3.2 TRANSPARENCY INITIATIVES ..................................................................... 18 3.3 PROACTIVE DISCLOSURE ........................................................................... 24 3.4 PRIVACY ........................................................................................................ 27 3.5 WHISTLEBLOWER PROTECTION ................................................................ 28 3.6 RECORDS MANAGEMENT ........................................................................... 30 3.7 FREEDOM OF INFORMATION (FOIA) REQUESTS ...................................... 35 3.8 WEBSITES (DIGITAL SERVICES STRATEGY) ............................................. 35 3.9 PUBLIC NOTICE …………………………………………………………………….36 3.10 CONGRESSIONAL REQUESTS .................................................................... 39 3.11 DECLASSIFICATION ..................................................................................... 42 3.12 OPEN INNOVATION METHODS ...................................................................
    [Show full text]
  • BIT-2015-Agenda.Pdf
    Final Agenda Cover Cambridge Healthtech Institut e’s Fourteenth Annual Schedule-at-a-Glance APRIL 21 – 23, 2015 Plenary Sessions SEAPORT WORLD TRADE CENTER Awards CONFERENCE & EXPO ’15 BOSTON, MA Pre-Conference Workshops Enabling Technology. Leveraging Data. Transforming Medicine. IT Infrastructure – Hardware Software Development CONFERENCE TRACKS: PLENARY SESSION SPEAKERS: EVENT FEATURES: • Access All 12 Tracks for One Price Cloud Computing 1 IT Infrastructure – Hardware Philip E. Bourne, Ph.D. • Network with 3,000+ Global 2 Software Development Associate Director for Data Science Bioinformatics (ADDS), National Institutes of Health Attendees 3 Cloud Computing Next-Gen Sequencing Informatics • Hear 150+ Technology and Scientific Chris Sander, Ph.D. Presentations 4 Bioinformatics Computational and Systems Biology, Clinical & Translational Informatics Memorial Sloan Kettering • Attend Bio-IT World’s Best 5 Next-Gen Sequencing Informatics Practices Awards Data Visualization & Exploration Tools Cancer Center 6 Clinical & Translational Informatics • Connect with Attendees Using Pharmaceutical R&D Informatics Benjamin Heywood CHI’s Intro-Net 7 Data Visualization & Exploration Tools Co-Founder and President, • Participate in the Poster Clinical Genomics 8 Pharmaceutical R&D Informatics PatientsLikeMe, Inc. Competition Collaborations & Open Access Innovations 9 Clinical Genomics • Choose from 17 Pre-Conference Andreas Kogelnik, M.D., Ph.D. Workshops Cancer Informatics 10 Collaborations & Open Access Founder, Open Medicine Institute • See the
    [Show full text]
  • Developing Open Source Tools for Clinical Trials
    Developing Open Source Tools for Clinical Trials Jeremy Wildfire, Rho PhUSE Single Day Event, 12/1/2016 Chapel Hill, NC What is Open Source? The Open Source Initiative has a good definition: Generally, Open Source software is software that can be freely accessed, used, changed, and shared (in modified or unmodified form) by anyone. 7 Trends Towards Open Source (relevant to medical research in the last 10 years or so) 1. Mature Open Source Ecosystem 2. Data Science 3. Regulatory Guidance Updates 4. Reproducible Research 5. Slow (but accelerating) Industry Adoption 6. R can’t be ignored 7. Open Source and SAS Trend 1: Mature Open Source Ecosystem Link: State of the Octoverse (aka github) Example 1 – GitHub Guides Links: Rho GitHub & GitHub Guides Trend 2: Data Science Add Venn diagram Link: Battle of the Data Science Venn Diagrams Hopkins/Coursera Data Science Specialization Link: Coursera Data Science Specialization Link: State of the Octoverse (aka github) Trend 2.1: Interactive Data Visualization Links: Data Driven Documents - d3.js Rho Graphics Example 2 – qtlcharts (Karl Broman, UW-Madison) Link: r/qtlcharts Trend 3: Regulatory Guidance Updates In NIH's view, all data should be considered for data sharing. Data should be made as widely and freely available as possible while safeguarding the privacy of participants, and protecting confidential and proprietary data. Link: NIH Data Sharing Policy (2003) This policy also establishes a pilot program that requires agencies, when commissioning new custom software, to release at least 20 percent of new custom-developed code as Open Source Software (OSS) for three years, and collect additional data concerning new custom software to inform metrics to gauge the performance of this pilot.
    [Show full text]
  • Open Collaboration in the Public Sector : the Case of Social Coding on Github
    Erschienen in: Government Information Quarterly ; 32 (2015), 4. - S. 464-472 https://dx.doi.org/10.1016/j.giq.2015.09.004 Open collaboration in the public sector: The case of social coding on GitHub Ines Mergel Public Administration and International Affairs, Maxwell School of Citizenship and Public Affairs, Syracuse University, 215 Eggers Hall, Syracuse, NY 13244, United States abstract Open collaboration has evolved as a new form of innovation creation in the public sector. Government organiza tions are using online platforms to collaborative create or contribute to public sector innovations with the help of external and internal problem solvers. Most recently the U.S. federal government has encouraged agencies to col laboratively create and share open source code on the social coding platform GitHub and allow third parties to share their changes to the code. A community of government employees is using the social coding site GitHub Keywords: to share open source code for software and website development, distribution of data sets and research results, Open collaboration or to seek input to draft policy documents. Quantitative data extracted from GitHub's application programming Social coding interface is used to analyze the collaboration ties between contributors to government repositories and their GitHub reuse of digital products developed on GitHub by other government entities in the U.S. federal government. In addition, qualitative interviews with government contributors in this social coding environment provide insights into new forms of co development of open source digital products in the public sector. 1. Introduction government research teams also share their data sets, and algorithms, and recently selected agencies have started to co develop policy docu Multiple forms of open collaboration have emerged that enable the ments in text format on GitHub.
    [Show full text]
  • Plan to Increase Access to Results of FDA-Funded Scientific Research
    Plan to Increase Access to Results of FDA-Funded Scientific Research U.S. DEPARTMENT OF HEALTH AND HUMAN SERVICES FOOD AND DRUG ADMINISTRATION February 2015 TABLE OF CONTENTS I. BACKGROUND AND PURPOSE ....................................................................................................................... 1 II. DEFINITIONS .................................................................................................................................................. 3 III. SCOPE AND EFFECTIVE DATES ........................................................................................................................ 5 IV. PUBLIC ACCESS REQUIREMENTS ................................................................................................................ 6 A. PEER REVIEWED ARTICLES AND ARTICLE METADATA..................................................................................................... 6 i. General Requirements .................................................................................................................................. 6 ii. Scientific Article Repository .......................................................................................................................... 7 iii. Article Embargo Period ................................................................................................................................ 8 B. DIGITAL DATA ......................................................................................................................................................
    [Show full text]
  • In Plain Sight: Is Open Data Improving Our Health?
    CALIFORNIA HEALTHCARE FOUNDATION In Plain Sight: Is Open Data Improving Our Health? JANUARY 2015 Contents About the Author 3 Introduction Cheryl Wold, MPH, is an independent con- Understanding the Impact of Open Data sultant who provides strategic research and evaluation services to nonprofit and public- 5 Preventing Foodborne Illness in Chicago sector organizations. Previously, Cheryl was Innovation chief of the Health Assessment Unit in the Los Early Results Angeles County Public Health Department, where she directed the Los Angeles County 6 Childhood Obesity Surveillance in New York Health Survey and developed and dissemi- Innovation nated health indicators and data to guide Early Results health improvement activities. 8 Reducing Pedestrian Injuries in San Francisco About the Foundation Innovation The California HealthCare Foundation Early Results (CHCF) is leading the way to better health care for all Californians, particularly those 9 Public Reporting of Hospital-Acquired Infections whose needs are not well served by the Innovation and Early Results status quo. We work to ensure that people have access to the care they need, when California they need it, at a price they can afford. New York Going Forward CHCF informs policymakers and industry leaders, invests in ideas and innovations, 12 How Do We Evaluate Impact? and connects with changemakers to create a more responsive, patient-centered health 14 Acknowledgments care system. 15 Endnotes For more information, visit www.chcf.org. © 2015 California HealthCare Foundation California
    [Show full text]
  • CAPITALIZING on the OPEN DATA REVOLUTION Click Here to Save the Date! FOREWORD: WELCOME to the DATA-DRIVEN GOVERNMENT
    CAPITALIZING ON THE OPEN DATA REVOLUTION Click here to save the date! FOREWORD: WELCOME TO THE DATA-DRIVEN GOVERNMENT EMC’s Audie Hittle, federal Take the findings of a recent report by the Department chief technology officer, of Commerce’s Economics and Statistics Administration, Isilon Storage Division, EMC “Fostering Innovation, Creating Jobs, Driving Better Deci- Corporation, tells us how sions: The Value of Government Data.” It states that: open data is transforming our world £ Government data guides trillions of dollars of investments each year by local, state and federal governments; households; institutions; and private companies. £ Firms that intensively use data from Commerce’s Government has entered a new era, and it’s being pow- statistical agencies often combine it with other ered by open data. Open data has been driving us to government and private-sector data to create $24 strive for a more transparent, participatory and collabo- billion to $221 billion in annual revenues. rative government, and it helps us deal with the complex £ Government data is uniquely comprehensive, con- challenges we face today. sistent, credible, relevant and accessible, and still But how will the data-driven era shape the future of protects confidentiality. government? The report represents only a portion of the power of It’s simple. With open data programs, governments no government data to drive markets and improve govern- longer have to be the sole provider of services. Govern- ment efficiency. ment data can now serve as a platform for developers and We believe that government must embrace this new era citizens to access and create new services.
    [Show full text]