Science Gateways: the Long Road to the Birth of an Institute

Total Page:16

File Type:pdf, Size:1020Kb

Science Gateways: the Long Road to the Birth of an Institute Proceedings of the 50th Hawaii International Conference on System Sciences | 2017 Science Gateways: The Long Road to the Birth of an Institute Sandra Gesing Katherine Lawrence University of Notre Dame University of Michigan Notre Dame, USA Ann Harbor, USA Linda B. Hayden [email protected] [email protected] Elizabeth City State University Elizabeth City, USA Nancy Wilkins-Diehr Michael Zentner [email protected] University of California Purdue University San Diego, USA West Lafayette, USA [email protected] [email protected] Suresh Marru Indiana University Maytal Dahan Marlon E. Pierce Bloomington, USA University of Texas Indiana University [email protected] Austin, USA Bloomington, USA [email protected] [email protected] Abstract community-based access to shared, distributed Nowadays, research in various disciplines is enhanced resources and services - telescopes, sensor arrays, via computational methods, cutting-edge technologies supercomputers, digital repositories, software as a and diverse resources including computational service, collaboration environments, and more. infrastructures and instruments. Such infrastructures Gateways enable the formation of scientific are often complex and researchers need means to communities, accelerating and transforming the conduct their research in an efficient way without discovery process, and engaging citizens and students getting distracted with information technology in the scientific process. They represent a fundamental nuances. Science gateways address such demands and social and technological change in how science is offer user interfaces tailored to a specific community. being conducted. Creators of science gateways face a breadth of topics Since gateways form end-to-end solutions tailored and manifold challenges, which necessitate close to the communities’ needs, the creators of science collaboration with the domain specialists but also gateways are concerned with a diversity of topics and calling in experts for diverse aspects of a science have to tackle various challenges. The fundamental gateway such as project management, licensing, team first step is to understand the requirements of the composition, sustainability, HPC, visualization, and community and an at least high-level insight into the usability specialists. The Science Gateway Community research area. The following domain-related topics Institute tackles the challenges around science need to be addressed in close collaboration with a gateways to support domain specialists and developers target community. via connecting them to diverse experts, offering • Specific goal of a science gateway consultancy as well as providing a software • Visions/demands on the layout collaborative, which contains ready-to-use science • Priorities of features and options, e.g., a list gateway frameworks and science gateway components. from must-have to great-to-have options • Integration of existing applications or 1. Introduction development of applications • Technologies of the applications Billions of people use the web every day. From • Demand on computational resources for an banking to travel arrangements to connecting with efficient and effective use of an application family, the web has changed how we conduct our lives. • Visualization of results and job submissions Its impact on scientific research is no different. Science • Security demands, i.e., users may want to share gateways are used by many researchers to conduct data after they received a patent or published it their work. Implemented as web, desktop, and mobile- • Need for workflows device applications, science gateways provide URI: http://hdl.handle.net/10125/41919 ISBN: 978-0-9981331-0-2 CC-BY-NC-ND 6243 • Data management, e.g., the location of input Gateway development has often been done in an data, the demand on storage for input, ad-hoc way, limiting success and long-term impact. intermediate and output data The science gateways name originated from a focus When these topics are clarified, creators should area within the NSF TeraGrid program (2005-2011) consider further topics for a seamless integration of [11, 12]. This program developed policies and cyberinfrastructure (CI) to achieve efficiently a procedures that made the use of high performance sustainable solution. computers via the web viable. Gateway developers in • Available infrastructure including security the program worked together both to design the infrastructure and resources policies and procedures and then use them as they • Available support of suitable technologies incorporated the use of supercomputers in their own • Scalability of suitable technologies gateways. But the program had surprising, unintended benefits. In addition to working on the common goal to • Effort for extending existing technologies use supercomputers in gateways, developers benefitted compared to novel developments from the mere existence of a community designing Synergy effects with other science gateway • advanced web portals for science. There was projects unexpected commonality across disciplines and few • Experience with available solutions and/or other forums for sharing experiences in scientific programming languages development. National Science Foundation (NSF) reports [1,2,3,4,5,6,7] describe the enormous effect that CI is In the course of this work, we noticed problems having on science, but they do not highlight the with duplication of effort and the long-term dramatic shift taking place in how the average user sustainability of gateways. Science gateways enable accesses advanced CI. Specialized, technical interfaces research, sometimes for thousands, but are not research such as the Unix command line no longer serve the projects in and of themselves. As a result, they struggle broad, diversifying community. Because research to find the right avenues for sustainable funding [13]. challenges are more interdisciplinary, scientists This work revealed a key finding: Gateways funded via demand the ability to focus on their research questions, short-term research grants can follow a destructive as well as to convey their results to the general public. cycle. Prototypes are developed, early adopters are Moreover, faculty wants to more efficiently and identified, and interest in using the resulting gateway is effectively integrate their research with teaching. encouraged. Then the project ends. Understandably, Programming skills amongst interdisciplinary teams researchers who had invested time in gateway use can vary widely. For all of these reasons, science teams become disillusioned and less likely to use gateways are turning to the web [7,8]. But the significance of again. gateways goes beyond replacement of command-line A 2008 white paper [13] highlighted these tools to include sharing of research (methods, data, challenges and led to an NSF-funded study (2009- visualizations, etc.), increasing transparency of 2011) to investigate what contributes to gateway methods and data used, reproducing others’ results, success [14]. Participants in focus groups helped shielding users from much of the complex underlying identify solutions. One outcome is that gateway infrastructure, and applying research-grade tools in builders need a common, reliable reservoir of software, classrooms and training. expertise, and support they can call on to increase their chances of successful development and 2. Background implementation. Developers should be able to easily share experience and use community-contributed tools Over the past ten years, research, workshops, to more quickly create robust, less expensive, and collected papers, and special journal issues [9, 10] have sustainable gateways. Their efforts should maximize called attention to the challenges and successes of the likelihood that their respective communities science gateways, while extending our understanding become interested in and maintain those gateways of how to address these challenges. Despite notable when initial funding ends. Similarly, gateway software successes, building and operating gateways is still too and service providers need to plan for the future with costly and too prone to failure. The success rate and clear leadership to more effectively serve the gateway capabilities of gateways must be further increased community. while reducing associated costs and effort spent. The next step is to leverage and enhance every dollar that is invested in gateways by supporting best practices for development, deployment, and sustainability. 6244 3. Large-Scale Survey 4. Institute Components A further award (2012-2016) allowed to explore via The Science Gateways Community Institute is the a 5000-respondent survey [15] whether the ideas result of years of study and community input. An gleaned from small but carefully chosen focus groups organization at this level can serve as a focal point that resonated with the larger research community. To our galvanizes the community, leverages the extensive but best knowledge the large-scale survey is the most disjointed investments in gateway resources, codes, comprehensive of its kind, covering the breadth of the and expertise, and supports longer-term career paths scientific and engineering disciplines represented at for gateway developers by highlighting the importance NSF and across all geographic areas in the U.S. The of gateways in the conduct of research today. The survey targeted NSF principal investigators, gateway Institute’s mission is to provide resources, expertise, developers, and leaders in higher
Recommended publications
  • Retrocomputing As Preservation and Remix
    Retrocomputing as Preservation and Remix Yuri Takhteyev Quinn DuPont University of Toronto University of Toronto [email protected] [email protected] Abstract This paper looks at the world of retrocomputing, a constellation of largely non-professional practices involving old computing technology. Retrocomputing includes many activities that can be seen as constituting “preservation.” At the same time, it is often transformative, producing assemblages that “remix” fragments from the past with newer elements or joining together historic components that were never combined before. While such “remix” may seem to undermine preservation, it allows for fragments of computing history to be reintegrated into a living, ongoing practice, contributing to preservation in a broader sense. The seemingly unorganized nature of retrocomputing assemblages also provides space for alternative “situated knowledges” and histories of computing, which can sometimes be quite sophisticated. Recognizing such alternative epistemologies paves the way for alternative approaches to preservation. Keywords: retrocomputing, software preservation, remix Recovering #popsource In late March of 2012 Jordan Mechner received a shipment from his father, a box full of old floppies. Among them was a 3.5 inch disk labelled: “Prince of Persia / Source Code (Apple) / ©1989 Jordan Mechner (Original).” Mechner’s announcement of this find on his blog the next day took the world of nerds by storm.1 Prince of Persia, a game that Mechner single-handedly developed in the late 1980s, revolutionized computer games when it came out due to its surprisingly realistic representation of human movement. After being ported to DOS and Apple’s Mac OS in the early 1990s the game sold 2 million copies (Pham, 2001).
    [Show full text]
  • The Past, Present, and Future of Software Evolution
    The Past, Present, and Future of Software Evolution Michael W. Godfrey Daniel M. German Software Architecture Group (SWAG) Software Engineering Group School of Computer Science Department of Computer Science University of Waterloo, CANADA University of Victoria, CANADA email: [email protected] email: [email protected] Abstract How does our system compare to that of our competitors? How easy would it be to port to MacOS? Are users still Change is an essential characteristic of software devel- angry about the spyware incident? As new features are de- opment, as software systems must respond to evolving re- vised and deployed, as new runtime platforms are envis- quirements, platforms, and other environmental pressures. aged, as new constraints on quality attributes are requested, In this paper, we discuss the concept of software evolu- so must software systems continually be adapted to their tion from several perspectives. We examine how it relates changing environment. to and differs from software maintenance. We discuss in- This paper explores the notion of software evolution. We sights about software evolution arising from Lehman’s laws start by comparing software evolution to the related idea of software evolution and the staged lifecycle model of Ben- of software maintenance and briefly explore the history of nett and Rajlich. We compare software evolution to other both terms. We discuss two well known research results of kinds of evolution, from science and social sciences, and we software evolution: Lehman’s laws of software evolution examine the forces that shape change. Finally, we discuss and the staged lifecycle model of Bennett and Rajlich. We the changing nature of software in general as it relates to also relate software evolution to biological evolution, and evolution, and we propose open challenges and future di- discuss their commonalities and differences.
    [Show full text]
  • A Brief History of Software Engineering Niklaus Wirth ([email protected]) (25.2.2008)
    1 A Brief History of Software Engineering Niklaus Wirth ([email protected]) (25.2.2008) Abstract We present a personal perspective of the Art of Programming. We start with its state around 1960 and follow its development to the present day. The term Software Engineering became known after a conference in 1968, when the difficulties and pitfalls of designing complex systems were frankly discussed. A search for solutions began. It concentrated on better methodologies and tools. The most prominent were programming languages reflecting the procedural, modular, and then object-oriented styles. Software engineering is intimately tied to their emergence and improvement. Also of significance were efforts of systematizing, even automating program documentation and testing. Ultimately, analytic verification and correctness proofs were supposed to replace testing. More recently, the rapid growth of computing power made it possible to apply computing to ever more complicated tasks. This trend dramatically increased the demands on software engineers. Programs and systems became complex and almost impossible to fully understand. The sinking cost and the abundance of computing resources inevitably reduced the care for good design. Quality seemed extravagant, a loser in the race for profit. But we should be concerned about the resulting deterioration in quality. Our limitations are no longer given by slow hardware, but by our own intellectual capability. From experience we know that most programs could be significantly improved, made more reliable, economical and comfortable to use. The 1960s and the Origin of Software Engineering It is unfortunate that people dealing with computers often have little interest in the history of their subject.
    [Show full text]
  • The History of Computing in the History of Technology
    The History of Computing in the History of Technology Michael S. Mahoney Program in History of Science Princeton University, Princeton, NJ (Annals of the History of Computing 10(1988), 113-125) After surveying the current state of the literature in the history of computing, this paper discusses some of the major issues addressed by recent work in the history of technology. It suggests aspects of the development of computing which are pertinent to those issues and hence for which that recent work could provide models of historical analysis. As a new scientific technology with unique features, computing in turn can provide new perspectives on the history of technology. Introduction record-keeping by a new industry of data processing. As a primary vehicle of Since World War II 'information' has emerged communication over both space and t ime, it as a fundamental scientific and technological has come to form the core of modern concept applied to phenomena ranging from information technolo gy. What the black holes to DNA, from the organization of English-speaking world refers to as "computer cells to the processes of human thought, and science" is known to the rest of western from the management of corporations to the Europe as informatique (or Informatik or allocation of global resources. In addition to informatica). Much of the concern over reshaping established disciplines, it has information as a commodity and as a natural stimulated the formation of a panoply of new resource derives from the computer and from subjects and areas of inquiry concerned with computer-based communications technolo gy.
    [Show full text]
  • An Overview of Software Evolution
    An Overview of Software Evolution CPRE 416-Software Evolution and Maintenance-Lecture 2 1 Software Evolution • What is it? • How important is it? • What to do about it? 2 An early history of software engineering • The following slides provide a condensation of the ideas of Robert L. Glass in his book "In the Beginning: Recollections of Software Pioneers" about the history of software engineering. 3 The Pioneering Era (1955-1965) • New computers were coming out every year or two. • Programmers did not have computers on their desks and had to go to the "machine room". • Jobs were run by signing up for machine time. Punch cards were used. • Computer hardware was application-specific. Scientific and business tasks needed different machines. 4 The Pioneering Era (1955-1965) • High-level languages like FORTRAN, COBOL, and ALGOL were developed. • No software companies were selling packaged software. • Academia did not yet teach the principles of computer science. 5 The Stabilizing Era (1965-1980) • Came the IBM 360. • This was the largest software project to date. • The 360 also combined scientific and business applications onto one machine. 6 The Stabilizing Era (1965-1980) • Programmers had to use the job control language (JCL) to tell OS what to do. • PL/I, introduced by IBM to merge all programming languages into one, failed. • The notion of timesharing emerged. • Software became a corporate asset and its value became huge. • Academic computing started in the late 60's. • Software engineering discipline did not yet exist. 7 The Stabilizing Era (1965-1980) • High-hype disciplines like Artificial Intelligence emerged.. • Structured Programming burst on the scene.
    [Show full text]
  • The History of Unix in the History of Software Haigh Thomas
    The History of Unix in the History of Software Haigh Thomas To cite this version: Haigh Thomas. The History of Unix in the History of Software. Cahiers d’histoire du Cnam, Cnam, 2017, La recherche sur les systèmes : des pivots dans l’histoire de l’informatique – II/II, 7-8 (7-8), pp77-90. hal-03027081 HAL Id: hal-03027081 https://hal.archives-ouvertes.fr/hal-03027081 Submitted on 9 Dec 2020 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. 77 The History of Unix in the History of Software Thomas Haigh University of Wisconsin - Milwaukee & Siegen University. You might wonder what I am doing The “software crisis” and here, at an event on this history of Unix. the 1968 NATO Conference As I have not researched or written about on Software Engineering the history of Unix I had the same question myself. But I have looked at many other The topic of the “software crisis” things around the history of software and has been written about a lot by profes- this morning will be talking about how sional historians, more than anything some of those topics, including the 1968 else in the entire history of software2.
    [Show full text]
  • The Value of Source Code to Digital Preservation Strategies
    School of Information Student Research Journal Volume 2 Issue 2 Article 5 January 2013 Consider the Source: The Value of Source Code to Digital Preservation Strategies Michel Castagné The University of British Columbia, [email protected] Follow this and additional works at: https://scholarworks.sjsu.edu/ischoolsrj Part of the Library and Information Science Commons Recommended Citation Castagné, M. (2013). Consider the Source: The Value of Source Code to Digital Preservation Strategies. School of Information Student Research Journal, 2(2). https://doi.org/10.31979/2575-2499.020205 Retrieved from https://scholarworks.sjsu.edu/ischoolsrj/vol2/iss2/5 This article is brought to you by the open access Journals at SJSU ScholarWorks. It has been accepted for inclusion in School of Information Student Research Journal by an authorized administrator of SJSU ScholarWorks. For more information, please contact [email protected]. Consider the Source: The Value of Source Code to Digital Preservation Strategies Abstract One of the major challenges in the digital preservation field is the difficulty of ensuring long-term access to digital objects, especially in cases when the software that was used to create an object is no longer current. Software source code has a human-readable, documentary structure that makes it an overlooked aspect of digital preservation strategies, in addition to a valuable component for the records of modern computing history. The author surveys several approaches to software preservation and finds that, yb supporting open source initiatives, digital libraries can improve their ability to preserve access to their collections for future generations. Keywords source code, open source, digital preservation, software preservation About Author Michel Castagné is a Master of Library and Information Studies candidate at the University of British Columbia.
    [Show full text]
  • Javascript Affogato
    JavaScript Afogato: Programming a Culture of Improvised Expertise Brian Lennon Pennsylvania State University ABSTRACT: This essay attempts a philological—meaning a both technically and socially attentive—historical study of an individual computer programming language, JavaScript. From its introduction, JavaScript’s reception by software developers, and its importance in web development as we now understand it, was structured by a con- tinuous negotiation of expertise. I use the term “improvised expertise” to describe both conditions for and effects of the unanticipated de- velopment of JavaScript, originally designed for casual and inexpert coders, into a complex of technical artifacts and practices whose range and complexity of use has today propelled it into domains previously dominated by other, often older and more prestigious languages. “Im- provised expertise” also marks the conditions for and effects of three specific developmental dynamics in JavaScript’s recent history: first, the rapidly accelerated development of the language itself, in the ver- sions of its standard specification; second, the recent, abruptly emerg- ing, yet rapid growth of JavaScript in server-side networking, data pro- cessing, and other so-called back-end development tasks previously off limits to it; third, the equally recent and abrupt, yet decisive emer- gence of JavaScript as the dominant language of a new generation of dynamic web application frameworks and the developer tool chains or tooling suites that support them. Introduction 2016 was an inconspicuously transitional year for the information space once commonly referred to as the World Wide Web. Those Configurations, 2018, 26:47–72 © 2018 by Johns Hopkins University Press and the Society for Literature, Science, and the Arts.
    [Show full text]
  • A Survey of the Practice of Computational Science
    A Survey of the Practice of Computational Science Prakash Prabhu Thomas B. Jablin Arun Raman Yun Zhang Jialu Huang Hanjun Kim Nick P. Johnson Feng Liu Soumyadeep Ghosh Stephen Beard Taewook Oh Matthew Zoufaly David Walker David I. August Princeton University fpprabhu,tjablin,rarun,yunzhang,[email protected] fhanjunk,npjohnso,fengliu,soumyade,[email protected] ftwoh,mzoufaly,dpw,[email protected] ABSTRACT even for computer scientists [33]. Given this background, Computing plays an indispensable role in scientific research. this paper seeks to answer the question: How are scientists Presently, researchers in science have different problems, coping with the growing computing demands? needs, and beliefs about computation than professional pro- Recently, an online survey conducted a broad study of the grammers. In order to accelerate the progress of science, programming practices of a wide range of researchers, re- computer scientists must understand these problems, needs, vealing many potential problems encountered in correctly and beliefs. To this end, this paper presents a survey of writing scientific programs [30, 43]. Continuing in the same spirit, this paper presents an in-depth study of the practice of scientists from diverse disciplines, practicing computational 1 science at a doctoral-granting university with very high re- computational science at Princeton University, a RU/VH search activity. The survey covers many things, among institution. This study is conducted through a survey of them, prevalent programming practices within this scien- researchers from diverse scientific disciplines. This survey tific community, the importance of computational power in covers important aspects of computational science including different fields, use of tools to enhance performance and soft- programming practices commonly employed by researchers, ware productivity, computational resources leveraged, and the importance of computational power, and the perfor- prevalence of parallel computation.
    [Show full text]
  • Brief History of Computer and Their Application
    Brief History Of Computer And Their Application Displeased and well-meant Mort parbuckle her aerodynes wale or ceases globularly. Ira is lubberly transformative after disquisitional Knox rook his fibs cumulatively. Unblissful Silvester exonerating some funnel-webs and unknit his etherealisation so anes! From adversely affecting other and their instructions Apple computer history of applications emerge that you can be used. The computer industry to their cursor control readers are needed networking, special offers color monitor. Among computer and computational physics, and users could play a military. She is history of application to search and brief history of software that honour goes further. In computer industry might already gone their appreciation to calculate predicted tide levels. This and application may request that he began working in bedrock and his machine to continuously. This brief content and their locations and card, are partially sponsored by comparing long does cctv solutions more. In their way to left to store one application, practical use buffers that could now about greater speed and forwarding of computable. Still be hosted within a computer system level format on their computers. Nothing could perform addition to applications include the history of computable functions of. Zuse and its use and history of computer and brief sketch of. In a universal use similar experiments and brief history. As computer history, computing applications written in brief history of application software that. Languages in europe, you hear about greater amount of application of computer history and brief, helped their use today, microsoft windows xp, the unix as the routers in these data that.
    [Show full text]
  • SHORT HISTORY of SOFTWARE METHODS by David F. Rico This Short History Identifies 32 Major Classes of Software Methods That Have Emerged Over the Last 50 Years
    SHORT HISTORY OF SOFTWARE METHODS by David F. Rico This short history identifies 32 major classes of software methods that have emerged over the last 50 years. There are many variations of each major class of software method, which renders the number of software methods in the hundreds. This short history contains a brief synopsis of each of the 32 major classes of software methods, identifying the decade and year they appeared, their purpose, their major tenets, their strengths, and their weaknesses. The year each software method appeared corresponds to the seminal work that introduced the method based on extensive bibliographic research and the strengths and weaknesses were based on scholarly and empirical works to provide an objective capstone for each method. Mainframe Era Midrange Era Microcomputer Era Internet Era Personalized Era 1960s 1970s 1980s 1990s 2000s Flowcharting Structured Design Formal Specification Software Estimation Software Reliability Iterative/Incremental Software Inspections Structured Analysis Software Testing Configuration Control Quality Assurance Project Management Defect Prevention Process Improvement CASE Tools Object Oriented Software Reuse Rapid Prototyping Concurrent Lifecycle Software Factory Domain Analysis Quality Management Risk Management Software Architecture Software Metrics Six Sigma Buy versus Make Personal Processes Product Lines Synch-n-Stabilize Team Processes Agile Methods Market Conditions Market Conditions Market Conditions Market Conditions Market Conditions y Mainframe computer y Computer
    [Show full text]
  • By Michael L. Black Dissertation
    TRANSPARENT CULTURES: IMAGINED USERS AND THE POLITICS OF SOFTWARE DESIGN (1975-2012) BY MICHAEL L. BLACK DISSERTATION Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in English in the Graduate College of the University of Illinois at Urbana-Champaign, 2014 Urbana, Illinois Doctoral Committee: Professor Robert Markley, Chair Associate Professor Ted Underwood Associate Professor Melissa Littlefield Associate Professor Spencer Schaffner Associate Professor John T. Newcomb ii Abstract The rapid pace of software’s development poses serious challenges for any cultural history of computing. While digital media studies often sidestep historicism, this project asserts that computing’s messy, and often hidden, history can be studied using digital tools built to adapt text-mining strategies to the textuality of source code. My project examines the emergence of personal computing, a platform underlying much of digital media studies but that itself has received little attention outside of corporate histories. Using an archive of technical papers, professional journals, popular magazines, and science fiction, I trace the origin of design strategies that led to a largely instrumentalist view of personal computing and elevated “transparent design” to a privileged status. I then apply text-mining tools that I built with this historical context in mind to study source code critically, including those features of applications hidden by transparent design strategies. This project’s first three chapters examine how and why strategies of information hiding shaped consumer software design from the 1980s on. In Chapter 1, I analyze technical literature from the 1970s and 80s to show how cognitive psychologists and computer engineers developed an ideal of transparency that discouraged users from accessing information structures underlying personal computers.
    [Show full text]