An Open Distributed Computing System Lukasz Swierczewski [email protected]

Total Page:16

File Type:pdf, Size:1020Kb

An Open Distributed Computing System Lukasz Swierczewski Luk.Swierczewski@Gmail.Com Berkeley Open Infrastructure for Network Computing - an open distributed computing system Lukasz Swierczewski [email protected] The Berkeley Open Infrastructure for Network Computing (BOINC) is an open source middleware system for volunteer and grid computing. It was originally developed to support the SETI@home project before it became useful as a platform for other distributed applications in areas as diverse as mathematics, medicine, molecular biology, climatology, and astrophysics. The intent of BOINC is to make it possible for researchers to tap into the enormous processing power of personal computers around the world. BOINC was originally developed to manage the SETI@home project. The original SETI client was a non-BOINC software exclusively for SETI@home. As one of the first volunteer grid computing projects, it was not designed with a high level of security. Some participants in the project attempted to cheat the project to gain "credits", while some others submitted entirely falsified work. BOINC was designed, in part, to combat these security breaches. The BOINC project started in February 2002 and the first version was released on 10 April 2002. The first BOINC-based project was Predictor@home launched on 9 June 2004. In 2009 AQUA@home deployed multi-threaded CPU applications for the first time, followed by the first OpenCL application in 2010. Operating systems and processors of the BOINC users. The country ranking in terms of quantity calculations carried out by members of the appropriate nationality. SETI@home ("SETI at home") is an Internet-based public volunteer computing project employing the BOINC software platform, hosted by the Space Sciences Laboratory, at the University of California, Berkeley, in the United States. SETI is an acronym for the Search for Extra- Terrestrial Intelligence. Its purpose is to analyze radio signals, searching for signs of extra terrestrial intelligence, and is one of Snapshot of BOINC SETI@home Astropulse Screensaver. many activities undertaken as part of SETI. Rosetta@home is a distributed computing project for protein structure prediction on the BOINC platform, run by the Baker laboratory at the University of Washington. Rosetta@home aims to predict protein–protein docking and design new proteins with the help of about sixty thousand active volunteered computers processing at 62 teraFLOPS on average as of October 18, 2011.Though much of the project is oriented towards basic research on improving the accuracy and robustness of the proteomics methods, Rosetta@home also does Rosetta@home screensaver, showing the progress of a applied research on malaria, structure prediction for a synthetic ubiquitin protein Alzheimer's disease and other (PDB ID: 1ogw) pathologies. Einstein@Home is a volunteer distributed computing project hosted by the University of Wisconsin– Milwaukee and the Max Planck Institute for Gravitational Physics (Albert Einstein Institute, Hannover, Germany). The project is directed by Bruce Allen. Einstein@Home searches through data from the LIGO detectors for evidence of continuous gravitational-wave sources, which are expected for instance from rapidly spinning non-axisymmetric neutron stars. On August 12, 2010, the first discovery by Einstein@Home of a previously undetected radio pulsar J2007+2722, found in data from the Arecibo Observatory, was published Einstein@Home Screensaver in Science. The project has discovered 46 pulsars as of August 2012. Climateprediction.net, or CPDN, is a distributed computing project to investigate and reduce uncertainties in climate modelling. It aims to do this by running hundreds of thousands of different models (a large climate ensemble) using the donated idle time of ordinary personal computers, thereby leading to a better understanding of how models are affected by small changes in the many parameters known to influence the global climate. Climateprediction.net Screensaver Radioactive@Home is a polish science project using distributed computing capabilities of BOINC platform. The main goal of the project is to create free and constantly updated map of radiation available for all people, by gathering information about gamma radiation using using sensors connected to computers of volunteers willing to participate in the project. The project uses dedicated hardware sensor; without it the app does nothing and no credits are granted. Project is completely non-commercial, participating will be free of charge and the software will be licensed under the GNU General Public License (GPL). on your computer. Radioactive@Home Sensor Version 2 Thank you for your attention. Lukasz Swierczewski [email protected] .
Recommended publications
  • Packet Switched, General Purpose FPGA-Based Modules For
    Peta-Flop Radio Astronomy Signal Processing and the CASPER Collaboration (and correlators too !) Dan Werthimer and 800 CASPER Collaborators http://casper.berkeley.edu Two Types of Signal Processing 1. Embarrassingly Parallel – Low Data Rates (record the data and process it later) (high computation per bit) 2. Real Time in-situ Processing Petabits per second (can not record data) TYPE 1 Embarrassingly Parallel – Low Data Rates (record the data and process it later) (high computation per bit) VOLUNTEER COMPUTING BOINC - Berkeley Open Infrastructure for Network Computing From Download Work Server Generator Arecibo Feeder Transitioner Shared Database Memory Purger MySQL Volunteers Scheduler Database File Deleter To Nobel Upload Validator Assimilator Server Prize Committee BERKELEY SETI RESEARCH CENTER BERKELEY ASTRONOMY CollaboratorsDEPARTMENT Berkeley SETI and Volunteer Computing Group David Anderson, Hong Chen, Jeff Cobb, Matt Dexter, Walt Fitelson, Eric Korpela, Matt Lebofsky, Geoff Marcy, David MacMahon, Eric Petigura, Andrew Siemion, Charlie Townes, Mark Wagner, Ed Wishnow, Dan Werthimer NSF , NASA, Individual Donors Agilent, Fujitsu, HP, Intel, Xilinx Arecibo Observatory High performance data storage silo UC Berkeley Space Sciences Lab Public Volunteers SETI@Home ✴ Polyphase Channelization ✴ Coherent Doppler Drift Search ✴ Narrowband Pulse Search ✴ Gaussian Drift Search ✴ Autocorrelation ✴ <insert your algorithm here> SETI@home Statistics TOTAL RATE 8,464,550 2,000 per day participants (in 226 countries) 3 million years 1,000 years
    [Show full text]
  • Tesi Di Laurea La Solidarietà Digitale
    Tesi di Laurea La Solidarietà Digitale Da Seti@home a Boinc. Ipotesi di una società dell’elaborazione. di Francesco Morello 1 INDICE Introduzione............................................................... 4 Capitolo I Calcolo Vontario....................................... 5 1.1 Dai media di massa al calcolo distribuito......... 5 1.2 Calcolo Distribuito............................................... 6 1.3 Calcolo Volontario............................................... 8 1.3.1 Come funziona il calcolo volontario ?.......... 10 1.3.2 Applicazioni del Calcolo Volontario.............. 11 Capitolo II Analisi di BOINC.................................... 23 2.1 Piattaforma per il calcolo volontario............... 23 2.2 Architettura di BOINC........................................ 25 2.2.1 L'interfaccia di BOINC.................................... 25 2.2.2 Progetti e Volontari......................................... 31 2.2.3 Checkpointing e Work unit............................ 32 2.2.4 Crediti e ridondanza....................................... 32 2.2.5 Gli scopi di BOINC.......................................... 33 Capitolo III Aspetti tecnici del calcolo distribuito 36 3.1 Grid Computing vs Volunteer Computing....... 36 3.2 Hardware e Software per il Distributed Computing38 3.2.1 La Playstation 3 per raggiungere il Petaflop.41 Capitolo IV Aspetti sociali del calcolo volontario 45 4.1 Riavvicinarci alla scienza.................................. 45 2 4.2 Volontari oltre la CPU........................................ 47 4.2.1 Forum, Blog
    [Show full text]
  • Volunteer Computing Different Grids for Different Needs
    From the Web to the Grid How did the Grid start? • Name “Grid” chosen by analogy with electric power grid (Foster and Kesselman 1997) • Vision: plug-in computer for processing power just like plugging in toaster for electricity. • Concept has been around for decades (distributed computing, metacomputing) • Key difference with the Grid is to realise the vision on a global scale. From the Web to the Grid – 2007 HST 2011: Grids and Volunteer Computing Different Grids for different needs There is as yet no unified Grid, like there is a single web. Rather there are many Grids for many applications: • Enterprise Grids link together PCs within one company. • Volunteer computing links together public computers. • Scientific Grids link together major computing centres. • Latest trend federates national Grids into global Grid infrastructure. • High Energy Physics is a driving force for this. HST 2011: Grids and Volunteer Computing The LHC data challenge 1 Megabyte (1MB) • 40 million bunch collisions per second A digital photo 1 Gigabyte (1GB) = 1000MB • After filtering, ~100 collisions of 5GB = A DVD movie interest per second per detector 1 Terabyte (1TB) = 1000GB World annual book production • > 1 Megabyte of data per collision 1 Petabyte (1PB) recording rate > 1 Gigabyte/sec = 1000TB Annual production of one LHC experiment 10 • 10 collisions recorded each year 1 Exabyte (1EB) stored data ~15 Petabytes/year = 1000 PB 3EB = World annual information production …for more than 10 years HST 2011: Grids and Volunteer Computing Data Storage for the LHC Balloon (30 Km) • LHC data correspond to about 20 million CDs each year! CD stack with 1 year LHC data! (~ 20 Km) Concorde Where will the (15 Km) experiments store all of these data? Mt.
    [Show full text]
  • Spin-Off Successes of SETI Research at Berkeley
    **FULL TITLE** ASP Conference Series, Vol. **VOLUME**, c **YEAR OF PUBLICATION** **NAMES OF EDITORS** Spin-Off Successes of SETI Research at Berkeley K. A. Douglas School of Physics, University of Exeter, Exeter, United Kingdom D. P. Anderson, R. Bankay, H. Chen, J. Cobb, E.J. Korpela, M. Lebofsky, A. Parsons, J. Von Korff, D. Werthimer Space Sciences Laboratory, University of California Berkeley, Berkeley CA, USA 94720 Abstract. Our group contributes to the Search for Extra-Terrestrial Intelligence (SETI) by developing and using world-class signal processing computers to analyze data collected on the Arecibo telescope. Although no patterned signal of extra-terrestrial origin has yet been detected, and the immediate prospects for making such a detection are highly uncertain, the SETI@home project has nonetheless proven the value of pursuing such research through its impact on the fields of distributed computing, real-time signal pro- cessing, and radio astronomy. The SETI@home project has spun off the Center for Astronomy Signal Processing and Electronics Research (CASPER) and the Berkeley Open Infrastructure for Networked Computing (BOINC), both of which are responsi- ble for catalyzing a smorgasbord of new research in scientific disciplines in countries around the world. Futhermore, the data collected and archived for the SETI@home project is proving valuable in data-mining experiments for mapping neutral galatic hy- drogen and for detecting black-hole evaporation. 1 The SETI@home Project at UC Berkeley SETI@home is a distributed computing project harnessing the power from millions of volunteer computers around the world (Anderson 2002). Data collected at the Arecibo radio telescope via commensal observations are filtered and calibrated using real-time signal processing hardware, and selectable channels are recorded to disk.
    [Show full text]
  • New SETI Sky Surveys for Radio Pulses
    New SETI Sky Surveys for Radio Pulses Andrew Siemiona,d, Joshua Von Korffc, Peter McMahond, Eric Korpelab, Dan Werthimerb,d, David Andersonb, Geoff Bowera, Jeff Cobbb, Griffin Fostera, Matt Lebofskyb, Joeri van Leeuwena, William Mallardd, Mark Wagnerd aUniversity of California, Berkeley - Department of Astronomy, Berkeley, California bUniversity of California Berkeley - Space Sciences Laboratory, Berkeley, California cUniversity of California, Berkeley - Department of Physics, Berkeley, California dUniversity of California, Berkeley - Berkeley Wireless Research Center, Berkeley, California Abstract Berkeley conducts 7 SETI programs at IR, visible and radio wavelengths. Here we review two of the newest efforts, Astropulse and Fly’s Eye. A variety of possible sources of microsecond to millisecond radio pulses have been suggested in the last sev- eral decades, among them such exotic events as evaporating primordial black holes, hyper-flares from neutron stars, emissions from cosmic strings or perhaps extraterrestrial civilizations, but to-date few searches have been conducted capable of detecting them. The recent announcement by Lorimer et al. of the detection of a powerful (≈ 30 Jy) and highly dispersed (≈ 375 cm−3 pc) radio pulse in Parkes multi-beam survey data has fueled additional interest in such phenomena. We are carrying out two searches in hopes of finding and characterizing these uS to mS time scale dispersed radio pulses. These two observing programs are orthogonal in search space; the Allen Telescope Array’s (ATA) ”Fly’s Eye” experiment observes a 100 square degree field by pointing each 6m ATA antenna in a different direction; by contrast, the Astropulse sky survey at Arecibo is extremely sensitive but has 1/3,000 of the instantaneous sky coverage.
    [Show full text]
  • Analysis and Predictions of DNA Sequence Transformations on Grids
    Analysis and Predictions of DNA Sequence Transformations on Grids A Thesis Submitted for the Degree of Master of Science (Engineering) in the Faculty of Engineering By Yadnyesh R. Joshi Supercomputer Education and Research Centre INDIAN INSTITUTE OF SCIENCE BANGALORE – 560 012, INDIA August 2007 Acknowledgments First of all I would like to extend my sincere thanks to my research supervisor Dr. Sathish Vadhiyar for his constant guidance and support during the entire period of my post-graduation at IISc. He was always approachable, supportive and ready to help in any sort of problem. I am very thankful to him for being extremely patient and understanding about the silly mistakes that I had made. Under his guidance I learned to approach problems in an organized manner and set realistic goals for my research. I thank him for his extreme patience and excellent technical guidance in writing and presenting research. Finally, he was and continues to be my role model for his hard work and passion for research. I am also thankful to Dr. Nagasuma Chandra, Dr. Debnath Pal from S.E.R.C. and Dr. Narendra Dixit from Chemical Engineering department for their very useful and interesting insights into the biological domain of our research. I am also thankful to all the faculty of S.E.R.C. for always inspiring us with their motivational talks. I would like to mention the names of my colleagues Sandip, Sanjay, Rakhi, Sundari, Antoine and Roshan for making their technical and emotional support. Special thanks to vatyaa kya group members for the adventures and the routines inside and outside the institute.
    [Show full text]
  • "Challenges and Formal Aspects of Volunteer Computing"
    BUDAPEST UNIVERSITY OF TECHNOLOGY AND ECONOMICS FACULTY OF ELECTRICAL ENGINEERING AND INFORMATICS DEPARTMENT OF AUTOMATION AND APPLIED INFORMATICS Attila Csaba Marosi Önkéntes Számítási Rendszerek Kihívásai és Formális Aspektusai Challenges and Formal Aspects of Volunteer Computing Ph.D. Thesis Thesis Advisors: Péter Kacsuk MTA SZTAKI, LPDS Sándor Juhász BME, AUT Budapest, 2016. ‘‘The important thing is not to stop questioning. Curiosity has its own reason for existing. One cannot help but be in awe when he contemplates the mysteries of eternity, of life, of the marvelous structure of reality. It is enough if one tries merely to comprehend a little of this mystery every day. Never lose a holy curiosity.’’ -- Albert Einstein Abstract Volunteer Computing (VC) and Desktop Grid (DG) systems collect and make available the donated the resources from non-dedicated computers like office and home desktops. VC systems are usually deployed to solve a grand compute intensive problem by researchers who either don’t have access to or don’t have the resources to buy a dedicated infrastruc- ture; or simply don’t want to maintain such an infrastructure. VC and DG paradigms seem similar, however they target different use cases and environments: DG systems operate within the boundaries of institutes, while VC systems collect resources from the publicly accessible internet. Evidently VC resembles DGs whereas DGs are not fully equivalent to VC. Contrary to “traditional grids” [1,2] there is no formal definition for the relationship of DG and VC that could be used to categorize existing systems. There are informal at- tempts to categorize them and compare with grid systems [3,4,5].
    [Show full text]
  • New SETI Sky Surveys for Radio Pulses
    New SETI Sky Surveys for Radio Pulses Andrew Siemiona,d, Joshua Von Korffc, Peter McMahond,e, Eric Korpelab, Dan Werthimerb,d, David Andersonb, Geoff Bowera, Jeff Cobbb, Griffin Fostera, Matt Lebofskyb, Joeri van Leeuwena, Mark Wagnerd aUniversity of California, Berkeley - Department of Astronomy, Berkeley, California, USA bUniversity of California Berkeley - Space Sciences Laboratory, Berkeley, California, USA cUniversity of California, Berkeley - Department of Physics, Berkeley, California, USA dUniversity of California, Berkeley - Berkeley Wireless Research Center, Berkeley, California, USA eStanford University - Department of Computer Science, Stanford, California, USA Abstract Berkeley conducts 7 SETI programs at IR, visible and radio wavelengths. Here we review two of the newest efforts, Astropulse and Fly’s Eye. A variety of possible sources of microsecond to millisecond radio pulses have been suggested in the last sev- eral decades, among them such exotic events as evaporating primordial black holes, hyper-flares from neutron stars, emissions from cosmic strings or perhaps extraterrestrial civilizations, but to-date few searches have been conducted capable of detecting them. The recent announcement by Lorimer et al. of the detection of a powerful (≈ 30 Jy) and highly dispersed (≈ 375 cm−3 pc) radio pulse in Parkes multi-beam survey data has fueled additional interest in such phenomena. We are carrying out two searches in hopes of finding and characterizing these µs to ms time scale dispersed radio pulses. These two observing programs are orthogonal in search space; the Allen Telescope Array’s (ATA) “Fly’s Eye” experiment observes a 100 square degree field by pointing each 6m ATA antenna in a different direction; by contrast, the Astropulse sky survey at Arecibo is extremely sensitive but has 1/3,000 of the instantaneous sky coverage.
    [Show full text]
  • Analyzing Daily Computing Runtimes on the World Community Grid
    https://doi.org/10.48009/3_iis_2020_289-297 Issues in Information Systems Volume 21, Issue 3, pp. 289-297, 2020 EXPLORING GRID COMPUTING & VOLUNTEER COMPUTING: ANALYZING DAILY COMPUTING RUNTIMES ON THE WORLD COMMUNITY GRID Kieran Raabe, Bloomsburg University of Pennsylvania, [email protected] Loreen M. Powell, Bloomsburg University of Pennsylvania, [email protected] ABSTRACT In the early 1990s, the concept of grid computing was little more than a simple comparison which stated that making computing power as accessible as electricity from a power grid. Today, grid computing is far more complex. Grid computing involves utilizing unused computing power from devices across the globe to run calculations and simulations and submit the results to move a project closer to its goal. Some projects, such as World Community Grid, work to achieve numerous goals, herein referred to as subprojects. The goal of this research is to explore grid computing and volunteer computing. Specifically, this research involved a daily collection of statistics on daily results returned to World Community Grid and daily computing runtimes for three subprojects named Africa Rainfall Project, Microbiome Immunity Project, and Help Stop TB. The data collection lasted four weeks and resulted in five graphs for each subproject being created with the collected data. Results revealed a correlation between daily results returned and daily runtimes, with each data point for results returned being slightly lower than the runtime data point from the same day. Keywords: Grid Computing, Volunteer Computing, BOINC, World Community Grid INTRODUCTION The concept of grid computing has existed since the early 1990s. Originally “a metaphor for making computer power as easy to access by the user as electricity from a power grid” (Dutton & Jeffreys, 2010).
    [Show full text]
  • EDGES Project Meeting
    International Desktop Grid Federation - Support Project Contract number: FP7-312297 Desktop Grids for e-Science Road Map Project deliverable: D5.5.1 Due date of deliverable: 2013-10-31 Actual submission date: 2013-12-27 Lead beneficiary: AlmereGrid Workpackage: WP5 Dissemination Level: PU Version: 1.2 (Final) IDGF-SP is supported by the FP7 Capacities Programme under contract nr FP7-312297. D5.5.1 – Desktop Grids for e-Science Road Map CopyriGht (c) 2013. MemBers of IDGF-SP consortium, see http://IDGF-SP.eu for details on the copyriGht holders. You are permitted to copy and distriBute verBatim copies of this document containinG this copyriGht notice But modifyinG this document is not allowed. You are permitted to copy this document in whole or in part into other documents if you attach the followinG reference to the copied elements: ‘Copyright (c) 2013. Members of IDGF-SP consortium - http://IDGF-SP.eu’. The commercial use of any information contained in this document may require a license from the proprietor of that information. The IDGF-SP consortium memBers do not warrant that the information contained in the deliveraBle is capaBle of use, or that use of the information is free from risk, and accept no liaBility for loss or damaGe suffered By any person and orGanisation usinG this information. WP3 © 2013. Members of IDGF-SP consortium - http://IDGF-SP.eu 2/8 D5.5.1 – Desktop Grids for e-Science Road Map Table of Contents 1 Status and ChanGe History .................................................................................................
    [Show full text]
  • Annual Report 2013 Cover Photo: the International LOFAR Telescope (ILT) & Big Data, Danielle Futselaar © ASTRON
    Annual report 2013 Cover photo: The International LOFAR Telescope (ILT) & Big Data, Danielle Futselaar © ASTRON. Photo on this page: prototype for the Apertif phased array feed. The Westerbork Synthesis Radio Telescope (WSRT) will be upgraded with Phased Array Feeds (PAFs), which will allow scientists to perform much faster observations with the telescope with a wider field of view. More information is available on the ASTRON/ JIVE daily image: http://www.astron.nl/ dailyimage/main.php?date=20130624. 2 ASTRON Annual report 2013 Facts and figures of 2013 8 Awards or grants 163 employees 162 refereed articles Funding: € 17,420,955 Expenditure: € 17,091,022 Balance: € 329.33 25 press releases 3 ASTRON Annual report 2013 Contents Facts and figures 2013 3 Director’s report 5 ASTRON Board and Management Team 7 ASTRON in brief 8 Contribution to top sectors 11 Performance indicators 12 Astronomy Group 17 Radio Observatory 22 R&D Laboratory 30 Connected legal entities 36 NOVA Optical/ Infrared Instrumentation Group 38 Joint Institute for VLBI in Europe 41 Outreach and education 43 Appendices 58 Appendix 1: financial summary 59 Appendix 2: personnel highlights 60 Appendix 3: WSRT & LOFAR proposals in 2013 62 Appendix 4: board, committees & staff in 2013 65 Appendix 5: publications 67 4 ASTRON Annual report 2013 Report 2013 was a year in which earlier efforts began to bear fruit. In particular, the various hardware and firmware changes made to the LOFAR telescope system in 2012, resulted in science quality data being delivered to the various Key Science Projects, and in particular the EoR (Epoch of Reionisation) Team.
    [Show full text]
  • Toward Crowdsourced Drug Discovery: Start-Up of the Volunteer Computing Project Sidock@Home
    Toward crowdsourced drug discovery: start-up of the volunteer computing project SiDock@home Natalia Nikitina1[0000-0002-0538-2939] , Maxim Manzyuk2[000-0002-6628-0119], Marko Juki´c3;4[0000-0001-6083-5024], Crtomirˇ Podlipnik5[0000-0002-8429-0273], Ilya Kurochkin6[0000-0002-0399-6208], and Alexander Albertian6[0000-0002-6586-8930] 1 Institute of Applied Mathematical Research, Karelian Research Center of the Russian Academy of Sciences, Petrozavodsk, Russia, [email protected] 2 Internet portal BOINC.ru, Moscow, Russia, [email protected] 3 Chemistry and Chemical Engineering, University of Maribor, Maribor, Slovenia 4 Faculty of Mathematics, Natural Sciences and Information Technologies, University of Primorska, Koper, Slovenia [email protected] 5 Faculty of Chemistry and Chemical Technology, University of Ljubljana, Ljubljana, Slovenia, [email protected] 6 Federal Research Center \Computer Science and Control" of the Russian Academy of Sciences, Moscow, Russia, [email protected], [email protected], [email protected] Abstract. In this paper, we describe the experience of setting up a computational infrastructure based on BOINC middleware and running a volunteer computing project on its basis. We characterize the first series of computational experiments and review the project's development in its first six months. The gathered experience shows that BOINC-based Desktop Grids allow to to efficiently aid drug discovery at its early stages. Keywords: Desktop Grid · Distributed computing · Volunteer comput- ing · BOINC · Virtual drug screening · Molecular docking · SARS-CoV-2 1 Introduction Among the variety of high-performance computing (HPC) systems, Desktop Grids hold a special place due to their enormous potential and, at the same time, high availability.
    [Show full text]