An Interoperable & Optimal Data Grid Solution For
Total Page:16
File Type:pdf, Size:1020Kb
An Interoperable & Optimal Data Grid Solution for Heterogeneous and SOA based Grid- GARUDA Payal Saluja, Prahlada Rao B.B., ShashidharV, Neetu Sharma, Paventhan A. Dr. B.B Prahlada Rao [email protected] s IPDPS 2010, Atlanta 2010, IPDPS s ’ 19 April 2010 IPDPS10 System Software Development Group Centre for Development of Advanced Computing HPGC Workshop, IEEE Workshop, HPGC C-DAC Knowledge Park, Bangalore, India GARUDA DataGrid Solutions:GSRM – Prahlada Rao.. et all 1 Presentation Outline • Grid Storage Requirements • Various Data Grid Solutions • Comparision of Grid data Solutions • Indian National Grid –GARUDA • Data Management Challenges for GARUDA • GARUDA Data Storage Solution - GSRM • GSRM Highlights • GSRM Architecture Integration with GARUDA s IPDPS 2010, Atlanta 2010, IPDPS s middleware components ’ • GSRM Usage Scenario for Par. applications • Conclusions HPGC Workshop, IEEE Workshop, HPGC GARUDA DataGrid Solutions:GSRM – Prahlada Rao.. et all 2 Grid Storage Requirements • Data Availability • Security • Performance & Latency • Scalability • Fault Tolerance s IPDPS 2010, Atlanta 2010, IPDPS s ’ HPGC Workshop, IEEE Workshop, HPGC GARUDA DataGrid Solutions:GSRM – Prahlada Rao.. et all Data Grid Solutions & Supported Storage Systems Data Grid Storage Systems Storage Grid File Storage Resource System Resource iRODS WS-DAI Broker Manager Gfarm StoRM GPFS DPM Lustre dCache Xtreemfs Grid Storage Storage Grid Solutions Bestman NFSv4 1.File Systems 1.File systems 1. File systems 1.File Systems AMG 2. Archives 2.Parallel File 2. Parallel File 2. Archives A WS- 3. Storage Area Systems Systems 3. Storage DAI s IPDPS 2010, Atlanta 2010, IPDPS s ’ Network (SAN) 3.Object 3.Object storage Area Network 4. Data Bases storage device (SAN) WS- 4. Mass storage 5. CAS devices 4. Data Bases DAIX Supported Supported Storage Systems systems 6. Mass Storage 5. CAS systems 6. MSS Hierarchy of Data Grid Solutions and Supported Storage Systems HPGC Workshop, IEEE Workshop, HPGC GARUDA DataGrid Solutions:GSRM – Prahlada Rao.. et all Survey of Data Grid Solutions • Storage Resource Broker (Nirvana SRB) • iRODS (Integrated Rule Oriented Data System) • GFS (Grid File Systems) • WS-DAI (Web Service-data Access & Integration) s IPDPS 2010, Atlanta 2010, IPDPS s ’ • SRM (Storage Resource manager) HPGC Workshop, IEEE Workshop, HPGC GARUDA DataGrid Solutions:GSRM – Prahlada Rao.. et all Feature comparison of Grid Data Solutions Features SRB iRODS SRM GFS WS- (Nirvana) DAI Organization SDSC, SDSC EGEE GFS-WG OGSA group Nirvana (GGF) Tool/ Spec Tool Tool Spec Spec Spec Storage File system, MSS, File File systems,, File systems database Support Database, system MSS, Object based Global Namespace Yes Yes Yes Yes Yes Security GSI, GSI, GSI, VOMS GSI WS-Security Unix Auth, Unix auth, kerberos kerberos s IPDPS 2010, Atlanta 2010, IPDPS s ’ Standardization Proprietary tool No OGF OGF GGF Interoperability No No Yes Yes Space Management No No Yes No No HPGC Workshop, IEEE Workshop, HPGC Replication yes yes yes GARUDA DataGrid Solutions:GSRM – Prahlada Rao.. et all Indian National Grid Computing Initiative-GARUDA s IPDPS 2010, Atlanta 2010, IPDPS s ’ Website : www.garudaindia.in HPGC Workshop, IEEE Workshop, HPGC GARUDA DataGrid Solutions:GSRM – Prahlada Rao.. et all 7 GARUDA- Indian National Grid Computing Initiative - Objectives • Share High-end Computational Resources with the larger Scientific and Engineering community across India. • Emerging High Performance Computing (HPC) Applications require integration of geographically distributed resources • Collaborative Frameworks for solving applications that are interdisciplinary, experts participation from multiple domains and distributed locations • Universal (location-independence, ubiquitous) access to resources s IPDPS 2010, Atlanta 2010, IPDPS s ’ HPGC Workshop, IEEE Workshop, HPGC GARUDA DataGrid Solutions:GSRM – Prahlada Rao.. et all 8 Components Evolution in GARUDA Project Phases Phase GARUDA PoC GARUDA Found GARUDA Main Features Phase Phase Phase Middleware Globus 2.4.3 Globus 4.0.7 Globus + Clouds (Stable release) (Stable release) Web Pre WS Web Service Based Web Service Based compliance SOA Support Not supported Service Oriented Grid Supported Architecture Centralized Peer to Peer Peer to Peer Grid Meta Moab Gridway NA Scheduler s IPDPS 2010, Atlanta 2010, IPDPS s ’ QOS Rudimentary Advanced Reservation Yes Compliance Storage SRB-Commercial SRM- Open source S/W NA Solutions Virtual Virtual Community Enabling Virtual Fully Supported HPGC Workshop, IEEE Workshop, HPGC Community Groups formed. Communities through Support VOMS GARUDA DataGrid Solutions:GSRM – Prahlada Rao.. et all 9 GARUDA Partners (Currently -45) . Institute of Plasma Research, Ahmedabad . University of Hyderabad, Hyderabad . Physical Research Laboratory, Ahmedabad . Centre for DNA Fingerprinting and Diagnostics, . Space Applications Centre, Ahmedabad Hyderabad . Harish Chandra Research Institute, Allahabad . Jawaharlal Nehru Technological University, . Motilal Nehru National Institute of Technology, Hyderabad Allahabad . Indian Institute of Technology, Kanpur . Jawaharlal Nehru Centre for Advanced Scientific . Indian Institute of Technology, Kharagpur Research, Bangalore . Saha Institute of Nuclear Physics, Kolkatta . Indian Institute of Astrophysics, Bangalore . Central Drug Research Institute, Lucknow . Indian Institute of Science, Bangalore . Sanjay Gandhi Post Graduate Institute of Medical . Institute of Microbial Technology, Chandigarh Sciences, Lucknow . Punjab Engineering College, Chandigarh . Bhabha Atomic Research Centre, Mumbai . Madras Institute of Technology, Chennai . Indian Institute of Technology, Mumbai . Indian Institute of Technology, Chennai . Tata Institute of Fundamental Research, Mumbai . Institute of Mathematical Sciences, Chennai . IUCCA, Pune . Indian Institute of Technology, Delhi . National Centre for Radio Astrophysics, Pune s IPDPS 2010, Atlanta 2010, IPDPS s ’ . Jawaharlal Nehru University, Delhi . National Chemical Laboratory, Pune . Institute for Genomics and Integrative Biology, Delhi . Pune University, Pune . Indian Institute of Technology, Guwahati . Indian Institute of Technology, Roorkee . Guwahati University, Guwahati . Regional Cancer Centre, Thiruvananthapuram . Vikram Sarabhai Space Centre, Thiruvananthapuram HPGC Workshop, IEEE Workshop, HPGC . Institute of Technology, Banaras Hindu University, Varanasi GARUDA DataGrid Solutions:GSRM – Prahlada Rao.. et all 10 Cyber Infrastructure – Resources • PARAM Padma (Aix, Bangalore), Linux Clusters at Pune, Hyderabad & Chennai • Grid Labs have been setup at Bangalore, Pune & Hyderabad • Fourteen of the partner institutions contributed resources including Satellite Terminals (compute aggregating to 1600+ CPUs) s IPDPS 2010, Atlanta 2010, IPDPS s ’ HPGC Workshop, IEEE Workshop, HPGC GARUDA DataGrid Solutions:GSRM – Prahlada Rao.. et all 11 GARUDA Component Architecture Access Methods Management, Monitoring • Access Portal for & Accounting SOA • Paryaveekshanam • Problem Solving • Web MDS Environments • GARUDA Information Service • GARUDA Accounting Security Framework • IGCA Certificates MyProxy GARUDA Resources • • VOMS s IPDPS 2010, Atlanta 2010, IPDPS s ’ • Compute, Data, Storage, Scientific Instruments, • Resource Mgmt & Scheduling • Software,.. • GridWay Meta-scheduler • Resource Reservation • Torque, Load Leveler Globus 4.x (WS Components) HPGC Workshop, IEEE Workshop, HPGC • GARUDA DataGrid Solutions:GSRM – Prahlada Rao.. et all 12 • Indian Grid Certification Authority (IGCA): Located at C-DAC, Knowledge Park, Bangalore, India. • IGCA is the first CA in India for the purpose of Grid research. • Managed by GARUDA -Grid Operation Centre. s IPDPS 2010, Atlanta 2010, IPDPS s ’ • Issues X.509 Certificates to support the secure environment in Grid ( to institutes doing grid research in India and Internationaly collaborating with GARUDA). HPGC Workshop, IEEE Workshop, HPGC GARUDA DataGrid Solutions:GSRM – Prahlada Rao.. et all 13 GARUDA current Phase: Objectives • Provide an Operational Stable Cyber Infrastructure with Service oriented technologies for scientific/ Commercial applications • Deliver A Service Level Architecture ie usable by a wide range of scientific disciplines • Integrate GARUDA with other International Grids • Address long-term research issues in Grid Computing Deliverables: s IPDPS 2010, Atlanta 2010, IPDPS s ’ • Grid Technologies & Research • SOA based Infrastructure • Applications • Capacity and Community Building HPGC Workshop, IEEE Workshop, HPGC GARUDA DataGrid Solutions:GSRM – Prahlada Rao.. et all 14 Grid Technologies & Research Works • Secure Access Methods Collaborative Environments • Grid Middleware-SOA and QOS – Create & Manage Virtual Organizations • PSE & Program Devlp. Environmenrts – Multi-Comp Distr Application-building • Data Management Solutions – Managing Resources through – Managing Data Collection common Access methods – Parallel File System • Research Initiatives – Parallel & Distributed DB systems – Scheduling – Rescheduling, Migration, – I/O Libraries Redistribution • Grid Monitoring & Management – Checkpointing – Fault tolerance s IPDPS 2010, Atlanta 2010, IPDPS s ’ • Collaborative Environment – Application Specific MW Development – Performance Modelling of Applications HPGC Workshop, IEEE Workshop, HPGC GARUDA DataGrid Solutions:GSRM – Prahlada Rao.. et all 15 GARUDA-Project Dissemination Mechanisms • Website : www.garudaindia.in • Workshops on Grid Computing – Held in collaboration with CERN at Bangalore,