HPGC Workshop, IEEE’s IPDPS 2010, Atlanta

An Interoperable & Optimal Data Grid Solution for Heterogeneous and SOA based Grid: GARUDA

Payal Saluja, Prahlada Rao B.B., Shashidhar V, Neetu Sharma, Paventhan A.
System Software Development Group, Centre for Development of Advanced Computing (C-DAC), Knowledge Park, Bangalore, India
Dr. B.B. Prahlada Rao, [email protected]
19 April 2010

GARUDA DataGrid Solutions: GSRM – Prahlada Rao et al.

Presentation Outline
• Grid Storage Requirements
• Various Data Grid Solutions
• Comparison of Grid Data Solutions
• Indian National Grid: GARUDA
• Data Management Challenges for GARUDA
• GARUDA Data Storage Solution: GSRM
• GSRM Highlights
• GSRM Architecture & Integration with GARUDA middleware components
• GSRM Usage Scenario for parallel applications
• Conclusions

Grid Storage Requirements
• Data Availability
• Security
• Performance & Latency
• Scalability
• Fault Tolerance

Data Grid Solutions & Supported Storage Systems

Hierarchy of Data Grid Solutions and Supported Storage Systems
[Figure: hierarchy of data grid solutions and the storage systems each supports]
• Grid File System (XtreemFS, Lustre, GPFS, Gfarm, NFSv4)
  – 1. File systems
  – 2. Parallel file systems
  – 3. Object storage devices
• Storage Resource Manager (BeStMan, dCache, DPM, StoRM)
  – 1. File systems
  – 2. Parallel file systems
  – 3. Object storage devices
  – 4. Mass storage systems
• Storage Resource Broker (iRODS)
  – 1. File systems
  – 2. Archives
  – 3. Storage Area Network (SAN)
  – 4. Databases
  – 5. CAS
  – 6. Mass storage systems (MSS)
• WS-DAI (WS-DAIX, AMGA)

Survey of Grid Data Solutions
• SRB
• iRODS (Integrated Rule Oriented Data System)
• SRM (Storage Resource Manager)
• GFS (Grid File Systems)
• WS-DAI (Web Service Data Access & Integration)

Feature Comparison of Data Grid Solutions
Features compared: Organization, Tool/Spec, Storage Support, Security, Standardization, Interoperability, Global Namespace, Space Management.
• SRB (SDSC, Nirvana): proprietary tool (Nirvana); storage support: file system, MSS, object based; security: Unix auth, GSI
• iRODS (SDSC): tool; storage support: file system; security: Unix auth, Kerberos, GSI
• SRM (OGF, EGEE): spec; storage support: file systems, MSS; security: GSI, VOMS
• GFS (OGF (GGF) GFS-WG): spec; storage support: file systems; security: GSI
• WS-DAI (GGF, OGSA group): spec; storage support: databases; security: WS-Security
Yes/No entries for Standardization, Interoperability, Global Namespace, and Space Management (column order not recoverable): SRB: Yes, Yes, No, No; iRODS: Yes, Yes, No, No, No; SRM: Yes, Yes, Yes, Yes; GFS: Yes, No, -; WS-DAI: Yes, Yes, No, -.

GARUDA: Indian National Grid Computing Initiative
GARUDA Website: www.garudaindia.in

GARUDA: Indian National Grid Computing Initiative, Objectives
• Share high-end computational resources with the larger scientific and engineering community across India.
• Emerging High Performance Computing (HPC) applications require integration of geographically distributed resources.
• Collaborative frameworks for solving applications that are interdisciplinary, with participation of experts from multiple domains and distributed locations.
• Universal (location-independent, ubiquitous) access to resources.

Evolution of Components in GARUDA Project Phases
Features | GARUDA PoC Phase | GARUDA Foundation Phase | GARUDA Main Phase
Middleware | Globus 2.4.3 (Stable release) | Globus 4.0.7 (Stable release) | Globus + Clouds
Web compliance | Pre WS | Web Service Based | Web Service Based
SOA Support | Not supported | Service Oriented Grid | Supported
Architecture | Centralized | Peer to Peer | Peer to Peer
Grid Meta Scheduler | Moab (Commercial) | Gridway (Open source S/W) | NA
QoS Compliance | Rudimentary | Advanced Reservation | Yes
Storage Solutions | SRB | SRM | NA
Virtual Community Support | Virtual Community Groups formed | Enabling Virtual Communities through VOMS | Fully Supported
GARUDA Partners (Currently 45)
• Space Applications Centre, Ahmedabad
• Physical Research Laboratory, Ahmedabad
• Institute of Plasma Research, Ahmedabad
• Harish-Chandra Research Institute, Allahabad
• Motilal Nehru National Institute of Technology, Allahabad
• Jawaharlal Nehru Centre for Advanced Scientific Research, Bangalore
• Indian Institute of Science, Bangalore
• Indian Institute of Astrophysics, Bangalore
• Punjab Engineering College, Chandigarh
• Institute of Microbial Technology, Chandigarh
• Institute of Mathematical Sciences, Chennai
• Indian Institute of Technology, Chennai
• Madras Institute of Technology, Chennai
• Institute for Genomics and Integrative Biology, Delhi
• Jawaharlal Nehru University, Delhi
• Indian Institute of Technology, Delhi
• Guwahati University, Guwahati
• Indian Institute of Technology, Guwahati
• University of Hyderabad, Hyderabad
• Jawaharlal Nehru Technological University, Hyderabad
• Centre for DNA Fingerprinting and Diagnostics, Hyderabad
• Indian Institute of Technology, Kanpur
• Indian Institute of Technology, Kharagpur
• Saha Institute of Nuclear Physics, Kolkata
• Central Drug Research Institute, Lucknow
• Sanjay Gandhi Post Graduate Institute of Medical Sciences, Lucknow
• Tata Institute of Fundamental Research, Mumbai
• Indian Institute of Technology, Mumbai
• Bhabha Atomic Research Centre, Mumbai
• Pune University, Pune
• National Chemical Laboratory, Pune
• National Centre for Radio Astrophysics, Pune
• IUCAA, Pune
• Indian Institute of Technology, Roorkee
• Regional Cancer Centre, Thiruvananthapuram
• Vikram Sarabhai Space Centre, Thiruvananthapuram
• Institute of Technology, Banaras Hindu University, Varanasi
• ......

GARUDA Cyber Infrastructure: Resources
• PARAM Padma clusters and Grid Labs have been set up at Bangalore (AIX), Pune, Hyderabad & Chennai
• Fourteen of the partner institutions contributed resources, including Satellite Terminals
• Compute resources aggregating to 1600+ CPUs

GARUDA Component Architecture
Management, Monitoring & Accounting
• Paryaveekshanam
• Web MDS
• GARUDA Information Service
• GARUDA Accounting
GARUDA Resources
• Compute, Data, Storage, Scientific Instruments, Software, ..
Resource Mgmt & Scheduling
• GridWay Meta-scheduler
• Resource Reservation
• Torque, Load Leveler
• Globus 4.x (WS Components)
Security Framework
• IGCA Certificates
• MyProxy
• VOMS
Access Methods
• Access Portal for SOA
• Problem Solving Environments

Indian Grid Certification Authority (IGCA)
• IGCA is the first internationally recognized CA in India for the purpose of Grid research.
• Issues X.509 certificates to support the secure Grid environment (to institutes in India doing grid research and collaborating with GARUDA).
• Managed by the GARUDA Grid Operation Centre.
• Located at C-DAC, Knowledge Park, Bangalore, India.

GARUDA Current Phase: Objectives
• Provide an operational, service-level, stable cyber infrastructure, i.e. usable by a wide range of scientific disciplines
• Deliver Service Oriented Architecture technologies for scientific/commercial applications
• Integrate GARUDA with other international Grids
• Address long-term research issues in Grid Computing
Deliverables:
• Grid Technologies & Research
• SOA based Infrastructure
• Applications
• Capacity and Community Building

Grid Technologies & Research Works
• Data Management Solutions
  – Managing Data Collection
  – Parallel File System
  – Parallel & Distributed DB systems
  – I/O Libraries
• Secure Access Methods
• Grid Middleware: SOA and QoS
• PSE & Program Devlp. Environments
• Grid Monitoring & Management
• Collaborative Environment
• Research Initiatives
  – Collaborative Environments
  – Create & Manage Virtual Organizations
  – Managing Resources through common Access methods
  – Scheduling, Rescheduling, Migration, Redistribution
  – Checkpointing
  – Fault tolerance
  – Application Specific MW Development
  – Performance Modelling of Distr. Comp. Applications
  – Multi-Comp. Application building

GARUDA: Project Dissemination Mechanisms
• GARUDA Website: www.garudaindia.in
• Workshops on GARUDA Introduction
• National Workshops on Grid Computing
  – Held in collaboration with CERN at Bangalore, Delhi and Pune in February 2006
• Workshops on Enablement of Applications on Garuda (DAG)
• Internal trainings on Grid Tools & Technologies for GARUDA deployment
  – Moab Grid Scheduler by Cluster Resources, USA
  – Storage Resource Broker by Nirvana, USA
  – C-DAC SIGMA for GARUDA deployment at partner sites
• Workshops at GARUDA partner locations
• GARUDA Partner Meets: at regular intervals

Data Management Challenges for GARUDA
• Unified access point for GARUDA distributed heterogeneous storage resources.
• 24x7 availability of storage for jobs submitted to GARUDA, with the support of storage reservation.
• Dynamic Space Management that enables efficient storage usage.
• Adherence to international grid storage standards, for interoperability with other grids like the EU grid.
• Scalability to cater to the huge IO and storage requirements of data intensive scientific applications in fields like:
  – Particle Physics
  – Bioinformatics
  – Healthcare
  – Biomedical informatics
• High performance IO access to storage for real-time parallel applications.
• Storage to support Grid as a service and the service oriented architecture on which GARUDA is based.

SRM Based GARUDA Data Solution: GSRM
• GARUDA SRM implementation: GSRM
• SRM implementations are based on SOA, adhering to the OGSA model.
• GSRM services are available for users as web services.
• GSRM is based on the open source Disk Pool Manager (DPM).
• GSRM supports high performance file systems.

SRM Implementations
The objective of this project is to develop a Storage Solution customized for GARUDA users' requirements.
Advantages of SRM in Garuda Grid:
• SRM works as a web service that adheres to OGF standards
• Easily integrates with Grid services like Information service, MDS, RLS
• Provides space reservation for data intensive applications
• Provides security using GSI, VOMS
• Scalability
• Access to file systems, mass storage systems
• Implementations are interoperable

Experiences with SRM Implementations
SRM implementations evaluated: StoRM, DPM, BeStMan
Pros:
• Parallel file system support
• GT4 support
• Parallel file system support
• Inbuilt data replication facility
• Free source code available
• Most of the required functionalities working fine
• Easy installation & maintenance
• Recommended platform: Scientific Linux, but also works on RedHat Linux
• Licensed source code available with nominal charges
• Provides support through email or chat group
• Space reservation working with cmd line
• Easy installation & maintenance
• Works on all versions of Linux
Cons:
• No support for Mass storage system
• Lack of complete set of BeStMan Java APIs
• No support for GT4
• No support from StoRM team
• XML-RPC communication error troubleshooting
• Space reservation is not working properly
• No source code availability, only binary distribution is available through rpms
• Strict binding with OS (SL & RHEL4-update4)

GSRM Highlights
• Global Namespace
• Availability
• Space Management
• Security
• Interoperability
• Performance
• Persistence

GSRM Highlights & Components Mapping
[Figure: mapping of GSRM highlights to components. Labels: GARUDA SRM admin request; GARUDA SRM user request; SRM server / Request; BeStMan, StoRM; Namespace Manager; Space Manager; Gsiftp server; Rfio server; DB (namespace metadata, user, request); highlights Availability, Performance, Data, Global Namespace, Space Mgmt, Persistence, Security, Interoperability.]

GSRM Architecture & Integration with other GARUDA middleware components
[Figure: GSRM architecture. Labels: GSRM server (SRM server, Space manager, namespace manager); GSRM Information server; Disk based Storage; Storage Resource Broker; C-PFS; IO Servers; Internet; MyProxy server / VOMS proxy; GARUDA portal; GARUDA clients; GARUDA Network; Head Node; Meta scheduler; CN; GSRM Clients; GARUDA Federated Information system.]

Usage Scenario of GSRM by Parallel Applications
• Client initiates transfer of all the input files from the storage element (SE) using the srmPrepareToPut command.
• Client also specifies requisite parameters, such as the lifetime required for the files and the space token received from the srmReserveSpace command.
• Storage space is maintained by C-PFS, with multiple IO servers with locally attached storage.
• Actual file transfer can be carried out by the GridFTP protocol, using the transfer URL (TURL) returned by srmPrepareToPut.
• In the backend, the file data is striped across file servers, and the C-PFS driver can issue a request to reconstruct the files.
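The usage scenario above can be sketched as a small in-memory model. This is an illustration only, not a real SRM client: the class `MockGSRM`, its method names, the host names, and the 4-byte stripe size are all hypothetical, and round-robin striping is an assumption (the slides only say that file data is striped across file servers). It shows the order of operations: reserve space, request a TURL, transfer, and reconstruct.

```python
from dataclasses import dataclass, field


@dataclass
class MockGSRM:
    """Hypothetical in-memory stand-in for a GSRM server; not a real SRM API."""
    io_servers: list            # names of IO servers with locally attached storage
    free_bytes: int             # space still available for reservation
    stripe_size: int = 4        # bytes per stripe (tiny, for illustration only)
    tokens: dict = field(default_factory=dict)    # space token -> reservation
    pending: dict = field(default_factory=dict)   # TURL -> SURL awaiting transfer
    stored: dict = field(default_factory=dict)    # SURL -> stripes per IO server

    def srm_reserve_space(self, size, lifetime_s):
        """Models srmReserveSpace: set aside `size` bytes, return a space token."""
        if size > self.free_bytes:
            raise ValueError("not enough free space to reserve")
        self.free_bytes -= size
        token = f"space-{len(self.tokens) + 1}"
        self.tokens[token] = {"free": size, "lifetime_s": lifetime_s}
        return token

    def srm_prepare_to_put(self, surl, size, token):
        """Models srmPrepareToPut: check the reservation, return a transfer URL."""
        space = self.tokens[token]
        if size > space["free"]:
            raise ValueError("file does not fit in the reserved space")
        space["free"] -= size
        turl = "gsiftp://" + self.io_servers[0] + "/" + surl.rsplit("/", 1)[-1]
        self.pending[turl] = surl
        return turl

    def gridftp_put(self, turl, data):
        """Models the GridFTP transfer: stripe `data` round-robin over IO servers."""
        surl = self.pending.pop(turl)
        stripes = {server: [] for server in self.io_servers}
        for n, start in enumerate(range(0, len(data), self.stripe_size)):
            server = self.io_servers[n % len(self.io_servers)]
            stripes[server].append(data[start:start + self.stripe_size])
        self.stored[surl] = stripes
        return stripes

    def reconstruct(self, surl):
        """Models the backend request to reassemble a file from its stripes."""
        stripes = self.stored[surl]
        rounds = max(len(chunks) for chunks in stripes.values())
        parts = []
        for r in range(rounds):
            for server in self.io_servers:
                if r < len(stripes[server]):
                    parts.append(stripes[server][r])
        return b"".join(parts)


if __name__ == "__main__":
    gsrm = MockGSRM(io_servers=["io1", "io2", "io3"], free_bytes=100)
    token = gsrm.srm_reserve_space(size=20, lifetime_s=3600)
    turl = gsrm.srm_prepare_to_put("srm://gsrm.example/data/input.dat", 10, token)
    gsrm.gridftp_put(turl, b"0123456789")
    print(gsrm.reconstruct("srm://gsrm.example/data/input.dat"))
```

In a real deployment these steps would go through an SRM client and a GridFTP client against the GSRM server; the model only makes the dependency between the space token, the TURL, and the striped backend explicit.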

GARUDA Multisite: data transfer using SRB, iRODS, and SRM
[Figure: time taken (sec) versus file size (KB) for data transfer using SRB, iRODS, and SRM.]
Conclusions
• GARUDA: SOA based Grid Architecture
• SRM is the optimal data grid solution for GARUDA:
  – SRM adheres to open grid standards and supports interoperability
  – Provides well-defined and interoperable interfaces
  – Facilitates integrated distributed environments to develop scientific applications
• Proposed SRM based GARUDA Data Solution: GARUDA SRM (GSRM)
• Integrates with high performance C-DAC parallel file systems, providing high aggregate IO bandwidth for parallel applications

Thank you!