gLite Middleware
Enabling Grids for E-sciencE
Overview of the EGEE project and the gLite middleware
www.eu-egee.org
EGEE-III INFSO-RI-222667
Providing a Production Grid Infrastructure for Collaborative Science

Outline
• What is EGEE?
  – The project
  – The infrastructure
• gLite middleware
• EGEE applications
• Sources of further information

Defining the Grid
• A Grid is the combination of networked resources and the corresponding middleware, which provides services for the user.

The EGEE Project
• Aim of EGEE: “to establish a seamless European Grid infrastructure for the support of the European Research Area (ERA)”
• EGEE
  – 1 April 2004 to 31 March 2006
  – 71 partners in 27 countries, federated in regional Grids
• EGEE-II
  – 1 April 2006 to 30 April 2008
  – Expanded consortium
• EGEE-III
  – 1 May 2008 to 30 April 2010
  – Transition to sustainable model

EGEE working with related infrastructure projects
[Figure: related infrastructure projects, labelled GIN]

What is happening now?
• Real Time Monitor
  – Java tool
  – Displays jobs running (submitted through RBs)
  – Shows jobs moving around world map in real time, along with changes in status
  – http://gridportal.hep.ph.ic.ac.uk/rtm/ (snapshot 16 January 2007)

EGEE Infrastructures
• Production service
  – Scaling up the infrastructure with resource centres around the globe
  – Stable, well-supported infrastructure, running only well-tested and reliable middleware
• Pre-production service
  – Run in parallel with the production service (restricted number of sites)
  – First deployment of new versions of the gLite middleware
  – Test-bed for applications and other external functionality
• T-Infrastructure (Training & Education)
  – Complete suite of Grid elements and application (Testbed, CA, VO, monitoring, support, …)
  – 20 sites on 3 continents
  – Everyone can register and use GILDA for training and testing

NA3 activity: User training and induction
• Expand portfolio of training materials & courses
• Train a wide variety of EGEE users (internal/external)
• Develop effective mechanisms for training end-users of the EGEE infrastructure
• Collaborate in cross-activity initiatives
• 29 active partners, ~ 29 FTEs, 89 individuals, 6 federations
[Figure: pie chart of activity shares. SA1 49%, NA5 1%; the remaining labels SA3, JRA1, NA1, NA2, NA3, SA2, NA4 appear with 9%, 5%, 2%, 5%, 8%, 2%, 19% (in extraction order)]
• ICEAGE Project Digital Library
  – http://library.iceage-eu.org/
  – Videos, MP3 talks on grid computing
• http://www.egee.nesc.ac.uk/
  – Training events
  – Training material repository
• http://egee.lib.ed.ac.uk/
  – EGEE Digital Library
  – Repository of training materials

NA4 Activity: Application identification and support
• Application Identification and Support (NA4)
  – 25 countries, 40 partners, 280+ participants, 1000s of users
• Support the large and diverse
  EGEE user community:
  – Promote dialog: Users’ Forums & EGEE Conferences
  – Technical Aid: Porting support, procedural issues
  – Liaison: Software and operational requirements
• Main activities:
  – 5 application clusters: HEP, Life sciences, Astronomy & astrophysics, Earth science, Computational chemistry, Fusion, Grid observatory
  – Support:
      Application porting support: www.lpds.sztaki.hu/gasuc
      VO support
      Direct user support: www.ggus.org
      Regional support
• http://egeena4.lal.in2p3.fr

EGEE Infrastructure
[Figure: map of countries participating in EGEE; charts of No. Cores (0 to 80000) and No. Sites (0 to 300), April 2004 to April 2008]
• > 200 sites in 40 countries
• ~ 38 000 CPUs
• ~ 5 PB storage
• 98k jobs/day
• > 200 Virtual Organizations
⇨ The world’s largest multi-disciplinary Grid

Resource management: structure
• Operations Coordination Centre (OCC)
  – management, oversight of all operational and support activities
• Regional Operations Centres (ROC)
  – providing the core of the support infrastructure, each supporting a number of resource centres within its region
  – Grid Operator on Duty
• Resource centres
  – providing resources (computing, storage, network, etc.)
• Grid User Support (GGUS)
  – At FZK, coordination and management of user support, single point of contact for users

VO concept
[Figure label: INTERNET]
• gLite middleware runs on each shared resource to provide
  – Data services
  – Computation services
  – Security service
• Resources and users form Virtual organisations: basis for collaboration
• Distributed services (both
  people and middleware) enable the grid

Grid middleware
• The Grid relies on advanced software, called middleware, which interfaces between resources and the applications
• The Grid middleware:
  – Basic services
      Secure and effective access to resources
  – High level services
      Optimal use of resources
      Authentication to the different sites that are used
      Job execution & monitoring of progress
      Problem recovery
      Transfer of results back to the user

Grid Middleware
• When using a PC or workstation you
  – Login with a username and password (“Authentication”)
  – Use rights given to you (“Authorisation”)
  – Run jobs
  – Manage files: create them, read/write, list directories
• Components are linked by a bus
• Operating system
• One admin. domain

• When using a Grid you
  – Login with digital credentials: single sign-on (“Authentication”)
  – Use rights given you (“Authorisation”)
  – Run jobs
  – Manage files: create them, read/write, list directories
• Services are linked by the Internet
• Middleware
• Many admin. domains

EGEE Middleware: gLite
• gLite 3.0, gLite 3.1
  ⇨ Merger of LCG 2.7 and gLite 1.5
  – Exploit experience and existing components from VDT (Condor, Globus), EDG/LCG, and others
  – Develop a lightweight stack of generic middleware useful to EGEE applications (HEP and Biomedics are pilot applications).
      Should eventually deploy dynamically (e.g.
      as a globus job)
      Pluggable components: cater for different implementations
  – Focus is on providing a stable and usable infrastructure

Basic gLite use case: Job submission
[Figure: job submission workflow. Components: User Interface, Resource Broker, Information System, File and Replica Catalog, VO Management Service (DB of VO users), Logging and bookkeeping, and Site X with a Computing Element and a Storage Element. Arrow labels: create proxy; Submit job (executable + small inputs); query; publish state; Submit job; Input file(s); process; Output file(s); Register file; Job status; Logging; Retrieve status & (small) output files; Retrieve output]

Main components
User Interface (UI): The place where users log on to the Grid
Resource Broker (RB) (Workload Management System (WMS)): Matches the user requirements with the available resources on the Grid
Information System: Characteristics and status of CE and SE
File and replica catalog: Location of grid files and grid file replicas
Logging and Bookkeeping (LB): Log information of jobs
Computing Element (CE): A batch queue on a site’s computers where the user’s job is executed
Storage Element (SE): provides (large-scale) storage for files
All built upon authorisation, authentication, security

Who provides the resources?!
Service | Provider | Note
User interface | User / institute / VO | Computer with client SW
Resource Broker (WMS) | VOs - EGEE does not fund RBs |
Information System | Grid operations - EGEE funded effort |
File