Grid/MetaComputing Lecture-7
Spring 2003
Dr Graham Fagg
with support material from Mark Baker and Michael Resch

Lecture 7 Overview
• Making it all work
  – Resource Management and Scheduling
  – Scheduling MPI jobs
• From Cluster/Batch schedulers to MetaComputing
• All we really need to do

Contents - Resource Management and Scheduling
• Clusters, MetaComputing and Super Computers
• Motivation for CMS
• Motivation for using Clusters
• Cycle Stealing
• Management Software
• Problems with Resource Management
• Cluster Management Software Systems
• From Clusters to MetaComputing

Clusters, Super Computers and Metacomputing
• A main goal of distributed computing research is to provide users with simple and transparent access to a distributed set of heterogeneous computing resources.
• This is often called Metacomputing - the user submits jobs to a virtual distributed computer, rather than specifying particular computers.
• Super Computing, on the other hand, is designed to provide massive computational power to users. Performance is more important than usability.
• MetaComputing promises both.

Cluster and Metacomputing
• Metacomputing is still a research area - current implementations are limited, mostly applying to LANs rather than WANs.
  – Except for when SuperComputing conferences are taking place!
  – The situation is getting better all the time.
• LAN implementations are Cluster Management Software (CMS) or Cluster Computing Environments (CCE).
• MetaComputing systems can be built by extending LAN systems, or by using many LAN systems together.

Cluster and Metacomputing
• Cluster Management Software
  – Manages clusters of (mostly) workstations as a distributed compute resource.
  – Built on top of the existing OS.
• Cluster Computing Environments
  – Software to allow the cluster to be used as an applications environment, similar to Distributed Shared Memory (DSM) systems.
  – Built into the OS kernel for improved performance.

Cluster and Metacomputing
• The World Wide Web is now so ubiquitous that it is becoming the platform of choice for distributed computing (or Intranet computing) and Metacomputing.
  – Requires both distributed code as well as data!
  – See the WebFlow project later.
  – Mostly used as the server-to-client binding structure.
    • I.e. submit a web form rather than a job request ticket.

Motivation for Resource Management
• Users want to be able to submit their jobs without having to worry about where they run - i.e. submit jobs to a metacomputer (virtual computer) rather than search for spare cycles on a real computer.
  – Ease of use.
• Large organisations (companies, universities, national labs, etc.) typically have hundreds or thousands of powerful workstations for use by employees, which is a major under-utilised compute resource.
  – Check the lecture 3 notes on Resource Management.
• What are spare cycles, and do we want to use them?

Motivation for using Clusters
• Surveys show utilisation of CPU cycles of desktop workstations is typically <10%.
• Performance of workstations and PCs is rapidly improving (my laptop > 60 Mflop/s on Fortran 77 code).
• As performance grows, percent utilisation will decrease even further!
• Organisations are reluctant to buy large supercomputers, due to the large expense and short useful life span.

Usage
(Figure: workstation usage graphs.) Usage depends on the class of the users, as shown here: Meteorology versus Psychology.

Motivations for Clusters
• The development tools for workstations are more mature than the contrasting proprietary solutions for parallel computers - mainly due to the non-standard nature of many parallel systems.
• Workstation clusters are a cheap and readily available alternative to specialised High Performance Computing (HPC) platforms.
• Use of clusters of workstations as a distributed compute resource is very cost effective - incremental growth of the system!
  – CPUs and disks are a lot cheaper, but some of the better interconnection cards like Myrinet, Gigabit Ethernet, etc. are expensive.
  – Well, almost… try upgrading all your systems to PIIIs after you just bought all the PIIs… with the wrong motherboards.
    • Do a few a week maybe?

Motivation for using Clusters
• The communications bandwidth between workstations is increasing as new networking technologies and protocols are implemented in LANs and WANs.
• Workstation clusters are easier to integrate into existing networks than special parallel computers.
  – Install Linux from the local NFS copy… (look at the NASA Beowulf project)
  – MPPs require special HiPPI switches and interface hookups.
• Many MetaComputers will be made from Clusters, although most of the larger research efforts prefer to integrate different MPPs rather than clusters.
  – I.e. more output for less effort from the MetaComputing system itself (i.e. getting the Gordon Bell Award).

Cycle Stealing
• Usually a workstation will be owned by an individual, group, department, or organisation - they are dedicated to the exclusive use of their owners.
• This brings problems when attempting to form a cluster of workstations for running distributed applications.
  – Unless it is a dedicated cluster like the TORC cluster (if it is managed correctly, that is).
    • TORC runs too many different tests/configurations
      – I.e. not a stable platform?

Cycle Stealing
• Typically, there are three types of owner, who use their workstations mostly for:
  1. Sending and receiving mail and preparing documents.
  2. Software development - the edit, compile, debug and test cycle.
  3. Running compute-intensive applications.

Cycle Stealing
• Cluster computing aims to steal spare cycles from (1) and (2) to provide resources for (3).
• However, this requires overcoming the ownership hurdle - people are very protective of their workstations.
• Usually requires an organisational mandate that computers are to be used in this way.

Cycle Stealing
• Stealing cycles outside standard work hours (e.g. overnight) is easy; stealing idle cycles during work hours without impacting interactive users (both CPU and memory) is much harder.

Management Software
• Software for managing clusters or metacomputers must handle many complex issues:
  – Heterogeneous environments (computer and network hardware, software, OS, protocols, etc.).
  – Resource Management.
    • CPUs, disk arrays, and sometimes long-haul network connections.
  – Job scheduling.
    • Handling multiple schedulers at the same time.
  – Job allocation policy (prioritisation).
  – User interface and job specification mechanism.
  – Security and authentication.
  – Cycle stealing from desktop computers without impacting interactive use by the owner.
  – Fault tolerance.
  – Support for batch and interactive jobs.
  – Should support all programming paradigms (sequential, data parallel, message passing, shared memory, threads, etc.).
  – Should support legacy applications.

Problems with Distributed Computing
• Portability problems in heterogeneous environments.
  – Use CORBA? But then which ORB are you using?
  – Use Java? Maybe the native compilers will be available soon.
• Difficult to tune and optimise across all platforms.
  – Very true on MPPs: sometimes you have to use threads and sometimes just different compiler flags.
• Scheduling and load balancing.
• Trade-off between the performance hit of using remotely mounted files and the disk space requirements of using local disk.
  – Local disks or local disk arrays (i.e. parallel I/O).
    • How do you get the data there and back? All 15 terabytes of it?

Problems with Distributed Computing
• Relatively immature technology.
• Few standards (particularly for Metacomputing).
• Lack of expertise of users, developers, and systems support staff.
• High cost of good commercial software.
• Relatively immature software development tools (debugger/profiler/etc.).
• Applications difficult and time-consuming to develop.
• How to handle legacy systems - rewrite, add an interface, or use wrappers?

Problems with Distributed Computing
• Management becomes much more difficult when going from:
  – LAN to WAN
  – Sequential to Parallel
• Latency and bandwidth are more problematic on distributed computers.
• Load balancing is very difficult on multi-user systems, particularly for cycle-stealing of desktop systems (what happens if someone runs a job on a machine that is currently running a parallel program?).
• Fault tolerance and process migration are issues.

Problems with Distributed Computing
• Going from LAN to WAN:
  – One motivation for WAN-based systems is access to remote data repositories, which involves complex issues with the storage and migration of large data sets.
  – Administration, access permissions and authentication issues are much harder across site boundaries.
  – Maintenance of management software is harder - administration...
  – Latency is increased and bandwidth decreased.
  – Need wide-area file system access and temporary disk space.
    • Moving data again. What about IBP?

Existing Cluster Management Software
• Many research and commercial packages exist - almost all originate from academic research projects.
• Widespread use in large organisations, national labs, and supercomputing centres.
• Mostly designed for Unix - some support for PCs via Linux; support for NT is sparse...
• Not much use of the Web as yet - some packages can use a Web browser as an alternate GUI.
• Targeted at LANs rather than WANs.

Cluster Computing - Research Projects
• Batch - UCSF, USA
• CCS (Computing Centre Software) - Paderborn, Germany
• Condor - University of Wisconsin, USA
• DJM (Distributed Job Manager) - Minnesota Supercomputing Center, USA
• DQS (Distributed Queuing System) - Florida State University, USA
• EASY - Argonne National Lab, USA
• far - University of Liverpool, UK
• Generic NQS (Network Queuing System) - University of Sheffield, UK
• MDQS - ARL, USA
• NOW (Network of Workstations) - Berkeley, USA
• NIMROD - Monash University, Australia
• PBS (Portable Batch System) - NASA Ames and LLNL, USA
• PRM (Prospero Resource Manager) - University of S. California, USA
• QBATCH - Vita Services Ltd., USA

Cluster Computing - Commercial Software
• Codine (Computing in Distributed Network Environment) - GENIAS GmbH, Germany
• Connect:Queue - Sterling Corp., USA
• CS1/JP1 - Hitachi and Cummings Group, USA
• Load Balancer - Unison Software, USA
• LoadLeveler - IBM Corp., USA
• LSF (Load Sharing Facility) - Platform Computing, Canada
• NQE (Network Queuing Environment) - Craysoft Corp., USA

Using Cluster Management Software
1. Produce a job description file.
   – Text file produced manually or via a GUI.
   – Contains information such as the program name, maximum runtime, desired platform, etc.
2. Submit the job.
   – The job is submitted to the metacomputer manager.
   – The job description file is passed to a master scheduler.
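As a rough illustration of step 1, a job description file might look like the following PBS-style script (PBS appears in the package lists nearby; the job name, resource limits and program here are made-up examples, not taken from the lecture):

#!/bin/sh
# Illustrative PBS-style job description - all names and values are hypothetical.
#PBS -N ocean_model             # job name
#PBS -l walltime=02:00:00       # maximum runtime
#PBS -l nodes=4                 # desired number of machines
#PBS -q batch                   # queue to submit to
cd $PBS_O_WORKDIR               # run from the directory the job was submitted from
./ocean_model input.dat         # the program itself

Step 2 then amounts to handing this file to the scheduler, e.g. with something like qsub job.pbs.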

Using Cluster Management Software
3. Job scheduling.
   – Each computer in the cluster runs a resource daemon that communicates state to the master scheduler.
   – The scheduler checks system configuration, loads, queues, etc.
   – Multiple queues may be used for different types of jobs - sequential, parallel, batch, those needing fast turnaround, etc.
   – The scheduler matches the job request to available resources while attempting to balance the computational load.

Using Cluster Management Software
4. Job completion and fault tolerance.
   – The scheduler monitors each job to check that it finishes successfully - if not, it will reschedule it.
     • But not across different schedulers as of yet!
   – Can be configured to send you email when finished, but output will probably end up in a file such as out.job.XXXX.
   – Make sure you get your large data file results out before your files are removed/purged.

Using Cluster Management Software
(Figure: CMS architecture - the CMS client on the client machine sends the job description file to the CMS master server; the master scheduler there dispatches jobs to local schedulers, which manage the local resources.)

Desirable Features of CMS - I
• Resource configuration file for all computers - CPU type, performance, memory, times of availability (e.g. 5pm - 9am), authorised users, etc.
  – Look back at the GRM scheduler...
• Resource administration - control over access permissions to resources, authentication, accounting, configurable queue types, system loads, available disk space, etc.
• Resource load balancing is a key requirement.

Desirable Features of CMS - II
• Checkpointing (save state at regular intervals during a run) - see the sketch below.
  – Not just the job, but the schedulers as well!
• Periodic checkpointing is useful (though costly) for fault tolerance of long jobs - a job can be restarted at the last checkpointed position.
• Process migration...
• Scalability, preferably across administrative boundaries to medium-area or wide-area networks.

Desirable Features of CMS - III
• Robustness - the system can handle network problems and the dynamic addition and removal of compute resources.
• No single points of failure - e.g. does the system need to be restarted if the master scheduler crashes?
• GUI (preferably from a Web browser) as well as a command-line interface.
• Configuration - easy to set up, maintain, and administer.
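To make the checkpoint/restart idea concrete, here is a minimal user-level sketch in C; the file name, state layout and checkpoint interval are all assumptions for illustration, not taken from any particular CMS:

#include <stdio.h>
#include <stdlib.h>

#define N 1000000
static double data[N];          /* hypothetical application state */

/* Save the iteration counter and data so the job can restart here. */
static void checkpoint(int iter) {
    FILE *f = fopen("job.ckpt", "wb");
    if (!f) return;             /* skip this checkpoint on error */
    fwrite(&iter, sizeof iter, 1, f);
    fwrite(data, sizeof(double), N, f);
    fclose(f);
}

/* Return the iteration to resume from, or 0 for a fresh start. */
static int restore(void) {
    int iter = 0;
    FILE *f = fopen("job.ckpt", "rb");
    if (!f) return 0;
    if (fread(&iter, sizeof iter, 1, f) == 1)
        fread(data, sizeof(double), N, f);
    fclose(f);
    return iter;
}

int main(void) {
    for (int iter = restore(); iter < 10000; iter++) {
        /* ... one unit of real work on data[] goes here ... */
        if (iter % 100 == 0)    /* checkpoint every 100 steps */
            checkpoint(iter);
    }
    return 0;
}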

Desirable Features of CMS - IV
• Easy to use - simple job specification.
• User information - the user can query the status of a job and get execution information.
• Minimise impact on the workstation owner:
  – Change priority (nice).
  – Suspend a job or migrate it to another machine (but this uses CPU/memory/disk/network resources).
• Security - provide at least the standard Unix level of security and authentication.

Problems with Existing CMS
• LAN-based - not scalable.
• Limited platform and OS support (not truly heterogeneous); very little support for PC clusters, which are becoming a major compute resource.
• NT support will probably be available in future - MS Wolfpack, Codine, LSF...
• Do not support all programming paradigms - weak support for parallel jobs.

Problems with Existing CMS
• Load balancing is generally rudimentary (e.g. based on rup) and not dynamic.
• Cycle stealing is not used effectively (the ownership hurdle).
• Limited fault tolerance and checkpointing.
• Single points of failure - particularly the master scheduler.

Problems with Existing CMS
• Current R&D work involves creating distributed schedulers with a fault-tolerant distributed database to store system configuration information.
  – Globus uses the Metadata Directory Service (MDS), which is based on Netscape's LDAP server.
  – SNIPE uses RCDS.

Recommended CMS Packages
• See the review at http://nhse.cs.rice.edu/NHSEreview/CMS/
• Research packages:
  – DQS is probably the most comprehensive, functional and commonly used research offering.
  – GNQS is robust and widely used (e.g. at CERN), but does not support parallel jobs.
  – NOW is an impressive project that aims to address problem areas with current CMS packages.

Recommended CMS Packages
• Commercial packages - these are generally more robust but can be very expensive:
  – Codine
  – LSF
  – LoadLeveler
  – NQE
  – Connect:Queue

Other Packages (mainly MPP ones)
• Maui Scheduler (NMSC)
• Portable Batch System (NASA)
• Parallel Operating Environment POE (IBM)

Scheduling Message Passing
(what Meta-Computing systems need to do)
CS-594
Dr Graham E. Fagg
Spring 2003

Overview
– Scheduling Message Passing (what Meta-Computing systems need to do)

Closely coupled vs Cluster Computing
• Bottom line
  – MPI is better at message passing than PVM
  – More complex
  – Less flexible at anything else
    • I.e. it is a message passing system, not a distributed environment

Scheduling
• Not built into MPI, as it has no process control
  – But maybe an option under MPIRUN

Scheduling
• Yep, PVM has it all:
  – user-controllable pvm_spawn()
  – pvm_rm (resource manager) interface
  – also:
    • pvm_tasker interface
    • pvm_hoster interface
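For reference, the user-controllable call looks like this in practice; pvm_spawn() and its flags are the real PVM 3 interface, while the "worker" executable and the host/architecture names are made up:

#include <stdio.h>
#include <pvm3.h>

int main(void) {
    int tids[4];
    /* Let PVM place 4 copies of "worker" wherever it likes
       (PvmTaskDefault); the where argument is ignored in that case. */
    int started = pvm_spawn("worker", NULL, PvmTaskDefault, "", 4, tids);
    printf("started %d of 4 tasks\n", started);

    /* Or pin the tasks to a specific host or architecture class: */
    /* pvm_spawn("worker", NULL, PvmTaskHost, "node1.cs.utk.edu", 4, tids); */
    /* pvm_spawn("worker", NULL, PvmTaskArch, "LINUX", 4, tids); */

    pvm_exit();
    return 0;
}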

Scheduling
• Two types to worry about:
  – At startup time
    • static allocation based on the environment
  – During run time
    • i.e. migration of tasks

Scheduling
• Migration during run time:
  – system-level migration
    • Special support needed (Condor)
  – User level
    • checkpoints / restarts
    • Change the work load allocated (bag of tasks - sketched below)
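The bag-of-tasks item deserves a sketch. A minimal MPI master/worker version might look like this; the task count, tags and work function are illustrative, and it assumes at least as many tasks as workers:

#include <stdio.h>
#include <mpi.h>

#define NTASKS   100      /* size of the bag; assumed >= number of workers */
#define TAG_WORK 1
#define TAG_STOP 2

/* Stand-in for the real computation on one task. */
static double do_task(int t) { return (double)(t * t); }

int main(int argc, char **argv) {
    int rank, size;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    if (rank == 0) {                        /* master: owns the bag */
        int next = 0, returned = 0;
        MPI_Status st;
        double result;
        for (int w = 1; w < size; w++) {    /* prime each worker */
            MPI_Send(&next, 1, MPI_INT, w, TAG_WORK, MPI_COMM_WORLD);
            next++;
        }
        while (returned < NTASKS) {         /* refill on demand */
            MPI_Recv(&result, 1, MPI_DOUBLE, MPI_ANY_SOURCE, 0,
                     MPI_COMM_WORLD, &st);
            returned++;
            int tag = (next < NTASKS) ? TAG_WORK : TAG_STOP;
            MPI_Send(&next, 1, MPI_INT, st.MPI_SOURCE, tag, MPI_COMM_WORLD);
            if (tag == TAG_WORK) next++;
        }
    } else {                                /* worker: pull until told to stop */
        MPI_Status st;
        int t;
        for (;;) {
            MPI_Recv(&t, 1, MPI_INT, 0, MPI_ANY_TAG, MPI_COMM_WORLD, &st);
            if (st.MPI_TAG == TAG_STOP) break;
            double r = do_task(t);
            MPI_Send(&r, 1, MPI_DOUBLE, 0, 0, MPI_COMM_WORLD);
        }
    }
    MPI_Finalize();
    return 0;
}

Because workers ask for the next task only when they finish one, faster or less loaded machines automatically take a larger share of the bag - a simple form of run-time load balancing without migration.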

Task allocation in PVM - i.e. pvm_spawn()
• Before improving on it, we had to figure out how it worked: it wasn't random but round-robin (see the sketch after the next slide).
• Load
  – The machine load average is not a good metric, but without more specific help from the kernel it would have to do.
  – MPI_Spawn has the same problems.

What is spare (what is even machine load?)
• Condor people claimed 10% utilisation for their systems.
  – At Reading it was more like 40-60% all the time.
• Aimed at using spare capacity
  – but what spare capacity??
• Defined user classes and loading based on observations of the Reading University (UK) system over a year...
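The round-robin placement mentioned above amounts to something like the following hypothetical helper (not actual PVM source):

/* Each spawn request takes the next host in the virtual machine's
   host table, wrapping around - regardless of how loaded it is. */
static int next_host = 0;

int pick_host(int nhosts) {
    int h = next_host;
    next_host = (next_host + 1) % nhosts;
    return h;
}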

Typical loading
(figure)

Better Spawn
• Added checks for load before starting.
  – Based on two methods: a central RPC server and distributed monitor daemons.
  – Checked the effects of this system on startup performance and the accuracy of placement.
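A minimal version of the load check, as it might run on each candidate host; getloadavg() is a real Unix call, but the 1.0 threshold is an arbitrary assumption and the central-RPC/monitor-daemon plumbing is omitted:

#include <stdlib.h>

/* Decide whether this host should accept a new task, based on its
   1-minute load average. */
int host_is_spare(void) {
    double avg[3];
    if (getloadavg(avg, 3) < 0)
        return 0;               /* can't tell, so decline the task */
    return avg[0] < 1.0;        /* roughly: less than one runnable job */
}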

Application Performance
(figure)

User level migration
(figure)

User level migration
(figure)

Back to load or not
• The load was not always a good measure, and soon the network was very heterogeneous…
• Could you use a benchmark to find the real load?

Modified spawn using effective speed-up
• Takes into account benchmark and loading
  – Helps on heterogeneous networks
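One plausible way to fold the benchmark figure and the current load into a single "effective speed" per host; the exact formula the modified spawn used is not shown in these slides, so the scaling rule below is an assumption:

/* Effective speed = benchmark rating scaled down by the run queue:
   a 2.0-rated host with load 1.0 ties a 1.0-rated idle host. */
struct host { const char *name; double bench; double load; };

int best_host(const struct host *h, int n) {
    int best = 0;
    double best_speed = h[0].bench / (1.0 + h[0].load);
    for (int i = 1; i < n; i++) {
        double s = h[i].bench / (1.0 + h[i].load);
        if (s > best_speed) { best_speed = s; best = i; }
    }
    return best;                /* index of the host to spawn on */
}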

Very advanced Schedulers
(figure)

Fault Tolerance
• Multiple methods
  – System level
    • from a checkpoint file
  – User level
    • from a data checkpoint
• How do we detect the failure?
  – Notify message and/or time-out (see the pvm_notify sketch after the project list)

Meta-computing/GRID Projects
• Legion - University of Virginia
  – Object based
• Wide Area Metacomputer Manager (WAMM) - Italy
  – Grew from experiments with PVM
• Wide Area Network Environment (WANE) - Florida State University
• WebFlow and MetaWeb - Syracuse University
• DISCWorld - University of Adelaide
• Globus - Argonne National Lab and ISI
  – Tool kit
• SNIPE - UTK/ORNL
• HARNESS - UTK/ORNL/Emory
• Globe - Vrije Universiteit Amsterdam
  – See http://www.cs.vu.nl/~steen/globe/
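The notify-message detection method from the Fault Tolerance slide can be sketched with PVM's pvm_notify() call (a real PVM 3 routine); the message tag 99 and the respawn policy are illustrative:

#include <stdio.h>
#include <pvm3.h>

/* Watch a set of already-spawned workers; tids[] and ntask are
   assumed to have been filled in by an earlier pvm_spawn(). */
void watch_workers(int *tids, int ntask) {
    int dead_tid;
    /* Ask the PVM system to send us a message tagged 99 whenever
       one of these tasks exits or is killed. */
    pvm_notify(PvmTaskExit, 99, ntask, tids);

    for (int left = ntask; left > 0; left--) {
        pvm_recv(-1, 99);            /* block until a notify arrives */
        pvm_upkint(&dead_tid, 1, 1); /* message body holds the dead tid */
        fprintf(stderr, "task t%x failed or exited - respawn it?\n", dead_tid);
    }
}

(The time-out alternative would use pvm_trecv() with a timeout instead of blocking in pvm_recv().)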
