Grid and Cloud Computing
Total Page:16
File Type:pdf, Size:1020Kb
CS6703 – GRID AND CLOUD COMPUTING M.I.E.T. ENGINEERING COLLEGE (Approved by AICTE and Affiliated to Anna University Chennai) TRICHY PUDUKKOTTAI ROAD, TIRUCHIRAPPALLI 620 007 – – DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING COURSE MATERIAL CS6703-GRID AND CLOUD COMPUTING M.I.E.T./CSE/IV YR/ GRID AND CLOUD COMPUTING CS6703 – GRID AND CLOUD COMPUTING M.I.E.T. ENGINEERING COLLEGE (Approved by AICTE and Affiliated to Anna University Chennai) TRICHY – PUDUKKOTTAI ROAD, TIRUCHIRAPPALLI – 620 007 DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING SYLLABUS (THEORY) Sub. Code : CS6703 Branch/Year/Sem : CSE/IV/VII Sub Name : GRID AND CLOUD COMPUTING Batch : 2015-2019 Staff Name : A.BARVEEN Academic Year : 2018-2019 L T P C 3 0 0 3 UNIT I INTRODUCTION 9 Evolution of Distributed computing: Scalable computing over the Internet Technologies for network based systems clusters of cooperative computers- Grid computing– Infrastructures cloud computing - service– oriented architecture Introduction to Grid Architecture and standard– s Elements of Grid Overview of Grid Architecture.– – UNIT II GRID– SERVICE 9 Introduction to Open Grid Services Architecture (OGSA) Motivation Functionality Requirements Practical & Detailed view of OGSA/OGSI Data –intensive grid servi– ce models OGSA services. – – – UNIT III VIRTUALIZATION 9 Cloud deployment models: public, private, hybrid, community Categories of cloud computing: Everything as a service: Infrastructure, platform, software -– Pros and Cons of cloud computing Implementation levels of virtualization virtualization structure virtualization of CPU,– Memory and I/O devices virtual clusters– and Resource Management – Virtualization for data center automation. – – UNIT IV PROGRAMMING MODEL 9 Open source grid middleware packages Globus Toolkit (GT4) Architecture , Configuration Usage of Globus Main components an– d Programming model - Introduction to Hadoop– Framework - Map–reduce, Input splitting, map and reduce functions, specifying input and output parameters, configuring and running a job Design of Hadoop file system, HDFS concepts, command line and java interface, dataflow of F–ile read & File write. UNIT V SECURITY 9 Trust models for Grid security environment Authentication and Authorization methods Grid security infrastructure Cloud Infrastructure secu– rity: network, host and application level aspects– of data security, provider– data and its security, Identity and access management architecture,– IAM practices in the cloud, SaaS, PaaS, IaaS availability in the cloud, Key privacy issues in the cloud. TOTAL: 45 PERIODS M.I.E.T./CSE/IV YR/ GRID AND CLOUD COMPUTING CS6703 – GRID AND CLOUD COMPUTING TEXT BOOK: 1. Kai Hwang, Geoffery C. Fox and Jack J. Dongarra, stributed and Cloud Computing: Clusters, Grids, Clouds and the Future of InternetDi, First Edition, Morgan Kaufman Publisher, an Imprint of Elsevier, 2012. REFERENCES: 1. Jason Venner, Pro Hadoop- Build Scalable, Distributed Applications in the Cloud , A Press, 2009 2. Tom White, adoop The Definitive Guide , First Edition. O Reilly, 2009. 3. Bart Jacob (Editor),H Introduction to Grid Computing , IBM ’Red Books, Vervante, 2005 nd 4. Ian Foster, Carl Kesselman, The Grid: Blueprint for a New Computing Infrastructure , 2 Edition, Morgan Kaufmann. 5. Frederic Magoules and Jie Pan, Introduction to Grid Computing CRC Press, 2009. 6. Daniel Minoli, A Networking App roach to Grid Computing , John Wiley Publication, 2005. 7. Barry Wilkinson, Grid Computing: Techniques and Applications , Chapman and Hall, CRC, Taylor and Francis Group, 2010. SUBJECT IN-CHARGE HOD/CSE M.I.E.T./CSE/IV YR/ GRID AND CLOUD COMPUTING CS6703 – GRID AND CLOUD COMPUTING M.I.E.T. ENGINEERING COLLEGE (Approved by AICTE and Affiliated to Anna University Chennai) TRICHY – PUDUKKOTTAI ROAD, TIRUCHIRAPPALLI – 620 007 DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING Sub. Code : CS6703 Branch/Year/Sem : CSE/IV/VII Sub Name : GRID AND CLOUD COMPUTING Batch : 2015-2019 Staff Name : A.BARVEEN Academic Year : 2018-2019 COURSE OBJECTIVE 1. Understand how Grid computing helps in solving large scale scientific problems. 2. Gain knowledge on the concept of virtualization that is fundamental to cloud computing. 3. Learn how to program the grid and the cloud 4. Understand the security issues in the grid and the cloud environment. COURSE OUTCOMES 1. Understand the concept of distributed computing. 2. Apply grid computing techniques. 3. Understand the concept of virtualization. 4. Use grid and cloud tool kits to develop the applications. 5. Apply the security models in the grid and cloud environment 6. Design and develop a private cloud environment with security enhanced. Prepared by Approved by Verified By STAFF NAME PRINCIPAL HOD (A.BARVEEN) M.I.E.T./CSE/IV YR/ GRID AND CLOUD COMPUTING CS6703 – GRID AND CLOUD COMPUTING M.I.E.T. ENGINEERING COLLEGE (Approved by AICTE and Affiliated to Anna University Chennai) TRICHY – PUDUKKOTTAI ROAD, TIRUCHIRAPPALLI – 620 007 UNIT-I INTRODUCTION: 1.1 Evolution of Distributed computing: 1.1.1 Scalable computing over the Internet: Data Deluge enabling new challenges: From Desktop/HPC/Grids to Internet Clouds in 30 Years HPC moving from centralized supercomputers to geographically distributed desktops, desk sides, clusters, and grids to clouds over last 30 years Location of computing infrastructure in areas with lower costs in hardware, software, datasets, space, and power requirements – moving from desktop computing to datacenter-based clouds Interactions among 4 technical challenges: Data Deluge, Cloud Technology, eScience, and Multicore/Parallel Computing Clouds and Internet of Things M.I.E.T./CSE/IV YR/ GRID AND CLOUD COMPUTING CS6703 – GRID AND CLOUD COMPUTING Computing Paradigm Distinctions Centralized Computing All computer resources are centralized in one physical system. Parallel Computing All processors are either tightly coupled with central shared memory or loosely coupled with distributed memory Distributed Computing A distributed system consists of multiple autonomous computers, each with its own private memory, communicating over a network. Cloud Computing An Internet cloud of resources that may be either centralized or decentralized. The cloud apples to parallel or distributed computing or both. Clouds may be built from physical or virtualized resources. Technology Convergence toward HPC for Science and HTC for Business: Utility Computing M.I.E.T./CSE/IV YR/ GRID AND CLOUD COMPUTING CS6703 – GRID AND CLOUD COMPUTING 1.2 Technologies for Network-based Systems 33 year Improvement in Processor and Network Technologies Modern Multi-core CPU Chip Multi-threading Processors Four-issue superscalar (e.g. Sun Ultras arc I) Implements instruction level parallelism (ILP) within a single processor. Executes more than one instruction during a clock cycle by sending multiple instructions to redundant functional units. Fine-grain multithreaded processor Switch threads after each cycle Interleave instruction execution If one thread stalls, others are executed Coarse-grain multithreaded processor Executes a single thread until it reaches certain situations Simultaneous multithread processor (SMT) Instructions from more than one thread can execute in any given pipeline stage at a time. M.I.E.T./CSE/IV YR/ GRID AND CLOUD COMPUTING CS6703 – GRID AND CLOUD COMPUTING 5 Micro-architectures of CPUs Each row represents the issue slots for a single execution cycle: • A filled box indicates that the processor found an instruction to execute in that issue slot on that cycle; An empty box denotes an unused slot. 33 year Improvement in Memory and Disk Technologies Architecture of A Many-Core Multiprocessor GPU interacting with a CPU Processor M.I.E.T./CSE/IV YR/ GRID AND CLOUD COMPUTING CS6703 – GRID AND CLOUD COMPUTING GPU Performance Bottom – CPU - 0.8 Gflops/W/Core (2011) Middle – GPU - 5 Gflops/W/Core (2011) Top - EF – Exascale computing (10^18 Flops) Interconnection Networks • SAN (storage area network) - connects servers with disk arrays • LAN (local area network) – connects clients, hosts, and servers • NAS (network attached storage) – connects clients with large storage systems M.I.E.T./CSE/IV YR/ GRID AND CLOUD COMPUTING CS6703 – GRID AND CLOUD COMPUTING Datacenter and Server Cost Distribution: Virtual Machines Eliminate real machine constraint Increases portability and flexibility Virtual machine adds software to a physical machine to give it the appearance of a different platform or multiple platforms. Benefits Cross platform compatibility Increase Security Enhance Performance Simplify software migration Initial Hardware Model All applications access hardware resources (i.e. memory, i/o) through system calls to operating system (privileged instructions) Advantages Design is decoupled (i.e. OS people can develop OS separate of Hardware people developing hardware) Hardware and software can be upgraded without notifying the Application programs M.I.E.T./CSE/IV YR/ GRID AND CLOUD COMPUTING CS6703 – GRID AND CLOUD COMPUTING Disadvantage Application compiled on one ISA will not run on another ISA. Applications compiled for Mac use different operating system calls then application designed for windows. ISA’s must support old software . Can often be inhibiting in terms of performance Since software is developed separately from hardware… Software is not necessarily optimized for hardware. Virtual Machine Basics Virtual software placed between underlying machine and conventional software Conventional software sees different ISA from