


HIGH PERFORMANCE COMPUTING APPLIED TO CLOUD COMPUTING

Li Luxingzi

Thesis
Technology, Communication and Transport
Degree Programme in Information Technology

2015

Abstract of thesis

Technology, Communication and Transport
Degree Programme in Information Technology

Author            Li Luxingzi
Year              2015
Supervisor        Veikko Keränen
Title of thesis   High Performance Computing Applied to Cloud Computing
No. of pages      48

The purpose of this thesis was to introduce high performance computing and cloud computing. The purpose was also to describe how to apply high performance computing to cloud computing as well as its possibilities and challenges.

There were two case studies in the thesis project to present the application of cloud computing. Both quantitative and qualitative methods were used in this research. The majority of the materials were drawn from books and other published resources.

The thesis may be useful to students, teachers, and people who are interested in information technology, mathematics and science.

Key words high performance computing, cloud computing, cluster

CONTENTS

1 INTRODUCTION
2 HIGH PERFORMANCE COMPUTING
  2.1 Background of High Performance Computing
    2.1.1 Overview of High Performance Computing
    2.1.2 History of High Performance Computing
  2.2 High Performance Computing Architecture
    2.2.1 Five Major Parts of High Performance Computing System
    2.2.2 Building High Performance Computing Cluster
  2.3 Performance and Optimization
    2.3.1 Peak Performance
    2.3.2 Performance Improvements
  2.4 Quantitative Analysis of Performance of Linux High Performance Computing Cluster
    2.4.1 Metrics
    2.4.2 Arithmetical Operation Analysis of Linux Cluster System
  2.5 Top 500 List
  2.6 Parallel Computing
3 CLOUD COMPUTING
  3.1 Overview of Cloud Computing
    3.1.1 Definition of Cloud Computing
    3.1.2 Service Models
  3.2 Service-Oriented Infrastructure of Cloud Computing
    3.2.1 Service-Oriented Infrastructure Framework
    3.2.2 Cloud Modules of Service-Oriented Infrastructure Structure
  3.3 Typical Technologies
    3.3.1 Data Storage Technology
    3.3.2 Distributed Programming Technology
    3.3.3 Automatic Deployment Technology
  3.4 Case Study
    3.4.1 Cloud Computing to E-Commerce
    3.4.2 Information Ecosystem of Cloud Computing
4 HIGH PERFORMANCE COMPUTING APPLIED TO CLOUD COMPUTING
  4.1 Comparison of the Application Profiles of Cloud Computing and High Performance Computing
  4.2 High Performance Computing Deployments in Cloud Computing Environments
5 CONCLUSION
REFERENCES

LIST OF FIGURES

Figure 1. Five Major Parts in Distributed-Memory High Performance Computing System (Gerber 2012)
Figure 2. Building High Performance Computing Cluster
Figure 3. Processor Pipelining (Jacob 2015)
Figure 4. Prototypical Linux Cluster
Figure 5. Top 10 Ranking (Wikimedia Foundation, Inc. 2015)
Figure 6. Performance Development (TOP500.org. 2014)
Figure 7. Country System and Vendor System Share (TOP500.org. 2014)
Figure 8. Vertical Structure of a Superstep of Bulk Synchronous Parallel (Wikimedia Foundation, Inc. 2014)
Figure 9. Process of Parallel Solution
Figure 10. Deployment and Service Models of Cloud Computing
Figure 11. Service-Oriented Infrastructure Framework (Smoot-Tan 2012)
Figure 12. The Control Flow and the Data Flow
Figure 13. Automatic Deployment Framework
Figure 14. Information Ecosystem of Cloud Computing
Figure 15. Internet Solution of the Integration of High Performance Computing and Cloud Computing (H3C Technologies Co. 2015)
Figure 16. Deployment of the IRF II (The Second Generation of Intelligent Resilient Framework) (H3C Technologies Co. 2015)

Table 1. Example of Hardware and Software Configuration (Based on 168-Node IBM Server)
Table 2. Example of Security Policy of Infrastructure Cloud
Table 3. Differences of Google File System and the General Distributed File System


1 INTRODUCTION

Nowadays, high performance computing is increasingly important for economic and technological development. High performance computing has also become an indicator used to measure the power of a country. Therefore, it is important and meaningful to improve the performance and universality of high performance computing.

In addition, cloud computing can provide users with easier computing services through various hardware devices. Applying high performance computing to cloud computing has become a meaningful challenge for many scientists and professionals. The current research results are presented in this thesis.

The purpose of the thesis is to introduce high performance computing and cloud computing. The architecture and performance of high performance computing are described in Chapter 2. The definition of cloud computing and its typical technologies are presented in Chapter 3. There are two case studies in the thesis project as examples of the application of cloud computing. Quantitative as well as qualitative analysis methods are used in the research.

2 HIGH PERFORMANCE COMPUTING

2.1 Background of High Performance Computing

2.1.1 Overview of High Performance Computing

High performance computing (HPC) refers to a computing system consisting of several processors within a single machine, or a cluster of several computers used as a single resource. High performance computing owes its high computing speed to its great ability to process information, and therefore the main methodology currently applied to it is parallel computing. In short, high performance computing is known for its processing capacity. For instance, the latest analysis shows that such machines can perform 10^15 floating point operations per second. (Jones 2011.)

In a mesh network, the structure of the system can improve the speed of host communication by shortening the physical and logical distance between the network nodes. Although the network topology and hardware play a necessary role in a high performance computing system, it is the operating system and application software that makes the system so effective and usable. A control node, the interface between the system and the client computers, manages the workload. (Gerber 2012.)

There are two models for task execution in high performance computing environments: SIMD (Single Instruction Multiple Data) and MIMD (Multiple Instruction Multiple Data). SIMD executes the same computing instructions and operations across multiple processes at the same time. MIMD uses multiple processors to asynchronously control multiple instructions, achieving space parallelism. However, no matter which model is employed, the principle of a high performance system is consistent. The high performance unit (several processors within a single machine or a cluster of several computers) is treated as a single computational resource, and requests are distributed to its various nodes. The high performance computing solution is an independent unit that is specifically designed and deployed as a powerful computing resource. (Gerber 2012.)
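
The difference can be illustrated with a minimal single-machine Python sketch (not taken from the thesis sources): a vectorized NumPy expression applies one instruction to many data elements in an SIMD style, while a process pool runs several independent instruction streams at once in an MIMD style. The workload below is an invented example.

```python
# SIMD vs. MIMD illustration (a minimal sketch, not part of the thesis sources).
import numpy as np
from multiprocessing import Pool

def simd_style(data: np.ndarray) -> np.ndarray:
    # One instruction (a vectorized multiply-add) applied to many data elements.
    return 2.0 * data + 1.0

def independent_task(seed: int) -> float:
    # Each worker runs its own instruction stream on its own data (MIMD-style).
    rng = np.random.default_rng(seed)
    return float(rng.standard_normal(100_000).sum())

if __name__ == "__main__":
    print(simd_style(np.arange(8, dtype=float)))
    with Pool(processes=4) as pool:
        print(pool.map(independent_task, range(4)))
```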

2.1.2 History of High Performance Computing

The birth of high performance computing can be traced back to the start of commercial computing in the 1950s. At that time, the only type of commercially available computing was mainframe computing. One of the main tasks required was billing, a task that almost every type of business needs to perform and that is conveniently run as a batch process. Batch processing allows a sequence of several programs or “jobs” to be run without manual intervention: once a job has completed, another job immediately starts. Since no interaction with the administrator is required, jobs are executed in sequence without the delays created by human interaction, which saves the processing time normally wasted on it. The second benefit of batch processing is that jobs can be processed in shifts, allowing the more interactive or urgent processes to run during the day shift and billing or non-interactive jobs to run during the night shift. The language used to control batch processing is referred to as JCL (Job Control Language). (E-sciencecity.org. 2015.)

In the 1970s, the manufacturers of supercomputers shifted computer models towards personal computing, increasing the performance of personal computers. After the advent of the CRAY-1 supercomputer in 1976, vector computing took over the high performance marketplace for 15 years. CRAY-1 used RISC (Reduced Instruction Set Computer) processors and vector registers to perform vector computing. In the late 1980s, IBM connected RISC processors by using the butterfly interconnection network. This allowed developers to create systems with consistent caches for both processing and data storage. (E-sciencecity.org. 2015.)

DASH (Directory Architecture for Shared Memory) was proposed by Stanford University in the beginning of the 1990s. DASH achieved consistency of the distributed shared memory cache by maintaining a directory structure for the data in each cache location. Since then, several major architectures have begun to be mixed together. Today, more and more parallel computer systems use commercial microprocessors and an interconnection network structure; this kind of parallel computer system is known as a cluster. Parallel computers have entered a new era of unprecedented development. (E-sciencecity.org. 2015.)

2.2 High Performance Computing Architecture

2.2.1 Five Major Parts of High Performance Computing System

There are five major elements of a high performance computing system: CPUs, memory, nodes, the inter-node network and non-volatile storage (disks, tape). Currently, single-core CPUs (processors) are no longer used. Instead, multi-core units (multiple ‘cores’ on a single ‘chip’) mounted on the motherboard constitute the CPUs (processors), and the trend towards even more ‘cores’ per unit will continue for several reasons. The node plays a significant role in physically interconnecting CPUs, memory, interfaces, devices and other nodes. Distributed memory is also important for a high performance computing system. Switched and mesh networks are the two main network types used in high performance computing systems. Figure 1 illustrates the five parts of the system and the relationships between them. (Gerber 2012.)


Figure 1. Five Major Parts in Distributed-Memory High Performance Computing System (Gerber 2012)

2.2.2 Building High Performance Computing Cluster

An application running on a high performance cluster usually uses parallel algorithms. To be specific, a large task is divided into several sub-tasks that are computed on different nodes within the cluster. The processed data resulting from the sub-tasks is then combined into the end result of the original task (see the sketch after Figure 2). Since these small sub-tasks can generally be done in parallel, the processing time is greatly shortened. In order to build a high performance computing cluster, careful consideration should be given to the design of several elements, namely node deployment, network interconnection and cluster management. Their characteristics should be tailored to the specific application requirements. (Buyya 1999.)

Figure 2. Building High Performance Computing Cluster
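
A minimal single-machine sketch of this divide-and-combine pattern is given below, with worker processes standing in for cluster nodes. The partitioning scheme and the summation workload are illustrative assumptions, not taken from the thesis.

```python
# Minimal sketch of the divide-and-combine pattern: a large task is split into
# sub-tasks, computed in parallel, and the partial results are combined.
from multiprocessing import Pool

def partial_sum(bounds):
    start, stop = bounds
    return sum(range(start, stop))            # the sub-task computed on one "node"

def parallel_sum(n_items: int, n_workers: int) -> int:
    chunk = n_items // n_workers
    bounds = [(i * chunk, (i + 1) * chunk if i < n_workers - 1 else n_items)
              for i in range(n_workers)]
    with Pool(processes=n_workers) as pool:
        partials = pool.map(partial_sum, bounds)   # sub-tasks run in parallel
    return sum(partials)                           # combine into the end result

if __name__ == "__main__":
    assert parallel_sum(1_000_000, 4) == sum(range(1_000_000))
```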

There are five different types of nodes: the user node, control node, management node, storage node and computing nodes. The user node is the only gateway for outsiders to access the cluster system; users usually need to log in from this node to compile and run their tasks. Fault tolerance is achieved with hardware redundancy, which should be designed into the system to ensure high availability of the user node. The control node mainly takes responsibility for supplying the computing nodes with basic network services, such as DHCP (Dynamic Host Configuration Protocol), DNS (Domain Name System) and NFS (Network File System), and for dispatching tasks to the computing nodes. For example, the cluster job scheduler normally runs on this node. Moreover, the control node determines the availability of the network: if the control node suffers a catastrophic failure, all the computing nodes will also be out of operation. (Kalcher 2007.)

The management node controls various management tasks in the cluster system; thus, the management software of the cluster also runs on this node. The storage node acts as the data storage and data server for the cluster system. A single storage node is not enough when the data volumes are in the TB (terabyte) range, so storage networks are also required. Storage nodes typically require the following configuration: a RAID (Redundant Array of Independent Disks) server to guarantee data security, and a high-speed network to ensure adequate speed for data transmission. Computing nodes are the core of the cluster computing system, and their function is to compute. The exact configuration depends on the application requirements and the fiscal budget. A blade server (a thin electronic circuit board module) is the ideal choice for a computing node because of its low physical height, which permits a high stacking density in an air-conditioned machine room cabinet. Blade servers also have low power consumption, simple installation, and many other favorable characteristics. (Kalcher 2007.)

Since the network directly affects the capacity and performance of the entire high performance system, the network is the most significant part of the cluster. According to the survey, the key characteristics of a high speed network are low latency and high bandwidth between all nodes. InfiniBand has drawn the HPC industry’s attention with its high bandwidth and low latency. InfiniBand is a technology developed by the InfiniBand Trade Association. The InfiniBand solution includes multiple independent processors which are interconnected with a system network of I/O devices. An InfiniBand system can range from a small processor with only a few I/O devices to a large-scale parallel supercomputer; it is a scalable solution. (Kalcher 2007.)

The products of the Norwegian company Scali are cluster management applications based upon the Linux operating system, which use a graphical management interface. The combination of high performance communication middleware and integrated software tools from third parties allows the system to easily allocate tasks and monitor the workload on each node in the cluster. Developing application software through the range of software interfaces provided reduces the development time and cost of the entire system, and ensures flexibility in system configuration and upgrades. The most significant feature of the products is the support of a variety of high speed interconnection networks, from Gigabit Ethernet, SCI (Scalable Coherent Interface) and Myrinet to InfiniBand. (Kalcher 2007.)


Table 1. Example of Hardware and Software Configuration (Based on 168-Node IBM Server)

Hardware

No.  Subject                          Type           Quantity
1.   Computing Node                   IBM BCH        12
2.   Computing Node                   IBM HS22       168
3.   Control Node                     IBM X3650M3    2
4.   Storage Node                     IBM X3650M3    4
5.   Disk Array                       IBM DS3500     1
6.   High-speed Network               IS5035         1
7.   Server Cabinet                   IBM T42        5
8.   Switch                           DGS-3200-24    1
9.   FC Storage Switch                Brocade-300    1

Software

No.  Subject                          Type             Quantity
1.   Operating System                 RHEL 5           168
2.   IBM Cluster Management Software  XCAT             1
3.   IBM Network Management           Director         1
4.   Job Management Software          Open PBS         1
5.   Parallel File System             (not specified)  1


2.3 Performance and Optimization

2.3.1 Peak Performance

All kinds of high performance computing facilities ought to be based on the specific needs of the enterprise. Every high performance computing application must also be specially optimized, which is completely different from traditional requirements. This method of optimization combines high performance computing with application-level peak performance. (Jones 2011.)

“The theoretical maximum performance (usually measured in terms of 64-bit (double precision, on most architectures) floating point operations per second) is achievable by a computing system.” (Jones 2011.)

The peak performance of a system is calculated by multiplying the clock rate by the number of floating point operations completed per clock cycle. For example, 2.5 GHz multiplied by 4 floating-point operations per clock cycle equals 10 GFLOPS (billion floating-point operations per second), a value that is rarely achieved in practice. The theoretical peak performance thus gives an upper bound that rules out unrealistic values, and it additionally offers a reference for comparing various platforms. (Jones 2011.)
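
A small sketch of this calculation is given below. The core-count parameter is an illustrative addition and is not part of the text.

```python
# Theoretical peak performance, following the rule described above:
# peak = clock rate x floating-point operations per cycle (x cores, optionally).
def peak_gflops(clock_ghz: float, flops_per_cycle: int, cores: int = 1) -> float:
    return clock_ghz * flops_per_cycle * cores

if __name__ == "__main__":
    # The example from the text: 2.5 GHz x 4 FLOPs per cycle = 10 GFLOPS per core.
    print(peak_gflops(2.5, 4))           # 10.0
    # A hypothetical 8-core node of the same design (assumed, for illustration).
    print(peak_gflops(2.5, 4, cores=8))  # 80.0
```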

2.3.2 Performance Improvements

Firstly, an appropriate type of memory ought to be chosen. There are three types of DIMM (Dual In-line Memory Module) available: UDIMM (Unbuffered Dual In-line Memory Module), RDIMM (Registered Dual In-line Memory Module), and LRDIMM (Load-Reduced Dual In-line Memory Module). UDIMM memory is intended for handling large-scale workloads and is fast and low-cost, but less stable. RDIMM memory is stable, with reasonable extensibility, but expensive, and it is commonly used in traditional servers. LRDIMM memory is a good replacement for RDIMM, offering high memory speed while reducing the load on the server memory bus and lowering power dissipation. (Kohlmeyer 2010.)

Secondly, pipelining should be introduced to upgrade the performance of a high performance computing system. Within a central processor, the functional units consist of ‘Inst Fetch’, ‘Inst Decode’, ‘Execution’, ‘Memory’, and ‘Write back’ (see Figure 3). Using a pipeline can improve their utilization, allowing the processor to be clocked faster. Figure 3 describes the pipeline stages within the clock cycles. Without pipelining, an instruction is completed in five cycles before the next one starts. As is shown in Figure 3, five clock cycles are required to execute each instruction; with pipelining, one instruction is completed per clock cycle. (Kohlmeyer 2010.)

Figure 3. Processor Pipelining (Jacob 2015)

When the delay time of each stage is the same, the acceleration provided by the pipeline can be analyzed. The pipeline speedup equals the non-pipelined time divided by the pipelined time. Assume the number of program instructions is N; a non-pipelined five-stage processor then needs 5N cycles, while the pipelined processor needs 4 + N cycles. The speedup is therefore (5N) / (4 + N), which is approximately equal to 5 as N tends to infinity. In general, a pipeline with P stages may run up to P times faster than a non-pipelined processor. (Jacob 2015.)
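
The speedup formula can be checked with a short sketch (illustrative only; the instruction counts are arbitrary):

```python
# Pipeline speedup from the argument above: a non-pipelined 5-stage processor
# needs 5*N cycles for N instructions, a pipelined one needs roughly 4 + N.
def pipeline_speedup(n_instructions: int, stages: int = 5) -> float:
    non_pipelined = stages * n_instructions
    pipelined = (stages - 1) + n_instructions
    return non_pipelined / pipelined

if __name__ == "__main__":
    for n in (10, 100, 10_000, 1_000_000):
        print(n, round(pipeline_speedup(n), 3))   # approaches 5 as N grows
```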

Thirdly, one distinction between high performance computing systems and traditional data center infrastructure is the choice between ready-made tools and a customized system. A ready-made system will not necessarily be fully extendable, limiting future development. On the other hand, custom systems remain open, allowing enterprises to extend them better in the future. However, custom systems can add extra functionality only at additional cost, something that is usually not possible at all with a ready-made system. (Jacob 2015.)

Fourthly, there are some other ways to improve the performance. Dividing the information flow into parallel flows also contributes to increased running speed. Once multiple cores are implemented, the pipelined and super-scalar CPUs remove the limitation imposed by the maximum CPU clock rate. Moreover, the enterprise ought to take responsibility for maintaining the consistency of the system: when inconsistency appears within the cluster, the administrator may notice abnormal changes that affect application performance. Considering the potential performance, the IT department requires well-considered strategies to determine the kind of applications that run on the high performance computing system. Furthermore, an efficient data architecture and cooling system become important attributes because of the high density.

2.4 Quantitative Analysis of Performance of Linux High Performance Computing Cluster

2.4.1 Metrics

Performance, performance/watt, performance/square foot and performance/dollar are obviously important to a high performance computing cluster. Running such a system is usually restricted by the energy consumption (watts) and volume (square feet) of the servers. Therefore, these two elements, energy consumption and volume, are included in the total cost of ownership (TCO). Keeping the total cost of ownership under control creates more economic profit. In addition, performance density (the performance delivered by a cluster of a certain volume) and TCO are attached great importance in practice. (Barragy 2007.)

Here the performance is defined as a calculation rate, such as the workload completed per day or FLOPS (floating-point operations per second). The time to complete a given workload is directly related to this rate. Thus, performance is measured by running a workload and converting the completion time into the required rate. (Barragy 2007.)


2.4.2 Arithmetical Operation Analysis of Linux Cluster System

At a qualitative level, advanced processors, high-capacity memory, an excellent network and a good input/output disk subsystem achieve the best performance optimization. However, when purchasing Linux cluster facilities under a certain TCO (total cost of ownership), a quantitative analysis should be conducted. The model of a Linux high performance computing cluster includes four types of major hardware components. (Barragy 2007.)

1. Computing nodes or servers to run the workload
2. A head node for cluster management
3. Interconnecting cables and GBE (Gigabit Ethernet)
4. A global storage system

Figure 4. Prototypical Linux Cluster

Time = Tnode + Tfabric + Tstorage (1)

The time to complete a given workload approximately equals the sum of the times spent in the independent subsystems. Time here refers to the time to complete a given workload. Tnode refers to the completion time spent on the computing nodes. Tfabric refers to the completion time spent on the node interconnect. Tstorage refers to the completion time of access to the LAN (local area network) and the global storage system. (Barragy 2007.)


Tnode = Tcore + Tmemory (2)

The completion time on the computing nodes equals the sum of the time spent in their independent subsystems. Tcore refers to the completion time in the processor core, and Tmemory refers to the completion time of access to main memory. This model is really useful for single-CPU computing nodes and can be simply extended to symmetric multiprocessing computing nodes. Relating the completion time of each subsystem to the physical configuration parameters of the computing nodes, for example the processor speed and memory speed, makes this model more useful. (Barragy 2007.)

CPI = CPI0 + MPI * PPM (3)

CPI refers to the cycles per instruction of the processor when executing the workload. CPI0 is the core CPI, i.e. the cycles per instruction when the data is found in cache. MPI refers to the number of cache misses per instruction for the workload. PPM refers to the penalty per cache miss, measured in processor clock cycles. Assume the number of instructions is P, and that (1 / fcore) is the processor cycle time, fcore being the processor frequency. Multiplying expression (3) by P and (1 / fcore) yields expression (4). (Barragy 2007.)

Tnode = (CPI0 * P) * (1 / fcore) + (MPI * P) * PPM * (1 / fcore) (4)

(CPI0 * P) is the number of processor cycles spent executing the instructions themselves; for a given workload running on a given microprocessor it is a constant, named α. The same holds for (MPI * P), which is named M(MBcache) because it depends on the cache size once the workload and system architecture are fixed. PPM is the cost of access to memory. For a given workload it is usually a constant, named C, expressed in bus cycles; multiplying it by (fcore / fbus) converts bus cycles into processor cycles. (Barragy 2007.)

PPM = C * (fcore / fbus)


Tnode = α * (1 / fcore) + M * (1 / fbus) (5)

Tnode = α * (1 / fcore) + β (6)

These expressions assume that the bus frequency is constant. It follows that Tcore = α * (1 / fcore) and Tmemory = β = M * (1 / fbus). As expressions (5) and (6) show, it is the system core and bus frequencies that determine the system performance. (Barragy 2007.)
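
A small numerical sketch of this model is given below. The values of α, M and the frequencies are invented for illustration and are not taken from the thesis.

```python
# Sketch of expression (5): Tnode = alpha * (1/fcore) + M * (1/fbus), where
# alpha = CPI0 * P (core cycles for the workload) and M captures memory traffic.
def t_node(alpha: float, m: float, f_core: float, f_bus: float) -> float:
    return alpha * (1.0 / f_core) + m * (1.0 / f_bus)   # expression (5)

if __name__ == "__main__":
    f_core, f_bus = 2.5e9, 1.0e9     # 2.5 GHz core, 1 GHz bus (assumed values)
    alpha = 1.2e12                   # assumed core-cycle count for the workload
    m = 4.0e10                       # assumed memory term M (bus cycles)
    print(f"Tnode = {t_node(alpha, m, f_core, f_bus):.1f} s")
    # Doubling the core frequency only shrinks the compute term, not the memory term:
    print(f"Tnode = {t_node(alpha, m, 2 * f_core, f_bus):.1f} s")
```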

2.5 Top 500 List

The Top 500 project is an international ranking of the most powerful computing systems. It has been published twice a year since 1993. The first ranking of each year is released in June at the International Supercomputing Conference in Germany, and the second is released in November at the Supercomputing Conference in the United States. The project aims to provide a reliable foundation for tracing the development trends of high performance computing. (TOP500.org. 2014.)


Figure 5. Top 10 Ranking (Wikimedia Foundation, Inc. 2015)

Systems in the list are ranked according to their Rmax value first, and then by the Rpeak value if two machines have an equal Rmax value. The Rmax value is measured with LINPACK. Jack Dongarra introduced LINPACK (Linear system package), which is a method to evaluate the performance of high performance computing systems. For the LINPACK run, the number of processors and cores is also required to be listed in the ranking. The operating system is normally Linux. (Wikimedia Foundation, Inc. 2015.)

Figure 6. Performance Development (TOP500.org. 2014)

According to the Top 500 organization’s latest report, announced in November 2014, the combined computing performance of the Top 500 reached 308.9 PFlops, whereas it was only 1.1 TFlops in 1993. In addition, in the latest report the last system ranked in the Top 500 reached 154.4 TFlops, while the corresponding figure was 1.2 TFlops in June 2005. However, the leader in the field, Tianhe-2 from P. R. China, has remained at the same performance (33.9 PFlops) since 2013. (TOP500.org. 2014.)

Figure 7. Country System and Vendor System Share (TOP500.org. 2014)

As for the national ranking, the United States still occupies nearly half of the places in the Top 500, with 231 systems (45.4%). The European countries (the United Kingdom, Germany, and France) make up 16.9% in total, with 5.9%, 5.9% and 5.1% respectively. There are 61 supercomputers from China ranked in the Top 500, accounting for 12%. The rest of the world, excluding the countries mentioned above, possesses 25.7% of the Top 500. Moreover, HP and IBM are still the leaders in the vendor system share: 179 and 153 supercomputers use HP and IBM systems respectively. However, Cray Inc. is dominant enough to deserve a mention here, ranking just behind HP and IBM in the vendor system share and making up 12.3%. Furthermore, in the Top 10, three supercomputers use Cray systems, four use IBM, and none uses HP. (TOP500.org. 2014.)

2.6 Parallel Computing

Generally speaking, high performance computing research studies parallel algorithms and develops the applications related to parallel computing. Parallel computing is a type of computation that executes multiple instructions at a time. Its purpose is to increase the computing speed and to solve complex computing tasks. There are two categories of parallelism: time parallelism and space parallelism. Time parallelism refers to pipelining, while space parallelism means concurrent computation by multiple processors. (Barney 2014.)

For example, pipelining may be compared to the assembly line technology in a factory. There are three steps on the assembly line: production, detection, and packaging. Without the assembly line, the next product can only be produced after the first product has completed all the steps, which costs more time and lowers the efficiency. This is time parallelism: executing two or more operations at the same time greatly improves computational performance. As for space parallelism, for instance, Tom needs to clean two rooms, which costs two hours. If his mother helps him, it only takes one hour to clean both rooms. This is the concept of space parallelism: splitting a large task into multiple similar sub-tasks to speed up the problem solving. (Barney 2014.)

There is no unified computing model for parallel computing, but there are several valuable reference models. The most common one is PRAM (parallel random access machine). The PRAM model is a shared-storage model of an SIMD (single instruction stream, multiple data) parallel machine. It assumes a shared memory of infinite capacity whose storage units can be accessed at any time by the processors. However, this model is usually used only for theoretical analysis, because storage of infinite capacity does not exist, and the model also ignores the impact of bandwidth. (Barney 2014.)

The other model is the BSP (bulk synchronous parallel) computing model, developed by Valiant from Harvard University and Bill McColl from the University of Oxford. BSP serves as a bridging model for researching parallel computing between hardware devices and applications. A parallel computer based on the BSP model consists of a set of memory units linked by a communication network. There are three main elements: a set of distributed processors with local memory, a global data communication network, and a mechanism that supports global synchronization between the process units. It is a theoretical model for general architectures and scalable parallel performance. (Barney 2014.)

Figure 8. Vertical Structure of a Superstep of Bulk Synchronous Parallel (Wikimedia Foundation, Inc. 2014)

There are some explanations for Figure 8. The processors in the figure refer to the processes of the parallel computation. The processors correspond to multiple nodes in the cluster, which means that each node can correspond to more than one processor. Additionally, local computation here is the computation performed by a single processor on its own separate unit of the data. It should be noted that each synchronization is both the end of one superstep (an iteration of the BSP algorithm) and the beginning of the next superstep. (Barney 2014.)
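
A minimal sketch of the superstep structure is given below, using Python threads as stand-ins for the processors; the workload (summing squared values on processor 0) and the message-passing scheme are illustrative assumptions, not a real BSP library.

```python
# Minimal sketch of BSP supersteps with threads standing in for processors:
# each superstep = local computation, then communication, then a barrier.
import threading

P = 4
barrier = threading.Barrier(P)
inbox = [[] for _ in range(P)]          # per-processor message queues
results = [0] * P

def worker(pid: int, value: int) -> None:
    # Superstep 1: local computation, then send the partial value to processor 0.
    local = value * value
    inbox[0].append(local)              # communication phase
    barrier.wait()                      # global synchronization ends the superstep
    # Superstep 2: processor 0 combines everything it received.
    if pid == 0:
        results[0] = sum(inbox[0])
    barrier.wait()

threads = [threading.Thread(target=worker, args=(i, i + 1)) for i in range(P)]
for t in threads: t.start()
for t in threads: t.join()
print(results[0])   # 1 + 4 + 9 + 16 = 30
```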

Figure 9. Process of Parallel Solution


3 CLOUD COMPUTING

3.1 Overview of Cloud Computing

3.1.1 Definition of Cloud Computing

Cloud computing is based on the utility-style delivery and consumption of Internet services, usually offering virtualized resources with dynamically extensible performance through the Internet. According to the definition of the National Institute of Standards and Technology, cloud computing is based upon a pay-per-use model and allows convenient, on-demand network access to a configurable, shared pool of computing resources. (Sridhar 2009.) These resources include networks, servers, storage, applications and services. The users can access the data center and carry out computing on demand with remote hosts and mobile devices, experiencing computing power on the order of ten trillion operations per second. (Lewis 2013.)

Cloud computing is like a huge resource pool, and the cloud computing services can be billed like water, electricity, or gas. There are five main benefits of cloud computing. First of all, on-demand self-service allows end users to access computing resources and computing facilities according to their needs, such as server time and network storage, on demand and without additional authorization or intervention by the system administrator. Secondly, the service scope can be changed automatically to adapt to dynamic changes in the service load; overloading or redundancy of server capacity, which decreases service quality and wastes resources, can thus be avoided in cloud computing. Thirdly, all computing resources are managed in the form of a shared resource pool, and the underlying technology, resource sharing and resource management are all transparent to the end users. Fourthly, cloud computing allows universal network access: the user can take advantage of various terminal devices, such as mobile phones, computers and personal digital assistants, to access cloud computing via the Internet at any time and anywhere. (Smoot-Tan 2012, 8.)

Clouds can be classified into public, private and hybrid deployment models. The public cloud usually refers to a cloud provided by a third-party provider and available to the general public. It can be used via the Internet and may be free or low-cost. A core attribute of the public cloud is shared resource services. The private cloud is built for an individual client or company, and the company or client can manage the deployment of applications on the infrastructure. It is one of the most effective methods to control data traffic, data security and service quality. Private clouds can be deployed within the firewall of the enterprise data center, or at a secure hosting site. A core attribute of a private cloud is dedicated, owned resources. (Smoot-Tan 2012, 9.)

The hybrid cloud is a blend of the public and private cloud. In recent years, the hybrid cloud has been the main model of development. (Webnotwar 2014.) For data security reasons, companies are more willing to store data in a private cloud, but at the same time they expect to gain access to public cloud computing resources. In the hybrid cloud, the private cloud and the public cloud remain distinct entities, but they are mixed and matched by standardized or proprietary technology to achieve the best performance. Furthermore, the hybrid cloud breaks through the hardware limitations of the private cloud: using the extensibility of the public cloud, the hybrid cloud can obtain higher computing power. When enterprises transfer non-confidential data to the public cloud, the demands on and pressure of the internal private cloud can be reduced. Moreover, the hybrid cloud can effectively reduce the cost, since the enterprise applications and data are placed on the most appropriate platform in this combined approach. (Smoot-Tan 2012, 9.)

3.1.2 Service Models

Generally speaking, it is currently recognized that cloud computing is organized into several service models: IaaS (Infrastructure as a Service), PaaS (Platform as a Service), and SaaS (Software as a Service). (Webnotwar 2014.) IaaS (Infrastructure as a Service) mainly contains computer servers, communication devices, storage devices, and so on. The services of the infrastructure layer provide customers with the required capabilities of computing, storage and network services. Nowadays, virtualization is the core technology used to deliver IaaS. Virtualization technology can turn all kinds of computing devices, storage devices and network devices into computing resources of a virtual resource pool. When the user places an order for these resources, the data manager directly offers the packaged resources to the user. The Amazon EC2 service is a typical example of IaaS. (Smoot-Tan 2012, 10.)

PaaS (Platform as a Service) aims to provide users with a platform, covering the operating system and the environment for developing and hosting applications. Thus, PaaS can make a great contribution to the development of SaaS and to the building of enterprise applications based on the SOA (Service Oriented Architecture) model. Users do not need to manage or maintain the cloud infrastructure, which contains the network, servers, operating systems, and storage, but they only have control over the deployed applications themselves. Microsoft’s Windows Azure and Google’s GAE are the most popular products of the PaaS platform. (Smoot-Tan 2012, 10.)

In addition, SaaS is a model for offering software services through Internet applications, such as Google’s Gmail. Using effective technical measures, SaaS protects the data security and confidentiality of each company. SaaS offers flexible ways to lease the service: companies can add and delete user accounts according to their requirements, and they pay account fees according to the quantity and duration of the service. Because of the reduced costs, the rental cost of SaaS is lower than that of traditional models. To end users, the SaaS model is indistinguishable from a privately built enterprise system, yet it reduces the cost significantly, which lowers the threshold at which a company is willing to take the investment risk. (Smoot-Tan 2012, 10-11.)


Figure 10. Deployment and Service Models of Cloud Computing

3.2 Service-Oriented Infrastructure of Cloud Computing

3.2.1 Service-Oriented Infrastructure Framework

SOI (Service-Oriented Infrastructure), providing an infrastructure framework as a service, is an important foundation of the cloud computing service.

“The top layer supplies various kinds of fantastically powerful, incredibly flexible services to end-users. The bottom layer is a collection of off-the-shelf hardware of various kinds-servers, storage, networking routers and switches, and long-distance telecom services. The intervening layers use the relatively crude facilities of the lower layers to build a new set of more sophisticated facilities.” (Smoot-Tan 2012, 3.)

The virtualization layer, located above the layer of existing enterprise hardware resources, takes responsibility for virtualization. This layer of virtualization technology reduces or eliminates the limitations related to the use of specific hardware devices. The layer above it focuses on management and maintenance, linking the virtual resources provided by the layer below with the user demands. The layer on top of these controls how the services are exposed. Automatic optimization technology permits the resources to be used equally for SaaS and IaaS through all types of network interfaces. (Smoot-Tan 2012, 3-4.)


Figure 11. Service-Oriented Infrastructure Framework (Smoot-Tan 2012)

The basic characteristics of the SOI are as follows. First of all, SOI offers a way to define the dependencies between high-level services, low-level infrastructure services and the actual physical resources. Obviously, changes in a higher layer will cause changes in the lower layer. SOI also defines each service function and the method used to achieve it. Moreover, the use of the infrastructure service is transparent to users, which helps in diagnosing problems and analyzing their causes. Besides that, SOI is a kind of demand-supply model: the risk of provisioning, maintenance and management is transferred from the consumer to the supplier, and the supplier needs to ensure that the infrastructure meets the demands. (Smoot-Tan 2012, 3-4.)

3.2.2 Cloud Modules of Service-Oriented Infrastructure Structure

The idea of cloud modules in the SOI structure is that a typical infrastructure cloud is abstracted into several core modules within the integrated system; each module has clear functions and a specific interface, and two modules that achieve the same function can replace each other as long as they follow the interface specification of the module. A typical infrastructure cloud can be separated into the following five subsystems: the virtualization environment subsystem, the cloud storage subsystem, the virtual network subsystem, the module interconnection subsystem, and the user and security management subsystem. (Smoot-Tan 2012, 4-5.)

The virtualization environment subsystem:

“Server virtualization supports multiple logical servers or virtual machines (VMs) on a single physical server. A VM behaves exactly like a standalone server, but it shares the hardware resources (e.g., processor, disks, network interface cards, and memory) of the physical server with the other VMs. A virtual machine monitor (VMM), often referred to as a hypervisor, makes this possible.” (Smoot-Tan 2012, 5.)

The virtualized environment subsystem uses virtualization technology to achieve logical abstraction and unified representation of the CPU, memory, hard drives and other physical resources. It abstracts the virtual resources, in the form of one or more virtual machines, in order to achieve unified deployment and management of the physical resources. The virtualized operating environment subsystem provides four functions: resource management, node scheduling, virtual machine life cycle management, and virtual machine monitoring. The cluster controller is responsible for scheduling nodes and for monitoring the usage state of the virtual machines, CPU, memory and storage resources on the monitoring platform. The node controller virtualizes the physical resources, which are then allocated to virtual machines through the virtual machine controller. The node controllers manage and monitor the virtual machines throughout the virtual machine life cycle. (Marshall-McCrory-Reynolds 2006, 321-369.) The virtual machine controller needs to complete the conversion from physical resources to virtual resources and to process the virtual machine I/O requests. (Smoot-Tan 2012, 17-35.)

Cloud storage subsystem: The cloud storage subsystem takes responsibility for offering the data storage function for cloud computing. It consists of the SAN (storage area network) and the storage subsystem, containing RAID, disk arrays, and so on. (Smoot-Tan 2012, 5.) The cloud storage subsystem stores the data for the user via two methods: through a virtual disk data interface, or through web-based online storage of files or storage objects. The virtual disk server is the actual storage location of the virtual disk. A virtual disk is mapped to a disk device on a physical node through a storage network transport protocol (FC-SAN (Fibre Channel Storage Area Network), iSCSI (Internet Small Computer System Interface), AoE (Advanced Technology Attachment over Ethernet), etc.). The virtual disk management controller then maps the disk device on that node to a disk on the virtual machine (through storage virtualization of the storage space of the virtual disk server). (Smoot-Tan 2012, 35-40.)

In the organization of online storage systems into buckets and objects, the buckets correspond to file system directories and the objects correspond to the files in the file system. Online storage systems allow users to create and delete buckets, and to upload, download, and delete files. The online storage systems also provide a mechanism to facilitate data flow in the user-managed storage system. Firstly, the online storage system receives the storage resource request from the user or another subsystem. Then the file storage system handles and stores the files required by the user or other subsystems according to the established method and process, such as encryption, decryption, and restructuring. By using metadata storage, it is convenient for users, administrators, and other subsystems to operate on a large number of documents in the file storage system. (Smoot-Tan 2012, 6-7.)
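
A toy, in-memory sketch of the bucket/object model (buckets as directories, objects as files) is shown below. The class and method names are invented for illustration and do not correspond to any particular cloud storage API.

```python
# A toy in-memory sketch of the bucket/object model described above.
class ObjectStore:
    def __init__(self):
        self.buckets = {}                       # bucket name -> {object key: bytes}

    def create_bucket(self, name: str) -> None:
        self.buckets.setdefault(name, {})

    def delete_bucket(self, name: str) -> None:
        if self.buckets.get(name):              # refuse to delete a non-empty bucket
            raise ValueError("bucket not empty")
        self.buckets.pop(name, None)

    def upload(self, bucket: str, key: str, data: bytes) -> None:
        self.buckets[bucket][key] = data

    def download(self, bucket: str, key: str) -> bytes:
        return self.buckets[bucket][key]

    def delete(self, bucket: str, key: str) -> None:
        del self.buckets[bucket][key]

store = ObjectStore()
store.create_bucket("reports")
store.upload("reports", "2015/thesis.txt", b"high performance computing")
print(store.download("reports", "2015/thesis.txt"))
```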

The virtual network subsystem: the virtual network subsystem mainly solves the problem of virtual machine interconnection. Unlike physical machines, which make up a complex logical and physical network, the network of virtual machines is simple to configure. The virtual network subsystem offers four core functions:

1. Configure the communication between the virtual network and the virtual machines
2. Define security groups, so that virtual machines running in the same group follow the same rules, such as ping and ssh
3. Provide flexible IP functionality, allowing users to set up a virtual machine IP address
4. Achieve network isolation, i.e. the isolation of network traffic between different security groups (Smoot-Tan 2012, 6-7)

It is worth noticing that isolation, to some extent, ought to be applied to each end user and computing resource. Thus, one of the core necessities is to set up separate logical traffic paths over a shared physical network infrastructure. Virtualization of the IP layer supplies end-to-end network segmentation and separate connections for end users. Network virtualization is achieved with VRF (virtual routing and forwarding) and MPLS (multi-protocol label switching) technology. The principle of network virtualization is the logical separation of the company’s physical network infrastructure into different isolated sub-networks. However, the end users cannot distinguish these sub-networks from physically separate networks. (Smoot-Tan 2012, 6.)

The module interconnection subsystem links the different modules and subsystems of the cloud system and is responsible for exchanging information between them. First of all, interconnection and integration with the other subsystems constitute a complete cloud infrastructure. Secondly, the module interconnection subsystem handles user access and client load balancing. Finally, the module interconnection subsystem supplies the channels for information flow and control flow in the entire infrastructure cloud system. A typical cloud infrastructure provides two user-level access interfaces, the infrastructure cloud front-end interface and the web-based online storage system interface. The former is responsible for receiving and processing the services requested by users and for recording the information needed to run the whole system. The latter receives, processes and responds to the web-based online storage service requests that the users submit. (Smoot-Tan 2012, 8-9.)

User and security management subsystem:

Table 2. Example of Security Policy of Infrastructure Cloud

System Name   Cloud Internal Security Policy   Cloud Access Security Policy
Eucalyptus    WS-Security + SSH                X509 + SSL + SSH
OpenStack     SSL + SSH                        X509 + SSL
Enomalism     SSL                              X509 + SSL + SSH
Nimbus        WS-Security + SSH                X509 + SSL + SSH
XenXCP        SSL                              LDAP + SSL + SSH
Ganeti        SSL + SSH                        X509 + LDAP + SSH
OpenNebula    WS-Security + SSH                X509 + LDAP + SSH

Table 2 lists the currently popular security mechanisms. WS-Security develops an extension based on SOAP and creates secure web services. Its security policy is applied to the message itself, through signing and encryption, to secure message transmission. SSL puts the security policy into the data segments of the transport layer, thus indirectly implementing message-based secure transmission. In addition, IaaS nodes verify the legitimacy of communication by using SSH public key authentication. The IaaS cloud system usually uses X509 for user identity authentication. Furthermore, XCP, Ganeti and OpenNebula also have an LDAP authentication mechanism, establishing session permissions for the client through the authentication process.

3.3 Typical Technologies

3.3.1 Data Storage Technology

Distributed storage technology consists of distributed file storage systems, distributed object storage systems and distributed database technology. Distributed file storage system: Google came up with GFS (Google File System) to solve the problem of mass data storage and management. The object storage system is an extension of the traditional block device with added intelligent functions; the higher layers access an object via its object ID, instead of having to know the specific space distribution of the object. The Amazon S3 system can be classified as object storage technology. In a cloud computing environment, most applications do not require full SQL statements but need query statements in a key-value form; Google’s BigTable belongs to this type of data storage service. (Leng-Wang 2012, 863-866.)

Cloud computing systems store data using distributed storage technology and guarantee availability through redundant storage, which also ensures practicality and economic efficiency. Furthermore, cloud computing has to meet the requirements of a great many users and serve them in parallel, which means that the data storage technology must offer high throughput and high transmission rates. Future development of data storage technology will focus on large-scale cloud data storage, data encryption, security assurance and the improvement of I/O rates. (Song-Wu-Yao 2011, 320-324.)

For instance, GFS is an extensible distributed file system for managing large-scale, distributed, data-intensive computing. It uses low-cost hardware to build the system and provides a large number of users with high-performance, fault-tolerant service. The differences between GFS and a general distributed file system are listed as follows:


Table 3. Differences of Google File System and the General Distributed File System

File System                            Component Failure Management   File Size             Data Flow and Control Flow
GFS                                    Not exception handling         A few large files     Separate
Traditional distributed file system    Exception handling             Lots of small files   Combined

The GFS system consists of a Master and block servers. The Master holds all metadata of the file system, including the namespace, access control information, the mapping from files to blocks and the block location information. GFS files are cut into blocks of 64 MB for storage, and each block is saved as three or more replicas in the system. Every modification needs to be carried out on all replicas of the data, and version numbers are marked in the same way to ensure that all replicas are in a consistent state. The clients do not read data through the Master, which avoids the large number of read operations that would otherwise make the Master a bottleneck of the system. After obtaining the location information of a data block from the Master, the client conducts the read operations directly with the block server. (Lu-Zeng 2014, 650-661.)
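
A simplified sketch of this read path is given below: the client asks the Master only for metadata and then fetches the data directly from a block server. In-memory dictionaries stand in for the Master and the block servers, and all names and data are invented for the example.

```python
# Simplified sketch of the GFS-style read path described above.
CHUNK_SIZE = 64 * 1024 * 1024            # 64 MB blocks, as in the text

master_metadata = {                       # file name -> list of (block id, replica servers)
    "/logs/day1": [("block-001", ["serverA", "serverB", "serverC"])],
}
block_servers = {                         # server -> {block id: data}
    "serverA": {"block-001": b"log data ..."},
    "serverB": {"block-001": b"log data ..."},
    "serverC": {"block-001": b"log data ..."},
}

def read_file(path: str) -> bytes:
    data = b""
    for block_id, replicas in master_metadata[path]:   # 1. ask the Master for locations
        server = replicas[0]                            # 2. pick one replica
        data += block_servers[server][block_id]         # 3. read directly from it
    return data

print(read_file("/logs/day1"))
```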


Figure 12. The Control Flow and the Data Flow

The control flow and the data flow are separate, as shown in Figure 12 above. After gaining write authorization from the Master, the client transmits the data to all replicas of the data. The client can issue the write-request control signal after all the data replicas have received the modified data. After all the data replicas have finished the update, the master copy issues a control signal to the client indicating that the write operation is complete. (Lu-Zeng 2014, 650-661.)

3.3.2 Distributed Programming Technology

The programming model of the cloud must be simple, and it should be transparent to users and programmers with respect to parallel computing and task scheduling, offering easier computing services to users for a specific purpose. Generally, cloud computing uses the Map-Reduce programming model, and almost all IT vendors employ programming models based on the idea of Map-Reduce. Map-Reduce is not just a programming model, but also a high-efficiency task scheduling model, and it is suitable not only for cloud computing, but also for multi-core processors and clusters. (Yang-Zhao 2014, 2867-2870.)

Map-Reduce is a programming model for processing and generating massive data sets. Programmers specify the processing of data sub-blocks in the Map function, and they specify how to process the intermediate results of the sub-blocks in the Reduce function. When running Map-Reduce programs on a cluster, programmers need not focus on the input, distribution and scheduling of the data; the cluster system also manages communication and fault handling between nodes. Users only need to specify the Map function and the Reduce function to write distributed parallel programs. (Chen-Hu-Huang-Zhu 2012, 1-4.)

There are five steps in executing a Map-Reduce program: reading the input file, assigning parts of the file to several workers that execute in parallel, writing the intermediate files, running multiple workers at the same time, and producing the output. Writing the intermediate files locally reduces the pressure on network bandwidth and the time cost. When executing the Reduce function, the system sends the Reduce request to the node that holds the intermediate file, reducing the bandwidth needed to transfer the intermediate files. (Chen-Hu-Huang-Zhu 2012, 1-4.)
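
A minimal single-machine word count written in the Map-Reduce style is sketched below: the user supplies only the map and reduce functions, while the grouping of intermediate pairs is handled for them. The function names and input are illustrative, not a real distributed framework.

```python
# A minimal word count in the Map-Reduce style described above.
from collections import defaultdict
from itertools import chain

def map_func(line: str):
    for word in line.split():
        yield word, 1                      # emit intermediate (key, value) pairs

def reduce_func(word: str, counts):
    return word, sum(counts)               # combine all values for one key

def map_reduce(lines):
    groups = defaultdict(list)
    for key, value in chain.from_iterable(map(map_func, lines)):
        groups[key].append(value)          # shuffle: group intermediate pairs by key
    return dict(reduce_func(k, v) for k, v in groups.items())

print(map_reduce(["the cloud", "the cluster", "the cloud cluster"]))
# {'the': 3, 'cloud': 2, 'cluster': 2}
```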

3.3.3 Automatic Deployment Technology

Automatic deployment turns the original computing resources into usable resources through automatic installation and deployment. In cloud computing, the resources of the virtual resource pool are divided, installed and deployed to provide users with services and applications. The resources refer to hardware resources (servers), software resources (the software and configurations that users need), network resources and storage resources. The deployment is driven by workflows, as shown below. (Shang-Zhang-Zhang 2014, 511-518.)

Figure 13. Automatic Deployment Framework

The workflow engine and the data model are the functional modules of automatic deployment tools. These tools can identify and schedule the resources in order to implement classified management of them. The workflow engine, which invokes the workflows, is the core mechanism for implementing automatic deployment. Workflows integrate different types of scripts in a centralized, reusable database, and these workflows can automatically complete the configuration of operating systems, applications, and network devices. (Shang-Zhang-Zhang 2014, 511-518.)
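
A toy workflow engine in the spirit of this framework is sketched below; the step names and actions are invented for illustration only and do not correspond to a real deployment tool.

```python
# A toy workflow engine: each step wraps a script-like action, and the engine
# runs the steps in order and reports the result of each.
from typing import Callable, List, Tuple

def provision_server() -> str:   return "server allocated from resource pool"
def install_os() -> str:         return "operating system installed"
def configure_network() -> str:  return "network configured"
def deploy_application() -> str: return "application deployed"

def run_workflow(steps: List[Tuple[str, Callable[[], str]]]) -> None:
    for name, action in steps:
        print(f"[workflow] {name}: {action()}")

run_workflow([
    ("provision", provision_server),
    ("install-os", install_os),
    ("network", configure_network),
    ("deploy-app", deploy_application),
])
```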

3.4 Case Study

3.4.1 Cloud Computing to E-Commerce

E-commerce can offer customers a pleasant shopping experience at a reasonable price. However, the servers become congested, or even crash, when many users log in simultaneously. An enterprise should have sufficient computing capacity and storage capacity to face this situation, and thus advanced software and hardware are needed to support it. However, it is very difficult for small and medium-sized enterprises to develop their own hardware and software infrastructure. (Wang 2013, 313-318.)

Firstly, developing their own software and hardware requires a big investment, including development costs, maintenance costs and software license costs. Secondly, it takes a long time to develop, investigate and test new software, and the earning cycle may be too long for a return on the cost of the investment. Thirdly, professional talent is lacking: software development, network maintenance and database operation all require abundant professional talent. (Wang 2013, 313-318.)

Cloud computing offers a great solution to those problems. The pay-per-use model of cloud computing is popular for e-commerce services designed for small and medium-sized enterprises, which can extend their computing resources according to their business requirements. The use of cloud computing allows the enterprises to largely avoid resource redundancy and waste. Moreover, an online shopping platform based on a cloud computing system can bring together application services from various fields. Cloud computing is not only able to solve the problem of processing data, but also provides integration services, drawing the attention of e-commerce enterprises. (Wang 2013, 313-318.)

In addition, cloud computing allows more enterprises to implement the business model of e-commerce services. The investment in e-commerce can be adjusted according to changes in the volume of business. Also, cloud computing offers more professional services, developed by information technology companies, than the services designed by the business companies themselves. (Wang 2013, 313-318.)

There are also some problems with cloud computing technology. For example, the security management of cloud computing should be enhanced: the framework of cloud computing systems has some security flaws, resulting in an increased risk of unauthorized access and information theft. Related legal systems (laws) also need to be established to protect users and companies. Furthermore, the rate of return and the applicability should also be improved. (Wang 2013, 313-318.)

3.4.2 Information Ecosystem of Cloud Computing

The development of informatization should be the integration of cloud computing and Web 2.0 technologies.

“Web 2.0 describes sites that emphasize user-generated content, usability, and interoperability. A Web 2.0 site may allow users to interact and collaborate with each other in a social media dialogue as creators of user-generated content in a virtual community, in contrast to Web sites where people are limited to the passive viewing of content.” (Wikimedia Foundation, Inc. 2015.)

This kind of integration reaches a higher level of cloud computing informatization, called people services. Cloud computing informatization is created and used by system developers as well as users under the concepts of Web 2.0 technologies; the system developers and users work together to create a more valuable system. Thus, one model for building informatization is user involvement in building the informatization system under the concepts of Web 2.0 technologies. This model can make IaaS, PaaS and SaaS cooperate with each other. The SaaS applications, developed on PaaS, can better meet the needs of the customers. The relationship between the users, SaaS, PaaS and Web 2.0 technologies (the information ecosystem) is shown as follows. (Smoot-Tan 2012, 10-11.)

Figure 14. Information Ecosystem of Cloud Computing

Firstly, business management personnel should be involved in the development of PaaS applications. This allows them to customize the configuration of SaaS applications through the PaaS platform according to business requirements. Users can communicate directly with developers in the Web 2.0 virtual community, and this communication makes the systems more suitable for their needs, for example by adding functions to the PaaS. (Smoot-Tan 2012, 10-11.)

Secondly, the development of SaaS applications on the PaaS development platform can adopt suggestions proposed by users. Because users can test the SaaS applications and give feedback through Web 2.0 communities, the cost of modification is reduced. Incorporating user suggestions also yields more diverse functions in the SaaS applications, meeting a wider range of user needs. (Smoot-Tan 2012, 10-11.)

Thirdly, SaaS offers users the possibility of online rental. The dynamic extendibility of the hardware infrastructure provides users with a flexible service, allowing them to increase usage as the business changes. In addition, all operations of the customized PaaS and its extendable applications are achieved through information shared between users, developers and SaaS vendors. (Smoot-Tan 2012, 10-11.)


4 HIGH PERFORMANCE COMPUTING APPLIED TO CLOUD COMPUTING

4.1 Comparison Application Profile of Cloud Computing and High Performance Computing

An application suited to cloud computing should possess several features (H3C Technologies Co. 2015):
1. The applications are not parallel applications.
2. Almost none of the applications require much memory bandwidth or CPU usage.
3. The applications rarely execute I/O (input/output) operations.
4. The applications can repair themselves: if an application fails for any reason, it can simply be restarted without affecting the end user, as illustrated by the sketch after this list.
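Feature 4 above can be illustrated with a minimal supervision loop. The sketch below is a hypothetical example, not an H3C mechanism: a stateless worker is simply restarted whenever it fails, which is acceptable precisely because a typical cloud workload keeps no local state that would be lost.

```python
import random
import time

def stateless_worker(request_id: int) -> str:
    """A hypothetical stateless request handler that occasionally fails."""
    if random.random() < 0.2:                 # simulated transient failure
        raise RuntimeError("worker crashed")
    return f"response for request {request_id}"

def supervise(request_id: int, max_restarts: int = 5) -> str:
    """Restart the worker on failure; the end user only sees the final result."""
    for _ in range(max_restarts):
        try:
            return stateless_worker(request_id)
        except RuntimeError:
            time.sleep(0.1)                   # brief back-off before restarting
    raise RuntimeError("worker failed permanently")

if __name__ == "__main__":
    print(supervise(42))
```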

An application of high performance computing typically possesses several of the following features (H3C Technologies Co. 2015):
1. Many high performance computing applications are parallel applications, with some kind of data communication between the different processes of the application.
2. Sometimes the amount of data transferred between the processes is small.
3. Sometimes the amount of data transferred between the processes is large.
4. Some applications are serial or multi-threaded applications running on a single node, for example BLAST (Basic Local Alignment Search Tool).
5. Some serial and parallel applications execute many I/O operations, for example Ansys, Abaqus or Nastran FEA codes.
6. Some applications can generate a checkpoint, a snapshot of the state of the calculation. If such an application fails for any reason, it can restart from the last checkpoint without having to restart from the beginning; not all applications have this feature, however. A minimal checkpoint/restart sketch is given after this list.
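Feature 6 above, checkpointing, can be sketched as follows. This is a minimal, hypothetical example (the file name and the iterative computation are invented): the loop periodically saves its state to disk, and after a crash the run resumes from the last saved iteration instead of starting over.

```python
import json
import os

CHECKPOINT_FILE = "checkpoint.json"   # hypothetical checkpoint location

def load_checkpoint():
    """Return (iteration, partial_sum) from the last checkpoint, if any."""
    if os.path.exists(CHECKPOINT_FILE):
        with open(CHECKPOINT_FILE) as f:
            state = json.load(f)
        return state["iteration"], state["partial_sum"]
    return 0, 0.0

def save_checkpoint(iteration: int, partial_sum: float) -> None:
    """Snapshot the calculation state so a failed run can resume here."""
    with open(CHECKPOINT_FILE, "w") as f:
        json.dump({"iteration": iteration, "partial_sum": partial_sum}, f)

def long_running_sum(total_iterations: int = 1_000_000,
                     checkpoint_every: int = 100_000) -> float:
    iteration, partial_sum = load_checkpoint()   # resume if a checkpoint exists
    for i in range(iteration, total_iterations):
        partial_sum += 1.0 / (i + 1)             # stand-in for real work
        if (i + 1) % checkpoint_every == 0:
            save_checkpoint(i + 1, partial_sum)
    return partial_sum

if __name__ == "__main__":
    print(long_running_sum())
```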

Cloud computing applications and high performance computing applications therefore seem to be completely different from each other. Nevertheless, they share some common features. For example, BLAST, a high performance computing application, is independent of the interconnection and communication between nodes, and it does not need to execute many I/O operations. These characteristics make it very well suited to cloud computing. Suitable applications are not limited to such special cases, since some parallel applications can run completely on a single node. As long as the data set can be placed on a node without having to be moved between nodes, the application is able to run in a cloud computing environment. (H3C Technologies Co. 2015.)

A high performance computing application and its data set must have certain features to operate normally in a cloud computing environment. Firstly, the application must run on a single node and its data set must be located on that node. Secondly, the application must not be I/O intensive. Thirdly, the application needs to run quickly and be able to create a checkpoint to cope with system failures. (H3C Technologies Co. 2015.) A minimal sketch of such a screening check is given below.
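The conditions above can be captured as a simple eligibility check. The sketch below is a hypothetical illustration, not part of the H3C solution; the application profile fields and the example values are invented.

```python
from dataclasses import dataclass

@dataclass
class AppProfile:
    """Hypothetical description of an HPC application and its data set."""
    fits_on_single_node: bool     # application and data set fit on one node
    io_intensive: bool            # executes many I/O operations
    runtime_hours: float          # typical wall-clock time of one run
    supports_checkpointing: bool  # can resume from a saved snapshot

def suitable_for_cloud(app: AppProfile, short_run_hours: float = 1.0) -> bool:
    """Apply the conditions discussed above."""
    if not app.fits_on_single_node:
        return False
    if app.io_intensive:
        return False
    # Either the run is short, or a failure can be absorbed via checkpoints.
    return app.runtime_hours <= short_run_hours or app.supports_checkpointing

if __name__ == "__main__":
    blast_like = AppProfile(True, False, 4.0, True)
    print(suitable_for_cloud(blast_like))   # True
```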

4.2 High Performance Computing Deployments in Cloud Computing Environments

High performance computing can be offered as a special kind of cloud computing service that provides computing capacity to Internet users. In this sense, high performance computing can be treated as a part of the cloud computing data center. However, it is still very different from ordinary cloud computing, and it places specific requirements on the network and on security, as listed below. High performance computing is a special application of a server cluster, and it requires the servers to act as a single system. (H3C Technologies Co. 2015.)

Firstly, there must be no heterogeneity within the high performance computing cluster system. Secondly, the requirements for communication service quality within the cluster are very strict, so the system cannot share its service channel with other business traffic. Thirdly, the security level of the high performance computing cluster is very high; it requires logical and physical separation from other systems, from the access area to the computing area. Fourthly, virtual machines are normally not present, so the communication flow within the cluster is not particularly busy. Finally, the performance of the computing nodes needs to be excellent to meet the requirements. (H3C Technologies Co. 2015.)

Figure 15. Internet Solution of the Integration of High Performance Computing and Cloud Computing (H3C Technologies Co. 2015)

There are two important techniques in the security solution of the computing system: the secure partitioning plan and secure end-to-end deployment. A supercomputing center network contains different types of tasks and equipment with different levels of vulnerability. The supercomputing network is therefore divided into several areas according to different security policies. Logical isolation between the areas guarantees the data security and the data services of the high performance computing area. Moreover, secure end-to-end deployment maintains logical isolation for the various users, from the access network to the supercomputing center. End-to-end isolation further strengthens the security of the data center and the quality of service. (H3C Technologies Co. 2015.)
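The secure partitioning idea can be illustrated with a small model of zones and an isolation policy. The zone names and the allowed flows below are hypothetical examples, not H3C's actual partitioning scheme: traffic is permitted only along explicitly whitelisted zone pairs, so the computing area stays logically isolated from other business zones.

```python
# Hypothetical model of a partitioned supercomputing-center network.
ALLOWED_FLOWS = {
    ("user_access", "login_area"),
    ("login_area", "hpc_computing"),
    ("hpc_computing", "hpc_storage"),
    ("user_access", "business_services"),
}

def flow_permitted(src_zone: str, dst_zone: str) -> bool:
    """Return True only if the security policy explicitly allows the flow."""
    return (src_zone, dst_zone) in ALLOWED_FLOWS

if __name__ == "__main__":
    print(flow_permitted("login_area", "hpc_computing"))        # True
    print(flow_permitted("business_services", "hpc_computing")) # False: isolated
```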

Using IRF II (the second generation of the Intelligent Resilient Framework) technology can improve the reliability of the supercomputing network while reducing configuration effort and maintenance costs. The deployment of IRF II provides the following benefits (H3C Technologies Co. 2015):
1. L2 and L3 protocols are processed in a distributed manner, which greatly improves performance.
2. Each group acts as one logical fabric, so configuration management is more efficient.
3. Device software is easy to upgrade across the whole cluster.
4. For high-end devices, multiple devices can be managed as a single resource, which simplifies the network.

Figure 16. Deployment of the IRF II (The Second Generation of Intelligent Resilient Framework) (H3C Technologies Co. 2015)


5 CONCLUSION

This thesis provides basic knowledge of high performance computing and cloud computing. The high-speed computing capacity of high performance computing can be used to process large and complicated data. Cloud computing is based on a pay-per-use model, offering virtualized resources as a service over the Internet. The idea of integrating high performance computing with cloud computing arises from the goal of providing a better computing service.

In the thesis, the author compares the application profiles of cloud computing and high performance computing. The research shows that they still share some common features, which form an important starting point for future development. Because of the limitations of time and resources, the thesis has several shortcomings. For example, the language is not always precise, and other supercomputing topics should be studied in later work.


REFERENCES

Barney, B. 2014. Introduction to Parallel Computing. Referenced 13 April 2015. https://computing.llnl.gov/tutorials/parallel_comp/.

Buyya, R. 1999. High Performance Cluster Computing. Referenced 18 March 2015. http://dpnm.postech.ac.kr/cluster/ppt/Cluster-Tutorial/.

Barragy, Ted. 2007. What Drives Performance in HPC? Referenced 13 April 2015. http://www.linux-mag.com/id/4170/.

E-sciencecity. 2015. Computer and supercomputer. Referenced 18 March 2015. http://www.e-sciencecity.org/EN/HPC-tower/history-of-supercomputing.html.

Gerber, R. 2012. Introduction to High Performance Computers. Referenced 20 March 2015. https://www.nersc.gov/assets/pubs_presos/IntroHPCSystems-NewUser.pdf/

Hu, T-Chen, H-Huang, L-Zhu, X. A survey of mass based on cloud computing. IEEE. 1-4.

H3C Technologies Co. 2015. High Performance Computing Applied to Cloud Computing. Referenced 18 April 2015. http://www.h3c.com/portal/.

Jacob, Matthew. 2015. High Performance Computing. Referenced 07 April 2015. http://nptel.ac.in/courses/106108055/module5/HPC%20Lecture22.pdf.

Jones, M. D. 2011. Introduction to High Performance Computers. Referenced 20 March 2015. http://www.buffalo.edu/content/www/ccr/support/trainingresources/tutorials/advanced-topics--e-g--mpi--gpgpu----etc--/2011-01---introduction-to-hpc--hpc-1-/_jcr_content/par/download/file.res/introHPC-handout-2x2.pdf.

Kalcher, S. 2007. Don’t forget the “Fabric”. Referenced 07 April 2015. http://www.adtechglobal.com/Data/Sites/1/marketing/dont-forget-the-fabricreport.pdf?hsCtaTracking=c2d60212f29d4cf0a2a5192034a471ff%7Cc356f717-0955-4197-a25a-cd07e644d9c4.

Kohlmeyer, A. 2010. Introduction to High-Performance Computing. Referenced 20 March 2015. http://portal.ictp.it/icts/hpc-appointments/HPC-Appointment-3.pdf.

Leng, L-Wang, L. 2012. Research on cloud computing and key technologies. IEEE. 863-866.


Lewis, G. 2013. Standards in Cloud Computing Interoperability. Referenced 15 April 2015. http://blog.sei.cmu.edu/post.cfm/standards-in-cloud-computing-interoperability.

Lu, G-Z, W. 2014. Cloud Computing Survey. Applied Mechanics and Materials, Volume 530-531. 650-661.

Marshall, D-McCrory, D-Reynolds, W. 2006. Advanced Server Virtualization: VMware and Microsoft Platforms in the Virtual Data Center. Auerbach Publications.

Shang, Y-Zhang, R-Zhang, S. 2014. An Automatic Deployment Mechanism on Cloud. IEEE. 511-518.

Smoot, S-Tan, N. 2011. Private Cloud Computing, 1st Edition: Consolidation, Virtualization, and Service-Oriented Infrastructure.

Sridhar, T. Cloud Computing - A Primer. Referenced 13 April 2015. http://www.cisco.com/web/about/ac123/ac147/archived_issues/ipj_12-3/123_cloud1.html.

Song, J-Wu, C-Yao, J. 2011. Cloud computing and its key techniques. IEEE. 320-324.

Top 500. 2014. The list of top 500. Referenced 15 April 2015. http://www.top500.org/

Wang, D. 2013. Influences of Cloud Computing on E-Commerce Businesses and Industry. Journal of Software Engineering and Applications. Volume 6. 313-318.

Webnotwar. 2015. Clearing the cloud: mistakes, misconceptions and misuses with cloud computing. Referenced 15 April 2015. http://www.webnotwar.ca/clearing-the-cloud-mistakes-misconceptions-and-misuses-with-cloud-computing/.

Zhao, X-Yang, C. 2014. Design of Cloud Computing Environment for Online Open Course. Applied Mechanics and Materials, Volume 687-691. 2867-2870.