Towards an Efficient Distributed Cloud

TOWARDS AN EFFICIENT DISTRIBUTED CLOUD ARCHITECTURE BY PRAVEEN KHETHAVATH Bachelor of Engineering in Electronics and Communication Engineering Osmania University Hyderabad, AP, INDIA 2006 Master of Science in Computer Science University of Northern Virginia Annandale, VA 2008 Submitted to the Faculty of the Graduate College of the Oklahoma State University in partial fulfillment of the requirements for the Degree of DOCTOR OF PHILOSOPHY July, 2014 TOWARDS AN EFFICIENT DISTRIBUTED CLOUD ARCHITECTURE Dissertation Approved: Johnson P Thomas Dissertation Adviser Eric Chan-tin Dissertation Co-Adviser Subhash Kak Mary Gade ii LIST THE PUBLICATIONS YOU HAVE FROM THIS WORK Praveen Khethavath, Johnson Thomas. “Game Theoretic approach to Resource provisioning in a Distributed Cloud”, submitted at 28th IEEE International Conference on. Advanced Information Networking and Applications Workshops WAINA 2014(Accepted) Praveen Khethavath, Johnson Thomas, Eric Chan-Tin, and Hong Liu. "Introducing a Distributed Cloud Architecture with Efficient Resource Discovery and Optimal Resource Allocation". In Proceedings of 3rd IEEE SERVICES CloudPerf Workshop 2013 Praveen Khethavath, Nhat, Prof. Johnson P Thomas. “A Virtual Robot Sensor Network (VRSN)”. In Proceedings of 2nd International Workshop on Networks of Cooperating Objects CONET 2011 Praveen Khethavath, Johnson Thomas. “Distributed Cloud Architecture: Resource Modelling and Security Concerns”. In Proceedings of 3rd Annual conference on Theoretical and Applied Computer Science (TACS 2012) iii ACKNOWLEDGEMENTS I would like to express my deepest gratitude to my advisor, Dr. Johnson Thomas for his excellent guidance, patience, and providing me with an excellent atmosphere for doing research and throughout my thesis. His guidance helped me to successfully complete my research. For me, he was not only a respectable professor who led me on the way to do research, but also an attentive tutor who trained me to be a good teacher in my future career. I really appreciate everything he has done in the past years. Besides my advisor, I would like to thank Dr. Eric Chan-Tin my co-advisor, for his support and guidance in my research. I greatly thank him for all the kindness, support and encouragement. I would also like to thank Dr. Subhash Kak and Dr. Mary Gade for their encouragement and insightful comments. I would like to also thank Dr. Tingting Chen for providing opportunity to work on her research. I would like to thank all my committee members for being a support and guiding me to advance in my future goals. I would like to thank my family for supporting me throughout my life and it is they who have made me who I am now. iv Acknowledgements reflect the views of the author and are not endorsed by committee members or Oklahoma State University. NAME: PRAVEEN KHETHAVATH DATE OF DEGREE: MAY, 2014 TITLE OF STUDY: Towards an Efficient Distributed Cloud Architecture MAJOR FIELD: COMPUTER SCIENCE Abstract: Cloud computing is an emerging field in computer science. Users are utilizing less of their own existing resources, while increasing usage of cloud resources. There are many advantages of distributed computing over centralized architecture. With increase in number of unused storage and computing resources and advantages of distributed computing resulted in distributed cloud computing. In the distributed cloud environment that we propose, resource providers (RP) compete to provide resources to the users. In the distributed cloud all the cloud computing and storage services are offered by distributed resources. In this architecture resources are used and provided by the users in a peer to peer fashion. We propose using multi- valued distributed hash tables for efficient resource discovery. Leveraging the fact that there are many users providing resources such as CPU and memory, we define these resources under one key to easily locate devices with equivalent resources. We then propose a new auction mechanism, using a reserve bid formulated rationally by each user for the optimal allocation of discovered resources. We have evaluated the performance of resource discovery mechanisms for the distributed cloud and distributed cloud storage and compared the results with existing DHTs, peer to peer clients such as VUZE and explored the feasibility and efficiency of the proposed schemes in terms of resource/service discovery and allocation. We use a simultaneous Auction mechanism and select a set of winners once we receive all contributions or bids. In a real world scenario, users request resources with multiple capabilities, and in order to find such resources we use a contribution mechanism where service providers will provide a contribution price to users for providing a resource. Users use our proposed auction mechanism to select the resources from the set of resource providers. We show that Nash equilibrium can be achieved and how we can avoid the problem of free riders in the distributed cloud. Network latency is an important factor when deciding which resource provider to select. We used treeple a secure latency estimation scheme to obtain network measurements in distributed systems. We developed a mobile application using distributed cloud which preserves privacy and provides security for a user. Distributed cloud is used for developing such an application where all the data needs to be close to the users and avoids single point of failure, which is the problem with existing cloud. v TABLE OF CONTENTS Chapter Page I. INTRODUCTION .............................................................................................................. 1 1.1 RESEARCH BACKGROUND ................................................................................................................ 1 1.2 MOTIVATION .................................................................................................................................... 2 1.3 OBJECTIVES ...................................................................................................................................... 3 1.4 CONTRIBUTIONS .............................................................................................................................. 4 1.5 DISSERTATION OUTLINE ................................................................................................................... 6 II. CLOUD COMPUTING OVERVIEW ................................................................................ 8 2.1 INTRODUCTION ............................................................................................................................... 8 2.2 CLOUD DEPLOYMENT MODELS ....................................................................................................... 9 2.3 CLOUD SERVICE MODELS .............................................................................................................. 10 2.4 BASIC CLOUD ARCHITECTURE ....................................................................................................... 12 2.5 SUMMARY ..................................................................................................................................... 14 III. DISTRIBUTED CLOUD ARCHITECTURE .................................................................. 16 3.1 PROBLEM STATEMENT .................................................................................................................. 17 3.2 LITERATURE REVIEW ..................................................................................................................... 19 3.3 RESOURCE DESCRIPTION FRAMEWORK ........................................................................................ 21 3.4 HIERARCHIAL MODEL OF DISTRIBUTED CLOUD ............................................................................ 24 3.5 DISTRIBUTED CLOUD ARCHITECTURE ........................................................................................... 25 vi IV. RESOURCE DISCOVERY MECHANISM IN DISTRIBUTED CLOUD ..................... 29 4.1 PROBLEM STATEMENT .................................................................................................................. 29 4.2 LITERATURE REVIEW ..................................................................................................................... 30 4.3 RESOURCE DISCOVERY IN DISTRIBUTED CLOUD ........................................................................... 33 4.3.1 NAÏVE SOLUTION FOR RESOURCE DISCOVERY .................................................................... 33 4.3.2 RESOURCE DISCOVERY USING MULTI-VALUED HASHTABLE SCHEME ................................. 35 4.4 EXPERIMENTAL ANALYSIS ............................................................................................................. 39 4.4.1 DISTRIBUTED CLOUD COMPUTING ...................................................................................... 39 4.4.2 DISTRIBUTED CLOUD STORAGE ........................................................................................... 43 4.5 CONCLUSIONS .............................................................................................................................. 46 V. GAME THEORETIC APPROACHES FOR RESOURCE ALLOCATION ................... 47 5.1 PROBLEM STATEMENT ................................................................................................................ 47 5.2 LITERATURE REVIEW .................................................................................................................... 48 5.3 A GAME THEORETIC APPROACH

Towards an Efficient Distributed Cloud

Cloud Interoperability with the Opennebula Toolkit

Python for Bioinformatics, Second Edition

Cloud Computing and Enterprise Data Reliability Luan Gashi University for Business and Technology, [email protected]

The Opennebula Standard-Based Open-Source Toolkit to Build Cloud Infrastructures

Geospatial Data AS a Service: Use Cases

Developing Cloud Computing Infrastructures in Developing Countries in Asia

Cloud Computing and Big Data Is There a Relation Between the Two: a Study

Evolution of As-A-Service Era in Cloud

Accelerate Delivery of Your Modern Data Analytics Platform with the Adatis Framework

Cloud Computing and Mobile Application Development

Lenovo Big Data Validated Design for Cazena Saas Using Cloudera on Thinkagile HX

A STUDY of CLOUD COMPUTING TECHNOLOGY ADOPTION by SMALL and MEDIUM ENTERPRISES (Smes) in GAUTENG PROVINCE