Byzantine Fault Tolerance for Nondeterministic Applications Bo Chen

BYZANTINE FAULT TOLERANCE FOR NONDETERMINISTIC APPLICATIONS BO CHEN Bachelor of Science in Computer Engineering Nanjing University of Science & Technology, China July, 2006 Submitted in partial fulfillment of the requirements for the degree MASTER OF SCIENCE IN COMPUTER ENGINEERING at the CLEVELAND STATE UNIVERSITY December, 2008 The thesis has been approved for the Department of ELECTRICAL AND COMPUTER ENGINEERING and the College of Graduate Studies by ______________________________________________ Thesis Committee Chairperson, Dr. Wenbing Zhao ____________________ Department/Date _____________________________________________ Dr. Yongjian Fu ______________________ Department/Date _____________________________________________ Dr. Ye Zhu ______________________ Department/Date ACKNOWLEDGEMENT First, I must thank my thesis supervisor, Dr. Wenbing Zhao, for his patience, careful thought, insightful commence and constant support during my two years graduate study and my career. I feel very fortunate for having had the chance to work closely with him and this thesis is as much a product of his guidance as it is of my effort. The other member of my thesis committee, Dr. Yongjian Fu and Dr. Ye Zhu suggested many important improvements to this thesis and interesting directions for future work. I greatly appreciate their suggestions. It has been a pleasure to be a graduate student in Secure and Dependable System Group. I want to thank all the group members: Honglei Zhang and Hua Chai for the discussion we had. I also wish to thank Dr. Fuqin Xiong for his help in advising the details of the thesis submission process. I am grateful to my parents for their support and understanding over the years, especially in the month leading up to this thesis. Above all, I want to thank all my friends who made my life great when I was preparing and writing this thesis. BYZANTINE FAULT TOLERANCE FOR NONDETERMINISTIC APPLICATIONS BO CHEN ABSTRACT The growing reliance on online services accessible on the Internet demands highly reliable system that would not be interrupted when encountering faults. A number of Byzantine fault tolerance (BFT) algorithms have been developed to mask the most complicated type of faults — Byzantine faults such as software bugs, operator mistakes, and malicious attacks, which are usually the major cause of service interruptions. However, it is often difficult to apply these algorithms to practical applications because such applications often exhibit sophisticated non-deterministic behaviors that the existing BFT algorithms could not cope with. In this thesis, we propose a classification of common types of replica nondeterminism with respect to the requirement of achieving Byzantine fault tolerance, and describe the design and implementation of the core mechanisms necessary to handle such replica nondeterminism within a Byzantine fault tolerance framework. In addition, we evaluated the performance of our BFT library, referred to as ND-BFT using both a micro-benchmark application and a more realistic online porker game application. The performance results show that the replicated online poker game performs approximately 13% slower than its nonreplicated counterpart in the presence of small number of players. iv Keywords: Byzantine fault tolerance, replica nondeterminism, security, replica consistency, replication, intrusion tolerance, performance, online poker game v TABLE OF CONTENTS Page ABSTRACT ............................................................................................................. iii LIST OF FIGURES ................................................................................................ viii ACRONYM .............................................................................................................. ix CHAPTER I. INTRODUCTION ...................................................................................................1 1.1 Contribution …………………………………..…………………………....3 1.2 Thesis Outline .................................................................................. ........….. 4 II. BACKGROUND …………………………………………………...…………..…..5 2.1 Fault Tolerance ........................................................................................ .... 5 2.2 Byzantine Fault Tolerance ........................................................................... 7 2.2.1 Byzantine Fault ............................................... ……………………. 7 2.2.2 Byzantine Fault Tolerance ................................................................ 8 2.3 Other Byzantine Fault Tolerance Techniques ........................................... 11 2.3.1 Paxos ..................................................................................................11 2.3.2 Threshold Digital Signatures ………………………......…………...12 III. BYZANTINE FAULT TOLERANT FOR NONDETERMINISTIC APPLICATIONS .............................................................................................................................14 3.1 System Model .............................................................................................. 14 3.1.1 Operation ...................................................................................... 15 3.1.2 Failure Model ................................................................................. 15 3.1.3 Communication Model .................................................................. 16 vi 3.1.4 Cryptography ................................................................................ 17 3.2 Threat Analysis ............................................................................................ 18 3.3 Type of Replica Nondeterminism................................................................. 19 3.4 Solution for each type of Replica Nondeterminism...................................... 22 3.4.1 Verifiable Pre-determinable Nondeterminism .............................. 25 3.4.2 Non-Verifiable Pre-determinable Nondeterminism ...................... 27 3.4.3 Verifiable Post-determinable Nondeterminism ............................. 31 3.4.4 Non-Verifiable Post-determinable Nondeterminism....................... 34 3.5 Proof of correctness...................................................................................... 37 IV. IMPLEMENTATION AND PERFORMANCE EVALUATION .................... 39 4.1 Implementation............................................................................................. 39 4.1.1 Library ....…………………………………..…………...….....….. 40 4.1.2 Interface .............................................................................…….…. 44 4.1.2 Online poker game …………………..……………………....……. 45 4.2 Performance Evaluation …………..…………………………..……….…. 47 4.2.1 Experimental Setup …………………………...……………...…... 48 4.2.2 Normal Case Operation.................................................................... 48 4.2.3 Online Poker Game ………………………………...……….……..54 V. RELATED WORKS …...……………………………………………………..….58 VI. CONCLUSION AND FUTURE WORKS ………..……………..……………...60 6.1 Summary.............................................................................................. 61 6.2 Future Work ....................................................................................... 63 BIBLIOGRAPHY ..................................................................................................... 66 vii LIST OF FIGURES Figure Page Figure 1: Byzantine Agreement (one traitor) ............................................................8 Figure 2: Normal Case Operation of BFT .............................................................. 11 Figure 3: System Architecture .................................................................................24 Figure 4: Solution for Verifiable Pre-Determinable Non-Determinism .................27 Figure 5: Solution for Non-Verifiable Pre-Determinable Non-Determinism ..........29 Figure 6: Solution for Verifiable Post-Determinable Non-Determinism .................34 Figure 7: Solution for Non-Verifiable Post-Determinable Non-Determinism ....... 37 Figure 8: Message Format ........................................................................................42 Figure 9: Architecture of online poker game with ND-BFT .................................. 46 Figure 10: End-to-End Latency of Pure Non-Determinism .....................................49 Figure 11: End-to-End Latency of Composite Non-Determinism ..............................57 Figure 12: Throughput of Pure Non-Determinism ................................................... 53 Figure 13: Throughput of Composite Non-Determinism ....................................... 54 Figure 14: Throughput of Online poker game (four replicas) ............................... ...56 Figure 15: Throughput of Online poker game (seven replicas) ................................57 viii ACRONYM BFT Byzantine Fault Tolerance ND-BFT Nondeterminism Byzantine Fault Tolerance VPRE Verifiable Pre-Determinable Nondeterminism NPRE Non-Verifiable Pre-Determinable Nondeterminism VPOST Verifiable Post-Determinable Nondeterminism NPOST Non-Verifiable Post-Determinable Nondeterminism ix CHAPTER I INTRODUCTION The society is increasingly dependent on services provided by computer systems and our vulnerability to computer failures is growing as a result: we expect to have highly-available systems or applications that should work correctly and provide services without interruptions. This requires the system or the application to be carefully designed and implemented, and rigorously tested. However, considering the intense

Byzantine Fault Tolerance for Nondeterministic Applications Bo Chen

Deterministic Execution of Multithreaded Applications

CUDA Toolkit 4.2 CURAND Guide

Unit: 4 Processes and Threads in Distributed Systems

A Short History of Computational Complexity

Deterministic Parallel Fixpoint Computation

Internally Deterministic Parallel Algorithms Can Be Fast

Entropy Harvesting in GPU[1] CUDA Code for Collecting Entropy

Deterministic Atomic Buffering

Complexity Theory Lectures 1–6

Optimal Oblivious RAM*

Semantics and Concurrence Module on Semantic of Programming Languages

00 Fast K-Selection Algorithms for Graphics Processing Units