INFORMATION to USERS This Manuscript Has Been Reproduced from the Microfilm Master. UMI Films the Text Directly from the Origina

INFORMATION TO USERS This manuscript has been reproduced from the microfilm master. UMI films the text directly from the original or copy submitted. Thus, some thesis and dissertation copies are in typewriter face, while others may be from any type of computer printer. Hie quality of this reproduction is dependent upon the quality of the copy submitted. Broken or indistinct print, colored or poor quality illustrations and photographs, print bleed through, substandard margins, and improper alignment can adversely afreet reproduction. In the unlikely event that the author did not send UMI a complete manuscript and there are missing pages, these will be noted. Also, if unauthorized copyright material had to be removed, a note will indicate the deletion. Oversize materials (e.g., maps, drawings, charts) are reproduced by sectioning the original, beginning at the upper left-hand corner and continuing from left to right in equal sections with small overlaps. Each original is also photographed in one exposure and is included in reduced form at the back of the book. Photographs included in the original manuscript have been reproduced xerographically in this copy. Higher quality 6" x 9" black and white photographic prints are available for any photographs or illustrations appearing in this copy for an additional charge. Contact UMI directly to order. University Microfilms International A Ben & Howell information Company 300 North ZeebRoad Ann Arbor Mt 48106-1346 USA 313 761-4700 800 521-0600 Order Number 9605332 Fault-tolerant distributed algorithms for consensus and termination detection Wu, Li-Fen, Ph.D. The Ohio State University, 1994 UMI 300 N. Zeeb Rd. Ann Arbor, Ml 48106 Fa u l t -T o l e r a n t D is t r ib u t e d A l g o r it h m s fo r C o n s e n s u s a n d T e r m in a t io n D e t e c t io n DISSERTATION Presented in Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy in the Graduate School of The Ohio State University By Li-Fen Wu, B.S., M,S, ***** The Ohio State University 1994 Dissertation Committee: Approved by Professor Ten-Hwang Lai Professor Ming-Tsan Liu Adviser Professor Mukesh Singhal Department of Computer and Information Science © Copyright by Li-Fen Wu 1994 To My Parents ii A cknowledgments I would like to express my deepest gratitude to my advisor, Professor Ten-Hwang Lai, for his guidance, advice, support, and encouragement over these years. My special thanks are to Professor Ming-Tsan Liu and Professor Mukesh Singhal for being in my Dissertation Defense Committee. They have helped me not only at the committee, but also throughout my graduate school study years. I would like to thank Jeff Ward for his constructive criticisms and suggestions that have considerably improved this work. I also appreciate the staff in the Department of Computer and Information Science. Their kind assistance made the study here a smooth and pleasant experience. Special thanks are due to Tom Fletcher, Elley Quinlan, and Carl Phillips. All my family members deserve the greatest merits and special thanks. They have been always very supportive during my graduate study at the Ohio State University. Abundant credits should go to my husband, for his love, support, understanding and sacrifice. My thanks shall go to all my friends here in Columbus, Ohio, who have shared my ups and downs during the past years. In particular, my special thanks are due to Ming-Jye Sheng, Shu-Hwa Sarah Yu, Ing-Miin Hsu, and Shu-Shang Sam Wei for their persistent support and encouragement. I would like to thank Dan Oblinger, John Cameron, and Deborah Shands for their kindness to help me to adjust to the new environment. Last but not least, I am also greatly indebted to Dr. Rong Chang, my manager at IBM, for his understanding and encouragement. V it a March 28, 1965 ....................................................... Born: Taiwan, Republic of China Sept. 1983 - June 1987 ..........................................B.S., Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan Sept. 1987 - Mar. 1989 .........................................M.S., Department of Computer and In formation Science The Ohio State University, Columbus, Ohio Mar. 1989 - Dec. 1993 .........................................Graduate Teaching Associate, Depart ment of Computer and Information Sci ence The Ohio State University, Columbus, Ohio Dec. 1993 - June 1994 ........................................... Software Developer, Internetworking and Multi-media Services Division Advantis, White Plains, New York June 1994 - present ............................................... Software Developer, Network Applica tion Services Division IBM, White Plains, New York Publications Fail-Flush : A Primitive for Fault-Tolerant Computing. IASTED International Con ference on Reliability, Quality Control, and Risk Assessment,pages. 62-65, November 1992. An (N-l)-Resilient Algorithm for Distributed Termination Detection. Fourth IEEE Symposium on Parallel and Distributed Processing, pages 274-281, December 1992. (co-author: T. -H. Lai) Consensus and Termination Detection in the Presence of Faulty Processes. Inter national Conference on Parallel and Distributed Systems (ICPADS), pages 267-274, December 1992. (co-authors : T. -H. Lai, Y. -C. Tseng) Low-congestion Embedding of Multiple Graphs in a Hypercube. International Con ference on Parallel and Distributed Systems (ICPADS), pages 378-385, December 1992. (co-authors : Y. -C. Tseng, T. -H. Lai) Matrix Representation of Graph Embedding on a Hypercube. Journal of Parallel and Distributed Computing, to appear, (co-authors : Y. -C. Tseng, T. -H. Lai) An (N-l)-Resilient Algorithm for Distributed Termination Detection. IEEE Transac tions on Parallel and Distributed Systems, to appear, (co-author: T. -H. Lai) Fields of Study Major Field: Computer and Information Science Studies in: Parallel and Distributed Computing Prof. Ten-Hwang Lai Computer Architecture and Networking Prof. Ming-Tsan Liu Computer Graphics Prof. Wayne Carlson L i s t o f T a b l e s TABLE PAGE 1 Performance Measures .............................................................................................. 127 L i s t o f F i g u r e s FIGURE PAGE 1 Context of Distributed Computing ....................................................................... 25 2 State Transition Graph of Basic Process 7* ...................................................... 35 3 Process at{ of Consensus Protocol a ...................................................................... 35 4 The Protocol for Process t, 1 < *< n ............................................................... 53 5 S tate Transition of a Basic Process....................................................................... 72 6 Dijkstra-Scholten Algorithm for p;, 1 < t < n .................................................... 76 7 Algorithm for p;, 1 < i < n....................................................................................... 87 8 An Example ................................................................................................................... 90 9 A C ounter Exam ple ..................................................................................................... 135 T a b l e o f C o n t e n t s DEDICATION ........................................................................................................................... ii ACKNOWLEDGMENTS ...................................................................................................... iii VITA .............................................................................................................................................. v LIST OF TABLES .................................................................................................................. vii LIST OF FIGURES .............................................................................................................. viii CHAPTER PAGE I IN T R O D U C T IO N ........................................................................................................... 1 1.1 Contributions of Research ............................................................................... 4 1.1.1 The Two Proposed Services ............................................................... 5 1.1.2 TD Is Harder Than C O N ................................................................... 6 1.1.3 A Fault-Tolerant Distributed TD Algorithm .............................. 7 1.1.4 Other Applications ................................................................................ 9 1.2 Organization of Dissertation ............................................................................ 9 II Survey of the Literature ................................................................................................ 12 2.1 Distributed Computing ................................................................................... 13 2.2 Fault-Tolerant Distributed Com puting ........................................................ 15 2.3 The Consensus Problem ................................................................................... 20 2.4 The Termination Detection Problem ........................................................... 21 III Consensus and Termination Detection .................................................................. 25 3.1 T he Traditional M o d e l ...................................................................................... 25 3.2 Problem Descriptions .....................................................................................

INFORMATION to USERS This Manuscript Has Been Reproduced from the Microfilm Master. UMI Films the Text Directly from the Origina

Failure Detectors for Wireless Sensor-Actuator Systems

The Dynamic Enterprise Bus∗

Low-Overhead Accrual Failure Detector

A Literature Review of Failure Detection Within the Context of Solving the Problem of Distributed Consensus

Failure Detectors for Wireless Sensor-Actuator Systems Hamza A

Self-Healing Distributed Systems

A Self-Tuning Failure Detection Scheme for Cloud Computing Service

The Failure Detector Abstraction

AVR1003: Using the XMEGA Clock System

Distributed Algorithms

Failure Detectors Outline

Fault Tolerance Management for a Hierarchical Gridrpc Middleware