Design Issues in Distributed Management Information Systems
Total Page:16
File Type:pdf, Size:1020Kb
DESIGN ISSUES IN DISTRIBUTED MANAGEMENT INFORMATION SYSTEMS by JACOB AKOKA B.S., Faculte des Sciences-Paris (1970) S.M., University of Paris VI (1971) Doctorat 3eme Cycle, University of Paris VI (1975) SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE .OF DOCTOR OF PHILOSOPHY at the MASSACHUSETTS INSTITUTE OF TECHNOLOGY May 4, 1978 Signature of Author .................................................. Sloan School of Management, May 4, 1978 Certified by ......................................................... Thesis Supervisor Accepted by ......................................................... Chairman, Department Committee DESIGN ISSUES IN DISTRIBUTED MANAGEMENT INFORMATION SYSTEMS by Jacob Akoka Submitted to the Department of Management on May 4, 1978 in partial fulfillment of the requirements for the Degree of Doctor of Philosophy ABSTRACT Due to the advances in computer network technology and the steadily decreasing cost of hardware, distributed information systems have become a potential alternative to centralized information systems.' This thesis analyzes issues related to the design of distributed information systems. Most of the research done in the past can be characterized by a piece- meal approach since it tends to consider the computer network design issue and the distributed data base design issue separately. The critical survey presented in Chapter 2 points out the need to integrate both issues in an overall design approach. In an attempt to incorporate most of the factors that compose distributed information systems, we present a global model in which network topology, communication channels capacity, size of computer hardware, pricing schemes, and routing dis- ciplines are interrelated in an optimal design. In addition, we show how to derive from the global model a design model for distributed database systems. To solve the two distributed information system models (integer nonlinear programming models), we propose a mathematical programming algorithm that has the following characteristics: - it takes advantage of the special structure of the models; - it uses a specific binary bounded branch and bound method which limits a priori the number of the nodes of the tree. Comparing with previous work in this area, the algorithm is more efficient than traditional branch and bound techniques and can avoid the disadvantages of heuristic methods which were widely used in the optimi- zation of computer networks and distributed database systems. The global model and the solution procedure are used to design the distributed information system for a large bank in Europe. Our solution proved to be-better in terms of optimal configuration than the bank's experimental network. In addition, our model for the design of distri- buted database systems performed better than past models. An efficient computer code allowing to solve real-life distributed information system problems, with little programming effort for the users, was developed and tested. An extension of our global model to the issue of centralization versus decentralization of information systems is presented. The inform- ation system is divided in three components (systems operations, systems development, and systems management), and a specific submodel is used to facilitate the decision for each component. Finally, further research in the area of design models for distributed information systems is discussed. Peter P. S. Chen, Assistant Professor ACKNOWLEDGEMENTS I would like to express my gratitude to my thesis advisor, Pro- fessor P.P.S. Chen, for his careful guidance and his valuable comments; his patience and constructive criticism helped me to integrate many concepts within a coherent frame. His encouragement has been invaluable. Thank you so much Peter! I would also like to thank Professors J.D.C. Little and S.E. Madnick for their insightful comments. Their suggestions contributed to broaden my view of the distributed management information system design problem. Thank you for your help and encouragement. I am very grateful to Maitre Olivier, General Director of ESSEC (Ecole Superieure des Sciences Economiques et Commerciales) for his moral support and for facilitating my staying at the Sloan School of Management (M.I.T.). I would like to thank the following for the financial aid received in support of this doctoral program: l'Ecole Superieure des Sciences Economiques et Commerciales (ESSEC), La Fondation Nationale Pour l'Enseignement de la Gestion des Entreprises (FNEGE), and l'Institut de Recherche d'Informatique et d'Automatique (IRIA). Thanks to my colleagues, D. Briolat, H. Dayan, R. Thietart, G. Viens and B. Yon for their help. I wish to express my personal appreciation to Judith Mason for her excellent typing. Last but not least, I would like to sincerely thank my wife, 5 Vivi, and my children Karen and Nathalie, for their encouragement and understanding during all phases of this work. 6 To my mother TABLE OF CONTENTS Page CHAPTER ONE -- INTRODUCTION I. Toward Distributed Information Systems 12 II. Statement of the Research . ... ... 14 III. Research Methodology ..... ..... 17 IV. Outline of Dissertation .... .... 18 CHAPTER TWO -- DISTRIBUTED INFORMATION SYSTEMS: A STATE OF THE ARTS STUDY I. Introduction .. ... ... ... ... ... ... ... 24 II. Major Arguments Used for Distributed Information Systems . 25 A. Economy of Scale . .. .. ... .. .. .. .. ... 25 B. Sophistication of Applications . .. ... ... ... 29 C. Quality of Systems Development ... ... ,.... 30 III. Computer Network: A Survey .. .. ... .. .. .. .. 32 A. Network Topologies .. .. .. .. .... .. .. .. 32 B. Communication Systems .. ... .. .. ... ..... 39 C. Computer Network Design Principles.... .. .. ..43 D. Solutions for the Optimum Design .. .. .. .. .. 46 IV. Distributed Database Systems: An Overview . .. .. .. ..52 A. Introduction . .. .. .. ... .. .. 52 B. Classification of Distributed Databases..... .. ..54 C. Design Models for Distributed Database Systems . 59 V. Conclusion .. .. ... .. ... .. ... .. 66 CHAPTER THREE -- DESIGN AND OPTIMIZATION OF DISTRIBUTED INFORMATION SYSTEMS I. Introduction .. .. ... ... ... ... .. 76 II. The Global Model . .... .... ... .. ... 81 A. Assumptions and Modelling Aspects .. -- -- 82 B. The Objective Function ... ... .... 87 C. The Constraints .... ... ... .... 95 D. Final Formulation of the Problems ... ... 100 E. Model Type .. ... ... .... ... .. .. 104 Page F. Relevances of the Model ... ... ... ... 104 (i) Relevance to the Computer Network Design Problem . .... .... .... 104 (ii) Relevance to the Centralization- Decentralization Issue ... ..... ... 105 (iii) Relevance to the Distributed Databases Design Problem .............. 106. III. The Optimal Distributed Database Systems Model ... 107 IV. Possible Extensions of the Model ...... .... 111 A. Access Time Constraints ........ ..... 111 B. Reliability Constraints ........ ..... 112 V. Conclusion .... ........ ........ 113 CHAPTER FOUR -- A MATHEMATICAL PROGRAMMING ALGORITHM FOR DISTRIBUTED INFORMATION SYSTEMS I. Introduction ... ............. ..... 116 II. The Algorithm: A Brief Outline ... ........ 122 III. Detailed Description of the Algorithm . ..... .. 124 IV. The Computer Implementation of the Algorithm ... .. 132 V. Conclusion .. .... .... .... .... .... 137 CHAPTER FIVE -- APPLICATION OF THE DESIGN MODELS FOR DISTRIBUTED INFORMATION SYSTEMS I. Introduction .................... 140 II. Application of the Global Model for the Design of a Distributed Information System for a Large Bank . .. 141 A. The Settings . ..... .... ..... ... 141 B. The Data Processing Group .. .......... .. 142 C. The Decentralization ....... ....... 144 D. The Bank's Experimental Network ....... .. .. 145 E. Application of our Model ....... ..... .. 151 F. Sensitivity Analysis .... .......... 168 III. Optimal Distributed Database Systems: An Application of the Second Model ....... ....... ... 177 A. First Case - No Return Flow of Information and no Storage Cost of Database ...... ..... 184 Page B. Second Case - Existence of Storage Cost for Database ... .... .... .... .... .... 187 C. Third Case - Existence of the Return Flow of Information . ... ... .... ... ... ... 192 IV. Conclusion .. .... .... .... ... .... ... 198 CHAPTER SIX -- APPLICATION OF THE GLOBAL MODEL TO THE ISSUE OF CENTRALIZATION VERSUS DECENTRALIZATION OF INFOR- MATION SYSTEMS I. Introduction ....................... 200 II. A Critical Survey of the Methods used in the Past to Solve the Issue of Centralization Versus Decentrali- zation of Information Systems ....... ....... 201 III. The Decision Support Model ................ 212 A. Introduction ..................... 212 B. System Operations: An Optimization Model ....... 214 C. System Development: A Decision Table ......... 221 D. System Management: Guidelines ... ........ 226 IV. ConLclusion ...... .......... ........ 229 CHAPTER SEVEN -- SUMMARY, PERSISTING PROBLEMS AND DIRECTIONS FOR FURTHER RESEARCH I. Summary .... ... .... .... .... ... ... 233 A. The Design and Optimization of Distributed Systems . 233 B. The Solution Procedure . ... ... .. .. ..... 234- C. The Computer Code .. .. ... ... ... ... .. 236 D. Real-Life Applications . ... ... ... ... .. 236 E. The Issue of Centralization Versus Decentralization of Information Systems .. ... .. ... .. .*. 237 II. Persisting Problems in Distributed Information Systems .. 239 III. Directions for Further Research ........ ..... 242 APPENDICES A. Mathematical Description of Step 3 of our Algorithm . 247 B. Advantages and Disadvantages