Transfer Learning in Takagi-Sugeno Fuzzy Models

Hua Zuo
Faculty of Engineering and Information Technology
University of Technology Sydney

A thesis submitted for the Degree of Doctor of Philosophy
May 2018

Certificate of Authorship/Originality

I certify that the work in this thesis has not previously been submitted for a degree nor has it been submitted as part of the requirements for a degree except as fully acknowledged within the text.

I also certify that the thesis has been written by me. Any help that I have received in my research work and the preparation of the thesis itself has been acknowledged. In addition, I certify that all information sources and literature used are indicated in the thesis.

Signature of candidate:
Date:

Acknowledgements

I would like to express my sincere gratitude to my principal supervisor, Professor Guangquan Zhang, and my co-supervisors, Professor Jie Lu and Dr Vahid Behbood. Their comprehensive guidance covered all aspects of my PhD study, including research methodology, research topic selection, experiments, academic writing skills and thesis writing, even sentence structure and formulas. Their critical comments and suggestions strengthened my study significantly. Their strict academic attitude and respectful personalities benefited my PhD study and will be a great treasure throughout my life. Without their excellent supervision and continuous encouragement, this research would not have been finished on time. Thanks to all of you for your kind help!

I am grateful to all members of the Decision Systems and e-Service Intelligent (DeSI) Lab in the Centre for Artificial Intelligence (CAI) for their careful participation in my presentation and their valuable comments on my research. I wish to express my appreciation to all friends (especially Hongshu Chen, Qian Zhang, Shan Xue and Fan Dong) for their help, company and valued comments, not only on my study but also on my life.
I would like to thank Ms Sue Felix and Ms Jem Moore for helping me to proofread my publications and this thesis.

I wish to express my appreciation for the financial support I received for my study. Special thanks go to the University of Technology Sydney (UTS) and the China Scholarship Council (CSC). Thanks also for the support of the Australian Research Council under DP 140101366.

Last but not least, I would also like to thank my family members. Thanks to my mother, father and my brother for their continued encouragement and generous support. Thank you for supporting me and always being there for me during my ups and downs. Pursuing a PhD has always been a long-term challenge for me and a dream of my brother’s. This dream would not have been achieved without the love, great empathy and kind assistance of my family.

Abstract

In classical data-driven machine learning methods, massive amounts of labeled data are required to build a high-performance prediction model. However, the amount of labeled data in many real-world applications is insufficient, so establishing a prediction model is impossible. Transfer learning has recently emerged as a way of exploiting previously acquired knowledge to solve new yet similar problems much more efficiently and effectively. It exploits the knowledge accumulated in auxiliary domains to help construct prediction models in a target domain with inadequate training data.

Most existing transfer learning methods address classification tasks, whereas studies on transfer learning for regression problems, an important category of prediction models, are still scarce. In addition, the existing works ignore the inherent phenomenon of uncertainty, a crucial factor during the knowledge transfer process.

To fill these gaps, this research develops algorithms and methods to deal with transfer learning in homogeneous and heterogeneous spaces using fuzzy rule-based models.
Fuzzy rules are first generated from the source domain through a learning process. These rules, as acquired knowledge, are then transferred to the target domain. First, a novel fuzzy rule-based domain adaptation method is proposed to transfer knowledge between domains in homogeneous spaces. It utilizes the existing fuzzy rules of the source domain and modifies the input space with nonlinear mappings to adapt to the current regression tasks in the target domain. Second, a granular fuzzy domain adaptation framework, comprising three methods, is developed to handle the knowledge transfer problems based on the idea of granular computing. Third, a fuzzy domain adaptation method is developed to handle the case in which the numbers of fuzzy rules in the two domains do not match. In addition, the proposed methods are also used to solve the classification problems in transfer learning. Fourth, an innovative method that combines an infinite Gaussian mixture model with active learning is presented to discover the structure of data and actively augment information in a target domain. Fifth, a fuzzy heterogeneous domain adaptation method is proposed to transfer knowledge in heterogeneous spaces.

The proposed algorithms and methods are validated in each step of development using experiments performed on both synthetic and real-world datasets. The results show that this study significantly improves the performance of existing models when solving new tasks in the target domain in both homogeneous and heterogeneous domain adaptation settings.

Table of Contents

Certificate of Authorship/Originality
List of Figures
List of Tables
1 Introduction
  1.1 Background
  1.2 Research questions and objectives
  1.3 Research contributions
  1.4 Research significance
  1.5 Thesis structure
  1.6 Publications related to this thesis
2 Literature Review
  2.1 Definitions and categories of transfer learning and the category
  2.2 Different types of transfer learning methods based on content
  2.3 Types of transfer learning models based on domain knowledge
  2.4 Types of transfer learning based on computational intelligence techniques
    2.4.1 Transfer learning using neural networks (NN)
      2.4.1.1 Transfer learning using deep NN
      2.4.1.2 Transfer learning using multiple task NN
      2.4.1.3 Transfer learning using radial basis function NNs
    2.4.2 Transfer learning using Bayes
      2.4.2.1 Transfer learning using naïve Bayes
      2.4.2.2 Transfer learning using Bayesian network
      2.4.2.3 Transfer learning using a hierarchical Bayesian model
    2.4.3 Transfer learning using fuzzy systems and genetic algorithms
    2.4.4 Applications of transfer learning
      2.4.4.1 Natural language processing
      2.4.4.2 Computer vision and image processing
      2.4.4.3 Biology
      2.4.4.4 Finance
      2.4.4.5 Business management
  2.5 Granular computing
    2.5.1 Concepts of granular computing
    2.5.2 The formalisms of granular computing
    2.5.3 The application of granular computing
  2.6 Summary
3 Fuzzy Homogeneous Domain Adaptation
  3.1 Introduction
  3.2 Takagi-Sugeno fuzzy model
  3.3 Fuzzy domain adaptation
    3.3.1 Problem statement
    3.3.2 Fuzzy rule-based domain adaptation in homogeneous space
    3.3.3 Knowledge transfer in fuzzy rule-based models
  3.4 Empirical results analysis
    3.4.1 Generation of synthetic datasets
    3.4.2 Comparison of evolution algorithms: particle swarm optimization (PSO) and differential evolution (DE)
    3.4.3 Comparison of different ways of constructing mappings
    3.4.4 Experiments on real-world datasets
  3.5 Summary
4 Granular Domain Adaptation in Takagi-Sugeno Fuzzy Models
  4.1 Introduction
  4.2 Problem statement
  4.3 The definitions of fuzzy domain adaptation
  4.4 Granular domain adaptation in Takagi-Sugeno fuzzy models
    4.4.1 Granular transfer learning framework
    4.4.2 Methods and algorithms of GFRDA
    4.4.3 Performance index
  4.5 Empirical results analysis
    4.5.1 Experiments on synthetic datasets
      4.5.1.1 Datasets and experimental settings
      4.5.1.2 Experimental results
    4.5.2 Experiments on real-world datasets
    4.5.3 Fuzzy transfer learning for label space adaptation
  4.6 Summary
5 Fuzzy Domain Adaptation for the Dismatch of Fuzzy Rules
  5.1 Introduction
  5.2 Problem statement
  5.3 Fuzzy transfer learning for dismatch of fuzzy rules
  5.4 Empirical results analysis
    5.4.1 Datasets and experimental settings
    5.4.2 Validation of algorithm effectiveness
    5.4.3 Parameter sensitivity analysis
  5.5 Comparison of two ways of constructing mappings
  5.6 Application to the classification problems
  5.7 Summary
