Kwanashie, Augustine (2015) Efficient Algorithms for Optimal Matching Problems Under Preferences
Total Page:16
File Type:pdf, Size:1020Kb
Kwanashie, Augustine (2015) Efficient algorithms for optimal matching problems under preferences. PhD thesis. http://theses.gla.ac.uk/6706/ Copyright and moral rights for this thesis are retained by the author A copy can be downloaded for personal non-commercial research or study This thesis cannot be reproduced or quoted extensively from without first obtaining permission in writing from the Author The content must not be changed in any way or sold commercially in any format or medium without the formal permission of the Author When referring to this work, full bibliographic details including the author, title, awarding institution and date of the thesis must be given Glasgow Theses Service http://theses.gla.ac.uk/ [email protected] Efficient Algorithms for Optimal Matching Problems Under Preferences Augustine Kwanashie Submitted in fulfillment of the requirements for the degree of Doctor of Philosophy School of Computing Science College of Science and Engineering University of Glasgow September 2015 c Augustine Kwanashie 2015 Abstract In this thesis we consider efficient algorithms for matching problems involving pref- erences, i.e., problems where agents may be required to list other agents that they find acceptable in order of preference. In particular we mainly study the Stable Marriage problem (sm), the Hospitals / Residents problem (hr) and the Student / Project Allo- cation problem (spa), and some of their variants. In some of these problems the aim is to find a stable matching which is one that admits no blocking pair. A blocking pair with respect to a matching is a pair of agents that prefer to be matched to each other than their assigned partners in the matching if any. We present an Integer Programming (IP) model for the Hospitals / Residents problem with Ties (hrt) and use it to find a maximum cardinality stable matching. We also present results from an empirical evaluation of our model which show it to be scalable with respect to real-world hrt instance sizes. Motivated by the observation that not all blocking pairs that exist in theory will lead to a matching being undermined in practice, we investigate a relaxed stability criterion called social stability where only pairs of agents with a social relationship have the ability to undermine a matching. This stability concept is studied in instances of the Stable Marriage problem with Incomplete lists (smi) and in instances of hr. We show that, in the smi and hr contexts, socially stable matchings can be of varying sizes and the problem of finding a maximum socially stable matching (max smiss and max hrss respectively) is NP-hard though approximable within 3=2. Furthermore we give polynomial time algorithms for three special cases of the problem arising from restrictions on the social network graph and the lengths of agents' preference lists. We also consider other optimality criteria with respect to social stability and establish inapproximability bounds for the problems of finding an egalitarian, minimum regret and sex equal socially stable matching in the sm context. We extend our study of social stability by considering other variants and restrictions of max smiss and max hrss. We present NP-hardness results for max smiss even under certain restrictions on the degree and structure of the social network graph as well as the presence of master lists. Other NP-hardness results presented relate to the problem of determining whether a given man-woman pair belongs to a socially stable matching and the problem of determining whether a given man (or woman) is part of at least one socially stable matching. We also consider the Stable Roommates problem with Incomplete lists under Social Stability (a non-bipartite generalisation of smi under social stability). We observe that the problem of finding a maximum socially stable matching in this context is also NP-hard. We present efficient algorithms for three special cases of the problem arising from restrictions on the social network graph and the lengths of agents' preference lists. These are the cases where (i) there exists a constant number of acquainted pairs (ii) or a constant number of unacquainted pairs or (iii) each preference list is of length at most 2. We also present algorithmic results for finding matchings in the spa context that are optimal with respect to profile, which is the vector whose ith component is the number of students assigned to their ith-choice project. We present an efficient algorithm for finding a greedy maximum matching in the spa context | this is a maximum matching whose profile is lexicographically maximum. We then show how to adapt this algorithm to find a generous maximum matching | this is a matching whose reverse profile is lexicographically minimum. We demonstrate how this approach can allow additional constraints, such as lecturer lower quotas, to be handled flexibly. We also present results of empirical evaluations carried out on both real world and randomly generated datasets. These results demonstrate the scalability of our algorithms as well as some interesting properties of these profile-based optimality criteria. Practical applications of spa motivate the investigation of certain special cases of the problem. For instance, it is often desired that the workload on lecturers is evenly dis- tributed (i.e. load balanced). We enforce this by either adding lower quota constraints on the lecturers (which leads to the potential for infeasible problem instances) or adding a load balancing optimisation criterion. We present efficient algorithms in both cases. Another consideration is the fact that certain projects may require a minimum number of students to become viable. This can be handled by enforcing lower quota constraints on the projects (which also leads to the possibility of infeasible problem instances). A technique of handling this infeasibility is the idea of closing projects that do not meet their lower quotas (i.e. leaving such project completely unassigned). We show that the problem of finding a maximum matching subject to project lower quotas where projects can be closed is NP-hard even under severe restrictions on preference lists lengths and project upper and lower quotas. To offset this hardness, we present polynomial time heuristics that find large feasible matchings in practice. We also present ip models for the spa variants discussed and show results obtained from an empirical evaluation carried out on both real and randomly generated datasets. These results show that our algorithms and heuristics are scalable and provide good matchings with respect to profile-based optimality. Table of Contents 1 Introduction 1 2 Literature Review of Matching Problems 5 2.1 Introduction . .5 2.2 The Stable Marriage problem (sm)....................7 2.2.1 Introduction . .7 2.2.2 Structure of stable matchings in sm ................9 2.2.3 Stable Marriage problem with Incomplete lists . 12 2.2.4 Stable Marriage problem with Ties . 12 2.2.5 Stable Marriage problem with Ties and Incomplete lists . 13 2.2.6 Other optimality criteria . 14 2.3 The Hospitals/Residents problem (hr).................. 15 2.3.1 Introduction . 15 2.3.2 Hospital/Residents with Ties . 16 2.3.3 Cloning hr instances . 18 2.4 The Stable Roommates problem (sr)................... 19 2.4.1 Rotations in the Stable Roommates problem . 20 2.4.2 Structure of stable matchings in sr ................ 20 2.4.3 Stable Roommates with Incomplete lists . 20 2.5 Locally Stable Matchings . 21 2.5.1 Introduction . 21 2.5.2 Locally Stable Matchings in the Job Market Context . 22 2.5.3 Other results relating to Locally Stable Matchings . 24 2.6 The Student/Project Allocation problem . 24 2.6.1 Introduction . 24 2.6.2 Two-sided preferences and stability . 25 2.6.3 One-sided preferences and profile-based optimality . 27 2.6.4 Other spa models and approaches . 29 2.7 Integer Programming approaches to matching problems . 30 3 An Integer Programming Approach to the Hospitals/Residents Prob- lem with Ties 32 3.1 Introduction . 32 3.2 An IP model for max hrt ......................... 33 3.3 Implementing the model . 35 3.3.1 Reducing the model size . 35 3.3.2 Improving optimisation performance . 37 3.3.3 Testing the implementation . 38 3.4 Empirical evaluation . 39 3.4.1 Using random instances . 40 3.4.2 Using real instances . 43 3.5 Open problems . 44 4 Socially Stable Matchings in the Hospitals/Residents Problem 46 4.1 Introduction . 46 4.2 Preliminary definitions and results . 47 4.3 Reduction from hrss to hr+sn ...................... 48 4.4 Hardness of max smiss ........................... 50 4.5 Approximating max hrss ......................... 51 4.5.1 Approximation results . 52 4.5.2 Inapproximability result . 57 4.6 Some special cases of hrss ......................... 58 4.6.1 (2; 1)-max smiss .......................... 59 4.6.2 hrss with a constant number of unacquainted pairs . 62 4.6.3 hrss with a constant number of acquainted pairs . 63 4.7 Empirical evaluation . 67 4.7.1 Introduction . 67 4.7.2 Varying instance size . 69 4.7.3 Varying the density of the social network . 69 4.8 Conclusion . 71 5 Further Algorithmic Results on Socially Stable Matchings 73 5.1 Introduction . 73 5.2 Fair socially stable matchings . 74 5.2.1 Introduction . 74 5.2.2 Egalitarian socially stable matchings . 75 5.2.3 Minimum regret socially stable matchings . 77 5.2.4 Sex-equal socially stable matchings . 78 5.3 Further restrictions of com smiss ..................... 80 5.3.1 Restrictions on the degree of the social network graph . 80 5.3.2 Restrictions on the structure of the social network graph . 83 5.3.3 Restrictions on the ordering of preference lists . 84 5.4 Minimum socially stable matchings . 86 5.5 Further hardness results for smiss ....................