Decentralised Multi-Robot Systems

Decentralised Multi-Robot Systems Towards Coordination in Real World Settings Thesis submitted in accordance with the requirements of the University of Liverpool for the degree of Doctor in Philosophy by Daniel Claes April 2018 Contents List of Figures vii List of Tables xiii Preface xv Abstract xvii Acknowledgements xix 1 Introduction1 1.1 Content of this Thesis.............................3 1.2 Structure and Contributions of this Thesis..................4 2 Background7 2.1 Navigation in a Shared Workspace......................7 2.1.1 The Velocity Obstacle Paradigm...................8 2.1.2 Incorporating Kinematic and Dynamic Constraints......... 12 2.1.3 Selection of a Collision-Free Velocity................. 13 2.2 Decision Making and Task Allocation.................... 15 2.2.1 Markov Decision Processes...................... 15 2.2.2 Solution Methods for Markov Decision Processes.......... 16 2.2.3 Extensions for Planning with Multiple Agents............ 21 2.2.4 Monte Carlo Tree Search....................... 22 2.2.5 Multi-Robot Task Allocation and Market-based Approaches.... 24 2.3 Robotic Platforms and Core Components.................. 27 2.3.1 Localisation............................... 28 2.3.2 Navigation using the Dynamic Window Approach and a Global Plan................................... 30 2.4 Summary.................................... 31 3 Collision Avoidance in Shared Workspaces 33 3.1 Related Work.................................. 34 3.1.1 Local control.............................. 35 3.1.2 Collision avoidance in shared workspaces.............. 36 3.2 Convex Outline Collision Avoidance under Localization Uncertainty... 36 3.3 Towards Human-Safe Pro-Active Collision Avoidance............ 41 3.3.1 Problems of Convex Outline Collision Avoidance under Localiza- tion Uncertainty............................ 41 iii 3.3.2 Static Obstacles with Velocity Obstacle-based Methods...... 42 3.3.3 Convex Outline Collision Avoidance under Localization Uncer- tainty with Monte Carlo sampling.................. 43 3.3.4 Convex Outline Collision Avoidance under Localization Uncer- tainty with the Dynamic Window Approach............. 45 3.3.5 Pro-Active Collision Avoidance.................... 47 3.3.6 Complexity Analysis of the Approach................ 47 3.4 Experiments and Results............................ 48 3.4.1 Simulation Runs............................ 49 3.4.2 Real World Experiments........................ 56 3.5 Summary.................................... 62 4 Spatial Task Allocation Problems 63 4.1 Related Work.................................. 65 4.2 Spatial Task Allocation Problems....................... 67 4.3 Spatial Task Allocation Problems Formulated as Multi-Agent Markov De- cision Processes................................. 69 4.4 Online Approximations for Solving Spatial Task Allocation Problems... 71 4.4.1 Subjective Approximations...................... 72 4.4.2 Phase-Approximations......................... 75 4.4.3 Social Laws to Overcome Symmetries................ 77 4.5 Experiments and Results............................ 77 4.6 Summary.................................... 81 5 Warehouse Commissioning as a Spatial Task Allocation Problem 83 5.1 Related Work.................................. 85 5.2 Warehouse Commissioning with multiple Robots.............. 86 5.2.1 A commissioning Spatial Task Allocation Problem......... 86 5.3 Monte Carlo Tree Search for SPATAPs.................... 88 5.3.1 Dealing with huge State Spaces.................... 89 5.3.2 Incentivizing Agents.......................... 89 5.3.3 Modelling Other Agents: Greedy Rollout Policies.......... 89 5.4 Experiments and Results............................ 92 5.4.1 Fixed allocation vs online re-allocation................ 93 5.4.2 Different rollout policies........................ 95 5.4.3 Planning times............................. 95 5.4.4 Limited vs unlimited capacities.................... 96 5.4.5 Larger warehouses........................... 98 5.4.6 Positioning............................... 99 5.5 Summary.................................... 99 6 Real World Multi-Robot Warehouse Commissioning 101 6.1 Robot Competitions and Related Work................... 102 6.1.1 Related Work.............................. 102 6.2 Problem Description and Environment.................... 103 6.3 Required Components for a Real World Application............ 105 6.3.1 Localisation, Navigation and Collision Avoidance.......... 106 6.3.2 Object Recognition........................... 106 iv 6.3.3 Inverse Kinematics for the youBot Arm............... 109 6.3.4 Commissioning Spatial Task Allocation Problems in the Real World112 6.3.5 Robot Behaviour Creation....................... 113 6.4 Experiments and Results............................ 115 6.4.1 Object Recognition and Grasping................... 116 6.4.2 Warehouse Commissioning Results.................. 120 6.5 Summary.................................... 129 7 Conclusions 131 7.1 Contributions.................................. 131 7.2 Applications to other domains......................... 133 7.3 Limitations of the approach.......................... 134 7.4 Future Work.................................. 134 A Robot Systems 137 A.1 youBot...................................... 137 A.2 Turtlebot.................................... 140 Abbreviations 141 Bibliography 143 List of my Publications 153 Awards and Achievements 155 v List of Figures 2.1 Creating the different velocity obstacles out of a workspace configuration. (a) A workspace configuration with two robots RA and RB. (b) Translating the situation into the velocity space and the resulting Velocity Obstacle (VO) for RA........................................9 vA+vB 2.2 (a) Translating the VO by 2 results in the Reciprocal Velocity Obstacle (RVO), i.e. each robot has to take care of half of the collision avoidance. (b) Translating the apex of the RVO to the intersection of the closest leg of the RVO to the own velocity, and the leg of the VO that corresponds to the leg that is furthest away from the own velocity. This encourages passing the robot on a preferred side, i.e. in this example passing on the left. The resulting cone is the Hybrid Reciprocal Velocity Obstacle (HRVO)....... 11 2.3 Truncation. (a) Truncation of a VO of a static obstacle at τ = 2. (b) Approximating the truncation by a line for easier calculation.......... 12 2.4 Non-holonomic tracking error........................... 12 2.5 (a) ClearPath enumerates intersection points for all pairs of VOs (solid dots). In addition the preferred velocity vA is projected on the closest leg of each VO (open dots). The point closest to the preferred velocity (dashed line) and outside of all VOs is selected as new velocity (solid line). The next best points are shown for reference. (b) Optimal Reciprocal Collision Avoidance (ORCA) creates a convex representation of the safe velocity space and uses linear programming to find the closest point to the preferred velocity. (c) We can also use Monte Carlo sampling to select the best velocity. The distance for each sample to the preferred velocity (dashed line) is evaluated. If the sample falls within any VO, it is discarded. Yellow shows a high rating and blue is a low rating................................. 14 2.6 A map created by gmapping from the smARTLab laboratory and the adja- cent corridor.................................... 28 2.7 Typical particle filter situations. (a) A well localised robot at the end of a corridor resulting in a particle cloud with small variance. (b) In an open ended corridor the sensor only provides valid readings to the sides, resulting in an particle cloud elongated in the direction of the corridor. (c) In an open space no sensor readings result in a particle cloud driven purely by the motion model.................................... 29 vii 2.8 Dynamic Window Approach (DWA) generates various sample control inputs and uses forward simulations in the configuration space to detect if the given combination of control inputs leads to a collision. In this example, the lower two trajectories lead to a collision and will be excluded............. 30 3.1 The corridor problem: Approximating the localisation uncertainty (and the footprint) with circumscribed circles, vastly overestimates the true sizes, such that the robots do not fit next to each other. Thus, the HRVO together with the VO of the walls invalidates all forward movements............. 37 3.2 Using Convex Outline Collision Avoidance under Localization Uncertainty (COCALU) solves the corridor problem. Since the robots' footprints and localisation uncertainty are approximated with less overestimation, the robots can pass along the corridor without a problem................. 38 3.3 (a) Three iterations of convex hull peeling. (b) Minkowski sum of the resulting convex polygon and a circular footprint.................... 39 3.4 (a) Constructing a VO out of a robots' footprint and an obstacle line- segment. (b) Truncating the VO by τ....................... 42 3.5 Increasing the footprint of the other robots is one way to create more safety. However this also reduces the available safe velocities to choose from and could lead to potential problems in dense configurations, where the whole velocity space becomes unavailable........................ 43 3.6 Different cost functions for evaluating a velocity. Parts in yellow depict lower, i.e. better costs, and parts in blue show higher costs. The distances to the preferred velocity (a) and current velocity (b) are shown as cost, where further

Decentralised Multi-Robot Systems

Information Systems Foundations: Constructing and Criticising

Biomimicry--A Review

Possibilities at the Intersection of AI and Blockchain Technology

Robotchain: Artificial Intelligence on a Blockchain Using Tezos Technology

Immune-Inspired Self-Healing Swarm Robotic Systems Amelia

Modelling and Verification of Multi-Agent Systems Via Sequential Emulation

Rural Livelihoods and Poverty Reduction Policies

Designing Collective Decision-Making Dynamics for Multi-Agent Systems with Inspiration from Honeybees

Investigating the Impact of Global Stablecoins

Variability in Individual Assessment Behaviour and Its Implications for Collective Decision-Making

On Artificial Immune Systems and Swarm Intelligence

Variability in Individual Assessment Behaviour and Its Implications for Collective Decision-Making