Priority Inheritance with Backtracking for Iterative Multi-Agent Path Finding

Priority Inheritance with Backtracking for Iterative Multi-agent Path Finding Keisuke Okumura1 , Manao Machida2 , Xavier Defago´ 1 and Yasumasa Tamura1 1Tokyo Institute of Technology 2NEC Corporation [email protected], [email protected], fdefago, [email protected] Abstract That is, whenever an agent reaches a goal, it receives a new one. This rules out any simple adaptation of offline The Multi-agent Path Finding (MAPF) problem and computationally-intensive optimal solutions due to pro- consists in all agents having to move to their own hibitive computations, and new goals typically appearing at destinations while avoiding collisions. In practical runtime. Furthermore, centralized solutions being inherently applications to the problem, such as for navigation problematic for systems with many agents due to scalabil- in an automated warehouse, MAPF must be solved ity concerns, decoupled algorithms such as prioritized route iteratively. We present here a novel approach to it- planning are adequate, especially if fully decentralized. De- erative MAPF, that we call Priority Inheritance with centralized solutions are highly attractive to multi-agent/robot Backtracking (PIBT). PIBT gives a unique priority systems for many reasons, including a higher potential for to each agent every timestep, so that all movements robustness and fault-tolerance, better scalability, and lower are prioritized. Priority inheritance, which aims at production cost [Yan et al., 2013]. Finally, implementations dealing effectively with priority inversion in path without global communication are preferable since they pro- adjustment within a small time window, can be ap- vide better potential for scalability and concurrency. plied iteratively and a backtracking protocol pre- iterative vents agents from being stuck. We prove that, re- In this paper, we present a novel algorithm for Multi-agent Path Finding Priority gardless of their number, all agents are guaranteed (iterative MAPF), called Inheritance with Backtracking to reach their destination within finite time, when (PIBT), which focuses on the the environment is a graph such that all pairs of ad- adjacent movements of multiple agents based on prioritized jacent nodes belong to a simple cycle of length 3 planning in a short time window. Priority inheritance is a or more (e.g., biconnected). Our implementation well-known approach to deal effectively with priority inver- [ et al. ] of PIBT can be fully decentralized without global sion in real-time systems Sha , 1990 , and is applied communication. Experimental results over vari- here to path adjustment. When a low-priority agent X im- ous scenarios confirm that PIBT is adequate both pedes the movement of a higher-priority agent Y, agent X for finding paths in large environments with many temporarily inherits the higher-priority of Y. To avoid a sit- agents, as well as for conveying packages in an au- uation where agents are stuck waiting, priority inheritance is tomated warehouse. executed in combination with a backtracking protocol. Since PIBT assumes that agents can only communicate when lo- cated within two hops of each other (e.g., Manhattan distance 1 Introduction 2 in a grid environment), it can be implemented in a fully In systems using physical moving agents, it is essential to al- decentralized way and inherits the above characteristics. arXiv:1901.11282v3 [cs.MA] 23 Jun 2019 low agents to reach their own destinations smoothly without Our main contributions are two-folds: 1) we propose an al- collisions, by providing valid paths while minimizing excess gorithm ensuring that every agent always reaches its destina- travel time. This problem, known as Multi-agent Path Find- tion within finite time as long as the environment satisfies the ing (MAPF), is however computationally difficult due to the condition that all pairs of adjacent nodes belong to a simple search space growing exponentially as the number of agents cycle of length 3 or more (includes undirected biconnected increases. MAPF is an important problem due to its appli- graphs); and 2) we evaluate that algorithm in various envi- cations in various domains, e.g., traffic control [Dresner and ronments, showing its practicality. In particular, experimental Stone, 2008], automated warehouse [Wurman et al., 2008], results over various scenarios confirm its adequateness both or airport surface operation [Morris et al., 2016], etc. for finding paths in large environments with many agents, as Previous research on MAPF focuses on solving a “one- well as for conveying packages in an automated warehouse. shot” version of the problem, i.e., to make agents reach their The paper is organized as follows. Section 2 reviews ex- goal from their initial position only once. In practical ap- isting algorithms for MAPF and a variant called Multi-agent plications, such as conveying packages in a warehouse [Ma Pickup and Delivery (MAPD). Section 3 defines iterative et al., 2017], MAPF must however be solved iteratively. MAPF, which provides an abstract framework for multiple moving agents. Section 4 presents the PIBT algorithm and its a delivery location. The paper proposes two decoupled algo- theoretical analysis. Section 5 presents empirical results on rithms based on HCA∗ for MAPD, called respectively Token both path finding (MAPF) and pickup and delivery (MAPD). Passing (TP) and Token Passing with Task Swap. These algo- Section 6 concludes the paper and discusses future work. rithms can easily be adapted to be decentralized but require a certain amount of non-task endpoints where agents do not 2 Related Work block other agents’ motion. Many complete MAPF algorithms exist, such as A∗ with Op- erator Decomposition [Standley, 2010], Enhanced Partial Ex- pansion A∗ [Goldenberg et al., 2014], Increasing Cost Tree 3 Problem Definition Search [Sharon et al., 2013], Conflict-based Search [Sharon ∗ et al., 2015],M [Wagner and Choset, 2015], etc. The problem of iterative Multi-agent Path Finding (itera- Finding an optimal solution is however NP-hard [Yu and tive MAPF) is a generalization of problems addressing mul- LaValle, 2013], and optimal algorithms do not scale in the tiple moving agents, including both Multi-agent Path Find- number of agents and are too costly for iterative use, hence ing (MAPF) and Multi-agent Pickup and Delivery (MAPD). the need for sub-optimal solvers, e.g., FAR [Wang et al., Since iterative MAPF is an abstract model we do not intend to 2008], MAPP [Wang and Botea, 2011], Tree-based Agent solve it directly, rather, it is necessary to embody task creation Swapping Strategy [Khorshid et al., 2011], BIBOX [Surynek, according to target domains. 2009], CBS variants [Barer et al., 2014; Cohen et al., 2016]. Push and Swap/Rotate [Luna and Bekris, 2011; de Wilde et al., 2013], which partly inspired our proposal, are sub- The system consists of a set of agents, A = fa1; : : : ; ang, optimal centralized approaches that allow one agent to push and an environment given as a graph G = (V; E), where another away from its path. However, they only allow a sin- agents occupy nodes in V and move along edges in E. Con- gle agent or a pair of agents to move at each timestep. En- sidering practical situations, G must be a simple (neither hanced Push and Swap include Parallel Push and Swap [Sajid loops nor multi-edges) and strongly-connected (every node et al., 2012] where all agents can move simultaneously, or is reachable from every other node) directed graph. This in- Push-Swap-Wait [Wiktor et al., 2014] which takes a decen- cludes all simple undirected graphs that are connected. tralized approach in narrow passages. DisCoF [Zhang et al., Let vi(t) denote the node occupied by agent ai at discrete 2016] combines decoupled prioritized planning with fallback time t. The initial position of agent ai is vi(0) and given to coupled push and swap in case of conflicts. PIBT, the pro- as input. At each timestep, ai selects one node vi(t + 1) 2 posed method, can be seen as a combination of safe “push” fvjv 2 V; (vi(t); v) 2 Eg [ fvi(t)g as its location for the operations, thanks to backtracking and dynamic priorities. next timestep. Agents must avoid 1) collision: vi(t) 6= vj(t); Prioritized planning is computationally cheap and hence and 2) intersection with others: vi(t) 6= vj(t + 1) _ vj(t + ∗ ∗ attractive for MAPF. Hierarchical Cooperative A (HCA ) 1) 6= vj(t). We do not prohibit rotations, so vi(t + 1) = ∗ ∗ and Windowed HCA (WHCA ) [Silver, 2005] are decou- vj(t)^vj(t+1) = vk(t)^· · ·^vl(t+1) = vi(t) is possible. pled approaches in that they plan a path for each agent Let Γ = fτ1; τ2;::: g be a dynamic set of tasks with new one after the other while avoiding collisions with previously tasks being added over time, i.e., not all tasks are known computed paths. WHCA∗ uses a limited lookahead win- ∗ initially. A task is defined as a finite set of goals τj = dow. Our proposal, PIBT, is based on WHCA with a win- fg ; g ; : : : ; g g where g 2 V , possibly with dependencies dow size of one. Decentralized solutions, i.e., where each 1 2 m k as a partial order on gk. Let tj be the timestep when τj is agent computes its own path based on information from added to Γ. An agent is free when it has no assigned task. A other agents, are inherently decoupled. Decentralized solu- task τj 2 Γ can only be assigned to free agents. When τj is tions for MAPF [Velagapudi et al., 2010; Cˇ ap´ et al., 2015; assigned to ai, ai starts visiting goals in τj. When all goals Chouhan and Niyogi, 2015] are hence usually prioritized. have been visited, τj is completed and ai becomes free. The negotiation process for ordering priorities studied by Azarm et al.

Priority Inheritance with Backtracking for Iterative Multi-Agent Path Finding

Lecture 4 Dynamic Programming

Exhaustive Recursion and Backtracking

Backtrack Parsing Context-Free Grammar Context-Free Grammar

Backtracking / Branch-And-Bound

Module 5: Backtracking

CS/ECE 374: Algorithms & Models of Computation

Toward a Model for Backtracking and Dynamic Programming

Tree Search Backtracking Search Backtracking Search

CSCI 742 - Compiler Construction

CSE 100: GRAPH SEARCH Announcements

Lecture 9: Games I

Lecture 3: Backtracking