IEEE TRANSACTIONS ON ROBOTICS, VOL. 26, NO. 4, AUGUST 2010 619 Evolutionary Trajectory Planner for Multiple UAVs in Realistic Scenarios Eva Besada-Portas, Luis de la Torre, Jesus´ M. de la Cruz, Member, IEEE, and Bonifacio de Andres-Toro´
Abstract—This paper presents a path planner for multiple un- critical when the UAVs must operate in urban terrains [2]), this manned aerial vehicles (UAVs) based on evolutionary algorithms paper focuses only on the second: the development of a planner (EAs) for realistic scenarios. The paths returned by the algorithm capable of solving the route-generation problem by obtaining fulfill and optimize multiple criteria that 1) are calculated based on the properties of real UAVs, terrains, radars, and missiles and a near-optimal path in every possible situation that UAVs can 2) are structured in different levels of priority according to the face [3]. selected mission. The paths of all the UAVs are obtained with the The advances in microcontroller design, optimization tech- multiple coordinated agents coevolution EA (MCACEA), which is niques, and control theory increment the number of fields, a general framework that uses an EA per agent (i.e., UAV) that both civil and military, where UAVs are currently being used: share their optimal solutions to coordinate the evolutions of the EAs populations using cooperation objectives. This planner works surveillance, reconnaissance, geophysical survey, environmen- offline and online by means of recalculating parts of the original tal and meteorological monitoring, aerial photography, search- path to avoid unexpected risks while the UAV is flying. Its search and-rescue tasks, etc. Although all these missions seem differ- space and computation time have been reduced using some special ent, the optimality of a route for any of them can be defined operators in the EAs. The successful results of the paths obtained by different optimization planning criteria (such as minimal fly- in multiple scenarios, which are statistically analyzed in the paper, and tested against a simulator that incorporates complex models ing time and/or path length) and fulfillment of some mission of the UAVs, radars, and missiles, make us believe that this planner constraints (such as flying at a given altitude or visiting some could be used for real-flight missions. points). Besides, the physical characteristics of the UAVs and Index Terms—Aerial robotics, multiobjective evolutionary algo- the environment also restrict the feasibility of any route and rithms (EAs), path planning for multiple mobile robot systems. should be considered by a realistic planner. On military missions, UAVs work in dangerous environments, and therefore, it is vital to always keep their routes apart from any type of threat and restricted zone. Therefore, the best routes are I. INTRODUCTION those that minimize the risk of destruction of the UAVs,optimize NMANNED aerial vehicles (UAVs) are aircrafts without some other planning criteria, and fulfill all the constraints that are U onboard pilots that can be remotely controlled or can fly imposed by the proposed mission, the physical characteristics autonomously based on preprogrammed flight plans [1]. The of the UAVs, and the environment. The original routes that are autonomy level achieved by the UAVs depends on the method- obtained offline by the planner are not always valid in dynamic ology used to control the vehicle and to generate its routes. Al- environments where the position of all the threats is not known though both tasks are equally important (and become especially beforehand. Therefore, the planner must be able to also work online in order to propose a new path during the UAVs mission when a pop-up (i.e., unknown threat) appears. Finding the optimal solution to the route-planning problem is Manuscript received December 1, 2009; revised March 22, 2010; accepted April 13, 2010. Date of publication May 24, 2010; date of current version nondeterministic-polynomial-time complete (NP-complete) [4], August 10, 2010. This paper was recommended for publication by Associate and therefore, the time required to solve it increases very quickly Editor G. Antonelli and Editor J.-P. Laumond upon evaluation of the review- as the size of the problem grows. The problem has been tackled ers’ comments. This work was supported by the Community of Madrid under Project “COSICOLOGI” S-0505/DPI-0391, by the Spanish Ministry of Educa- with different heuristics, such as mixed-integer linear program- tion and Science under Project DPI2006-15661-C02-01 and Project DPI2009- ming [5]–[8] and A* [9]–[12], and with nonlinear program- 14552-C02-01, and by the European Aeronautic Defense and Space Company ming [13]. The planners based on those techniques work with (Construcciones Aeronauticas Sociedad Anonima) under Project 353/2005. The work of E. Besada-Portas was supported by the Spanish Postdoctoral Grant a simplified version of the original problem, where the addi- EX-2007-0915 associated with the Prince of Asturias Endowed Chair of the tion of new constraints or objective criteria is a difficult task. University of New Mexico, University of New Mexico, Albuquerque, NM. This They consider only point-mass dynamics, discretize the solu- paper was presented in part at the Genetic Evolutionary Computation Confer- ence, Atlanta, GA, 2008, and in part at the 8th International FLINS Conference, tion space, and, in some cases [5]–[8], linearize the models. Madrid, Spain, 2008. Although the majority solve the single-UAV case, Richards and E. Besada-Portas, J. M. de la Cruz, and B. de Andres-Toro´ are with the Depar- How [7] and Raghunathan et al. [13] tackle the multi-UAV tamento de Computadores y Automatica,´ Universidad Complutense de Madrid, Ciudad Universitaria, Madrid 28040, Spain (e-mail: [email protected]; one. [email protected]; [email protected]). Evolutionary algorithms (EAs) are versatile optimizers that L. de la Torre is with the Department of Computer Sciences and Auto- have already been used to solve different UAV path-planning matic Control, Spanish University for Distance Education, Madrid 28080, Spain (e-mail: [email protected]). problems: Mittal and Deb [14], Nikolos et al. [15], [18], Digital Object Identifier 10.1109/TRO.2010.2048610 Hasircioglu et al. [16], Pehlivanoglu et al. [17], and Nikolos
1552-3098/$26.00 © 2010 IEEE 620 IEEE TRANSACTIONS ON ROBOTICS, VOL. 26, NO. 4, AUGUST 2010 and Tsourvelouds [19] optimized the paths of UAVs flying over tory instead of the local ones of the absolute Cartesian a terrain, and Zheng et al. [20] and Zhang et al. [21] searched codification. Our new codification takes advantage of all for optimal UAV paths in military missions. All formulated the the approaches: We use a list of 3-D absolute Cartesian problem as finding the trajectory that minimizes and fulfills a points, whose (x, y) values are generated using the relative set of optimization indexes and constraints, and while Mittal polar coordinates, to define cubic-spline trajectories. and Deb [14], Nikolos et al. [15], Hasircioglu et al. [16], 3) Using a general multiobjective evaluation method based Pehlivanoglu et al. [17], and Zhang et al. [21] solved the single- on goals and priorities. The evaluation method that com- UAV problem, Nikolos et al. [18], Nikolos and Tsourvelouds bines the different objectives and constraints used in [19], and Zheng et al. [20] solved the multi-UAV one. [14]–[21] does not provide an easy support to modify This paper presents our evolutionary planner to solve a the priorities of the objectives and their goals for different multi-UAV route-planning and cooperation problem. It extends missions, and only in [14] is the concept of Pareto optimal- [14]–[21] and our previous works, which are presented in [22] ity [25] used. The planners of [22] and [23], and this paper and [23], as follows. use the generic multiobjective Pareto-evaluation function 1) Using complex models and new optimization criteria and with goals and priorities presented in [26], which easily constraints for the military problem. To minimize the time permits to change the levels of priority of the different response of the planner, an extremely simple model of objectives as well as to include new objectives and con- the UAVs, terrain, and threats is used in [14]–[21], and straints. This way, our planners can be used for different the performance of the planner is tested against the same types of missions very easily. model instead of a realistic complex simulator. However, 4) Developing a general evolutionary framework to optimize in [22], [23], and in this paper, the routes are evaluated the behavior of multiple cooperating agents. The multiple according to the properties of a realistic aircraft for the path-generation problem is solved using, for each UAV, UAVs, real maps of the world for the terrain, and the an EA that optimizes its path based on the objectives and characteristics of the radars and missiles of the complex constraints associated with that UAV and some coopera- simulator used to test the results of our planner. Besides tion objectives related with the others. As the evaluation the UAVs position that is considered in [14]–[21], we also of the cooperation objectives requires some knowledge of use their velocity to calculate the minimum turning radius, the optimal solutions of all the UAVs, all the EAs need to the fuel consumption, and the risk of the trajectories. Not run simultaneously and send some information of the op- only do we check that the UAVs do not collide against timal solution they have obtained so far to the remaining the terrain, as in [14]–[21], but we also use maps of any EAs, i.e., coordination between the solutions (which, in the part of the world to represent it, as in [17] and [20], and particular case of this work, means coordination between to inhibit the radar detection’s capability when UAVs fly UAVs) is obtained by introducing coordination objectives hidden behind mountains. We calculate the probability to and sharing information between the EAs. The result of detect and/or to destroy an UAV based on the range of this approach is a multiple coordinated agents coevolution detection, the tracking probability, and the fire range of EA (MCACEA), which is a general evolutionary frame- the three types of radars and missiles considered in our work to optimize some characteristics (i.e., the path in our simulator. Our planner also supports the definition of pro- problem) of multiple cooperating agents. MCACEA can hibited zones (i.e., no flying zones, NFZs), which UAVs obtain the path of a single UAV (as in [14]–[17], and [21]) have to avoid due to mission restrictions. Finally, this is or the paths of several cooperating UAVs that have to be the first of our papers that introduces the mathematical flying simultaneously without colliding (as in [18]–[20]). models used in all our work. Although the core idea of MCACEA is shortly described 2) Implementing a new codification for the trajectories.Al- in [22], this paper presents it in detail and includes new and though 3-D trajectories can be defined pointwise as the more challenging problems that illustrate its performance linear segments determined by a list of 3-D absolute Carte- better. sian points (x, y, z), as in [20] and [21], smoother curves, 5) Speeding up the computation by combining some problem- which are easier to follow by the UAVs, can be obtained specific features with standard efficient EA operators, with other representations [24]. Therefore, the list of whose parameters are tuned after a statistical perfor- 3-D points determines the B-spline curve followed by mance analysis. Following the ideas in [16], [17], and [20], each UAV in [14]–[16], [18], and [19], a Bezier curve our EAs also include some special features, such as the in [17], and a cubic spline (which lets us include in the option of forcing the UAVs to fly at constant altitude and trajectory all the fixed points that each UAV has to visit, the initialization of the algorithm with previous solutions. and not only one, as in [14]) in our study, i.e., in [22] The substitution method based on hypercubes [27], which and [23]. Besides, the EAs in [14]–[17] codify the 3-D is used in [22] and [23], is replaced by the more efficient points with absolute Cartesian coordinates, while the EAs one that is included in the second version of the Non- in [18], [19], [22], and [23] use relative polar coordi- dominated Sorting Genetic Algorithm (NSGA-II) [28]. nates for (x, y). Using relative polar coordinates signif- Besides, some of the EA parameters are selected using icantly reduces the search space; however, the mutation the statistical performance analysis based on dominance and crossover steps produce global changes in the trajec- ranking that is presented in [29]. The lack of a significative BESADA-PORTAS et al.: EVOLUTIONARY TRAJECTORY PLANNER FOR MULTIPLE UAVs IN REALISTIC SCENARIOS 621
max statistical difference between the results of the optimiza- ni are obtained as follows: tions carried out with and without the local-search operator 2 min v that is used periodically over the best solutions in our pre- Ri = (1) max 2 − vious work has made us eliminate it as well. Therefore, g (ni ) 1 the improved EAs that constitute our current multi-UAV max × −9 2 − × −4 planner are computationally quicker and more standard. ni =5.3809 10 zi 4.4291 10 zi +6.1000 Finally, we want to highlight that our EA planner works offline (2) and online. The offline paths are obtained by the multiple EAs working inside the multiple-agent framework before the mission where g is the gravity. All the trajectory points, whose starts based on the original information that the system has about turning radius Ri is smaller than the minimum permitted the environment. During the simulations, if an unexpected threat one, are simultaneously penalized by the use of (3), whose suddenly pops up, the EAs of the affected UAVs start working minimum value (i.e., zero) ensures the fulfillment of the online to find a new path that avoids the threat. constraint. Ri is calculated as the radius of the circumfer- The rest of this paper is organized as follows. Section II de- ence that is defined by the points i − 1, i, and i +1 scribes the problem by means of the mathematical models used N 1,R≤ Rmin to define the missions. Sections III–VI, which are, respectively, 1 1 i i ci with ci = (3) related with points 2)–5) of this section, explain the different i=1 0, otherwise. parts of our EA planner. Section VII presents the results of our 2) Limited UAV slope: The UAVmaneuverability is also con- planner in different scenarios, and Section VIII contains the fi- strained by its maximum climbing slope α and its mini- nal discussion and conclusions. Finally, the Appendix contains i mum gliding slope β , which, in our case, depend on its the most-used acronyms. i altitude zi . These slope limits are calculated with −10 2 −5 αi = −1.5377 × 10 z − 2.6997 × 10 zi +0.4211 II. PROBLEM DESCRIPTION i The military missions of the optimization problem that are (4) × −9 2 − × −6 − solved by our EA planner are defined by a set of optimization βi =2.5063 10 zi 6.3014 10 zi 0.3257. planning criteria and constraints, which include the minimiza- tion of the risk of destruction of the UAVs as well as the restric- (5) tions imposed by the UAVs’ dynamics and the environment. The UAV slope at the ith point (i.e., Si ) is obtained with Currently, our problem considers 11 objectives. The first ten zi+1 − zi measure the fitness of the trajectory of each UAVindependently. Si = . (6) 2 2 Six are constraints, while the other four are values that should (xi+1 − xi) +(yi+1 − yi) be minimized in order to optimize the solution. The last, i.e., the eleventh one, is a cooperation constraint that checks the paths Similarly to the previous constraint, all the trajectory feasibility when all the UAVs are flying simultaneously. points, whose slope is out of the permitted range, are pe- Their values are calculated over the UAV trajectories, which nalized using the following expression, whose minimum are cubic-spline curves obtained from the list of 3-D Cartesian value (i.e., zero) is the optimal for this constraint: waypoints used to codify the solutions of each EA; for more N 2 2 0,βi c) The fuel consumption when the UAV is flying with 4) Terrain: A feasible path cannot go through the terrain MT maximum bank angle (i.e., Fi ), which is dependent and has to avoid collisions with mountains. To ensure this max on ni behavior, the algorithm penalizes the solutions that have MT − × −1 max 2 at least one point of the spline trajectory inside the terrain. Fi = 3.0435 10 (ni ) If map(xi ,yi ) is the function that returns the altitude of +16.552 × nmax +0.3565. (10) i the terrain at any point (xi,yi ), the following expression, The real cases are considered as a combination of the which calculates the number of points that are inside the previous cases, where only the main contributions that terrain, is used to penalize this constraint: appear in each type of maneuver are taken into account. N ≤ a) When the UAV is ascending, the real consumption 4 4 1,zi map(xi ,yi ) A ci with ci = (15) (i.e., FCi ) depends basically on the fuel consumption 0, otherwise. CH i=1 at constant height (i.e., Fi ) and at maximum slope MS (i.e., Fi ), as well as on the discrepancy between the 5) Map limits: To penalize the points of the trajectory that actual and maximum slopes (i.e., Si and αi ) are not inside the map limits, (16), shown below, is used, where xm and xm are the map lower and upper limits A CH Si MS − CH l u FCi = Fi + Fi Fi . (11) m m αi for the “x” coordinate, and yl and yu are the equiva- lent for the “y” coordinate. Its minimum value (i.e., zero) b) When the UAV is turning but not ascending, the fuel T ensures the fulfillment of the following map constraint: consumption (i.e., FCi ) depends on the fuel consump- CH N tion at constant height (i.e., Fi ) and at maximum MT 5 turning angle (i.e., Fi ), as well as on the ratio be- ci tween the actual and maximum load factors (i.e., n i=1 i and nmax) i 0, InMap(xi ,yi ) ni 5 FCT = F CH + F MT − F CH (12) with ci = i i max i i 1, otherwise ni where n can be obtained from (1), using R and n m ≤ ≤ m ∧ m ≤ ≤ m i i i InMap(xi,yi )=(xl xi xu ) (yl yi yu ). min max instead of Ri and ni . c) Finally, when the UAV flies straight and descending, (16) CH the fuel consumption (i.e., FCi ) is approximated by 6) Flight-prohibited zones (i.e., NFZs): The user can define the one for horizontal and straight-flight conditions certain zones where UAVsmust not enter because they are F CH i . This assumption is conservative since in this considered high-risk zones, unknown zones, etc. situation, the consumption is lower. They are defined as M rectangular regions constrained Considering that between two points of the trajectory, the NFZ,j NFZ,j NFZ,j by their external limits (xl , xu , yl and fuel consumption is constant, the total fuel consumption yNFZ,j, with j =1:M). The N points of the trajectory is calculated as the summation of the product of the fuel u case(i) have to avoid them. To penalize this objective, for each of needed at point i (i.e., FCi ) and the time needed to the trajectory points that are inside a NFZ, we accumulate go from i to i +1(∆ti), which is estimated as follows: the distance to the closest NFZ edge, with the purpose of distinguishing between two solutions that have the same − 2 − 2 − 2 (xi+1 xi ) +(yi+1 yi) +(zi+1 zi) number of points inside the NFZs, but at different distances ∆t = . i v of their frontiers. The following expression imposes the (13) selected penalization and the constraint is fulfilled with its The total fuel consumption value is limited by the fuel minimal value (i.e., zero): capacity of the UAV (i.e., Fuel). The following expres- sion only penalizes the constraint when the consumption N M j exceeds the capacity. Its minimum value (i.e., zero) is ob- di tained when the fuel consumption is less than the fuel i=1 j=1 capacity. A positive value implies that the consumption is j,k,a min(di ), if InNFZ (i, j) higher than the capacity. Therefore, the following expres- j k,a with di = sion calculates the extra fuel that is needed to follow the 0, otherwise trajectory: j,k,a NFZ,j d = |a