Pseudospectral Methods for Infinite-Horizon Optimal Control
Total Page:16
File Type:pdf, Size:1020Kb
JOURNAL OF GUIDANCE,CONTROL, AND DYNAMICS Vol. 31, No. 4, July–August 2008 Pseudospectral Methods for Infinite-Horizon Optimal Control Problems Fariba Fahroo∗ and I. Michael Ross† Naval Postgraduate School, Monterey, California 93943 DOI: 10.2514/1.33117 A central computational issue in solving infinite-horizon nonlinear optimal control problems is the treatment of the horizon. In this paper, we directly address this issue by a domain transformation technique that maps the infinite horizon to a finite horizon. The transformed finite horizon serves as the computational domain for an application of pseudospectral methods. Although any pseudospectral method may be used, we focus on the Legendre pseudospectral method. It is shown that the proper class of Legendre pseudospectral methods to solve infinite- horizon problems are the Radau-based methods with weighted interpolants. This is in sharp contrast to the unweighted pseudospectral techniques for optimal control. The Legendre–Gauss–Radau pseudospectral method is thus developed to solve nonlinear constrained optimal control problems. An application of the covector mapping principle for the Legendre–Gauss–Radau pseudospectral method generates a covector mapping theorem that provides an efficient approach for the verification and validation of the extremality of the computed solution. Several example problems are solved to illustrate the ideas. I. Introduction infinite horizon. Second, a pseudospectral (PS) method is applied to VER the last few years, it has become possible to routinely discretize the problem. The discretized problem generates a sequence fi of approximate problems parameterized by the size of the grid. In the O solve many practical nite-horizon optimal control problems. fi Practical problems are typically nonlinear and constrained in both the third and nal step, each problem in this sequence is solved by a state and control variables. A recent example of generating routine globally convergent sequential quadratic programming (SQP) solutions to practical problems is the flight implementation of technique [8]. Note that global convergence does not mean global pseudospectral solutions to an optimal “zero-propellant maneuver” optimality [9]. Among many, this is one reason why it is important to of the International Space Station [1,2]. Furthermore, a fairly large select a correct sequence of problems together with the SQP class of finite-horizon problems can also be solved in real time. A subsequence to design a proper algorithm for solving optimal control number of issues related to this advancement are discussed in [3]. problems [10]. Such an algorithm for PS methods is called a spectral Real-time computation of optimal controls implies optimal feedback algorithm [11], which forms the core of PS methods for control. For guidance; hence, a number of control problems that are not readily the purpose of brevity, we do not discuss all aspects of our proposed addressable by traditional methods can now be solved in a unified approaches because many of these concepts have been extensively discussed elsewhere. For example, a complete description of the manner [4]. Despite such advancements, obtaining feedback controls – – for stabilizing a nonlinear system poses significant challenges [5]. Legendre Gauss Lobatto PS method is described in [12,13]; the Among many, one of the reasons for the continuing challenge is the spectral algorithm is described in [11]; and a convergence theorem is treatment of the infinite horizon in a nonlinear optimal control developed in [10]. Consequently, in this paper, we limit the scope of the problem to a complete discussion of only the open-loop issues problem [6]. A well-known strategy to address this issue is the fi receding-horizon technique, which forms the basis of model- pertaining to the treatment of the in nite horizon. Stability of the predictive methods for control. Even though finite-horizon problems closed-loop system is discussed in [14]. An overview of an fi implementation of the closed-loop control is discussed in [15]. In this are solved in real time, in this approach, the nite horizon is receded fi indefinitely as a means to approximate the infinite horizon. As a context, this paper provides the foundations for solving in nite- result of the mismatch between the computational horizon and the horizon optimal control problems. Part of this foundation is the infinite horizon, a large number of new problems emerge. Conse- development of a Radau-based PS method. It turns out that the well- fl known Lobatto family of PS methods is inappropriate for the quently, different avors of model-predictive control techniques fi have been proposed to address these problems. These issues are treatment of the in nite horizon as a result of singularities in the surveyed in [6] and the references contained therein. domain transformation technique. A detailed discussion of these In this paper, we propose an altogether different approach for issues is included in this paper. solving infinite-horizon, nonlinear, constrained optimal control As a matter of context, we note that our proposed approach is not problems. Our approach follows [7], in which we presented some an application of PS discretization applied to receding-horizon techniques. This approach has been discussed elsewhere [16–20] and initial ideas; here, we further the concepts along several different fi directions. First, the infinite horizon is mapped to a finite horizon by a offers well-established bene ts of PS methods; however, as alluded domain transformation technique that preserves the notion of the to earlier, the fundamental problems and issues [6] of receding- horizon techniques arising from the finiteness of solving the finite- horizon optimal control problem do not disappear by choosing Received 28 June 2007; revision received 4 September 2007; accepted for alternative discretizations. Thus, the proposed approach in this paper publication 12 September 2007. This material is declared a work of the U.S. provides a foundation for a new approach to solving infinite-horizon Government and is not subject to copyright protection in the United States. optimal control problems. Copies of this paper may be made for personal or internal use, on condition Two example problems are solved to illustrate various elements of that the copier pay the $10.00 per-copy fee to the Copyright Clearance Center, the main points of the proposed ideas: one is a standard linear- Inc., 222 Rosewood Drive, Danvers, MA 01923; include the code 0731-5090/ 08 $10.00 in correspondence with the CCC. quadratic regulator (LQR) problem in which the solutions from our ∗Professor, Department of Applied Mathematics, Code MA/Ff; method are compared favorably against the solutions from the [email protected]. Associate Fellow AIAA. Riccati approach. The second example is a nonlinear control problem †Professor, Department of Mechanical and Astronautical Engineering, of stabilizing NPSAT1, an experimental spacecraft designed and Code ME/Ro; [email protected]. Associate Fellow AIAA. built at the Naval Postgraduate School and scheduled to be launched 927 928 FAHROO AND ROSS in fall 2009. Additional example problems have been solved [7,14] 1 t 1 t t : , (3) but are not discussed in this paper for the purpose of brevity. 1 t 1 where 21; 1. Because we assumed that quantities exist in the II. Problem Formulation limit as t !1, it is clear that Eq. (3) maps the infinite domain to a We consider the following infinite-horizon, constrained, nonlinear finite domain. Using optimal control problem: dt 2 8 R : r (4) > 1 1 2 <> Minimize Jx ; u 0 F x t; u tdt d _ f 1 Subject to x t x t; u t B> 0 0 we reformulate problem 1B on the finite interval 1; 1 as :> x x h 0 8 R x t; u t > 1 > Minimize Jx ; u 1 r F x t ; u t d < dx t N N r f x t ; u t where 0; 1 3 t7 !fx 2 R x ; u 2 R u g is the state-control function Subject to d B> 1 0 pair; F: RNx RNu ! R, f: RNx RNu ! RNx , and h: RNx :> x t x 0 h 0 RNu ! RNh are continuously differentiable functions; and x is a x t ; u t given value of the state at the initial time t 0. The symbol N 1 denotes the vector dimension (i.e., number of variables) of the Evaluations at must be performed in the sense of the limit subscripted quantities. Thus, it is clear that we are dealing with a consistent with t !1. In applying the Pontryagin minimum fairly general optimal control problem for which the problem principle for problem B, we generate the following problem: formulation is motivated by stabilizing nonlinear control systems 8 > Find x ; u ; ; that frequently describe high-performance aerospace vehicles. We > dx t 1 > Such that r f x t ; u t assume that a solution exists for problem B in some appropriate > d > x t 1 x0 Sobolev space [21] and that the limits > <> h x t ; u t 0 lim lim @H x tx1 and u tu1 B 0 t!1 t!1 > @u > t T h x t ; u t 0 > exist. Then, under appropriate conditions [22], the Pontryagin > t 0 1 > minimum principle holds for problem B. That is, if fx ; ug is a > _ t r @H 1 :> @x solution to problem B, then there exist multipliers (duals or t 1 0 covectors) that satisfy the adjoint equation, the Hamiltonian minimization condition, and the transversality conditions. This where evaluations at t 1 are also to be understood in the sense of a dualization may be cast in terms of a boundary-value problem, limit. 1 fi denoted as problem B , and is de ned as The equivalence between problems 1B and B require a set of 8 technical assumptions that we implicitly assume.