Ann Oper Res (2007) 153: 257–296 DOI 10.1007/s10479-007-0172-6

Complexity and algorithms for nonlinear optimization problems

Dorit S. Hochbaum

Published online: 3 May 2007 © Springer Science+Business Media, LLC 2007

Abstract Nonlinear optimization algorithms are rarely discussed from a complexity point of view. Even the concept of solving nonlinear problems on digital computers is not well defined. The focus here is on a complexity approach for designing and analyzing algorithms for nonlinear optimization problems providing optimal solutions with prespecified accuracy in the solution space. We delineate the complexity status of convex problems over network constraints, dual of flow constraints, dual of multi-commodity, constraints defined by a submodular rank function (a generalized allocation problem), tree networks, diagonal dominant matrices, and nonlinear knapsack problems. All these problems, except for the latter in integers, have polynomial time algorithms which may be viewed within a unifying framework of a proximity-scaling technique or a threshold technique. The complexity of many of these algorithms is furthermore best possible in that it matches lower bounds on the complexity of the respective problems. In general nonseparable optimization problems are shown to be considerably more difficult than separable problems. We compare the complexity of continuous versus discrete nonlinear problems and list some major open problems in the area of nonlinear optimization.

Keywords Nonlinear optimization · Convex network flow · Strongly polynomial algorithms · Lower bounds on complexity

1 Introduction

Nonlinear optimization problems are considered to be harder than linear problems. This is the chief reason why approximate linear models are frequently used even if the circumstances justify a nonlinear objective. A typical approach is to replace an objective function that is nonlinear by a piecewise linear function. This approach may adversely affect the algorithm's complexity, as often the number of pieces is very large and integer variables may have to be introduced. For problems with a nonlinear objective function the leading methodology consists of iterative and numerical algorithms in which complexity analysis is substituted with a convergence rate proof.

In order to apply complexity analysis to nonlinear optimization problems, it is necessary to determine what it means to solve such a problem. Unlike linear problems, for nonlinear problems the length of the output can be infinite, such as in cases when a solution is irrational. There are two major complexity models for nonlinear optimization. One seeks to approximate the objective function. The second, which we present here, approximates the optimal solution in the solution space. The latter has a number of advantages described next in Sect. 1.1.

Our goals here are to set a framework for complexity analysis of nonlinear optimization with linear constraints, delineate as closely as possible the complexity borderlines between classes of nonlinear optimization problems, and generate a framework of effective techniques. In particular it is shown how properties of convexity, separability and quadraticness of nonlinear optimization contribute to a substantial reduction in the problems' complexity. We review a spectrum of techniques that are most effective for each such class and demonstrate that in some cases the techniques lead to best possible algorithms in terms of their efficiency. This paper is an updated version of Hochbaum (2005).

An earlier version of this paper appeared in 4OR, 3:3, 171–216, 2005. D.S. Hochbaum, Department of Industrial Engineering and Operations Research and Walter A. Haas School of Business, University of California, Berkeley, USA. e-mail: [email protected]
Some of the subjects covered appeared earlier in Hochbaum (1993).

1.1 The complexity model, or what constitutes a solution to a nonlinear problem

There is a fundamental difficulty in solving nonlinear optimization on digital computers. Unlike linear programming, for which all basic solutions to a set of linear inequalities require only finite accuracy of polynomial length in the length of the input (see, e.g., Papadimitriou and Steiglitz 1982, Lemma 2.1), one cannot bound a priori the number of digits required for the length of optimal solutions. This feature is important since computers can only store numbers of finite accuracy. Even the simplest nonlinear optimization problems can have irrational solutions, and thus writing the output alone requires infinite complexity. This is the case, for instance, in the minimization of the function max{x^2 − 2, 0}. So the interpretation of what it means to solve a problem in reals is not obvious.

Traditional techniques for coping with nonlinear problems are reviewed extensively in Minoux (1986). These techniques have several shortcomings. In some applications, the nonlinear function is not available analytically. It is accessible via a data acquisition process or as a solution to a system of differential equations (as in some queuing systems). A typical traditional method approximates first the data input function as an analytic function, while assuming it is a polynomial of a conveniently low degree. Consequently, there are errors attributed to the inaccuracy of the assumed input even before an algorithm is applied for solving the problem. Further difficulties arise because of conditions required for the algorithms to work. The algorithms typically make use of information about the function's derivatives from numerical approximations, which incorporates a further element of error in the eventual outcome. Moreover, certain assumptions about the properties of the nonlinear functions, such as differentiability and continuity of the derivatives, are often made without any evidence that the “true” functions indeed possess them.

In addition to the computational difficulties, the output for nonlinear continuous optimization problems can consist of irrational or even transcendental (non-algebraic) numbers. Since there is no finite representation of such numbers, they are usually truncated to fit within the prescribed accuracy of the hardware and software.

Complexity analysis requires finite length input and output which is of polynomial length as a function of the length of the input. Therefore the usual presentation of nonlinear problems renders complexity analysis inapplicable. For this reason complexity theory has not addressed nonlinear problems, with the exception of some quadratic problems. To resolve this issue, the nonlinear functions can be given in the form of a table or oracle. An oracle accepts arguments of a limited number of digits, and outputs the value of the function truncated to the prescribed number of digits.

Since nonlinear functions cannot be treated with the same absolute accuracy as linear functions, the notion of approximate solutions is particularly important. One definition of a solution to a nonlinear optimization problem is one that approximates the objective function. Among the major proponents of this approach are Nemirovsky and Yudin (1983), who chose to approximate the objective value while requiring some prior knowledge about the properties of the objective function. We observe that any information about the behavior of the objective at the optimum can always be translated to a level of accuracy of the solution vector itself (and vice versa). A detailed discussion of this point is provided in Hochbaum and Shanthikumar (1990). We thus believe that the interest of solving the optimization problem is in terms of the accuracy of the solution rather than the accuracy of the optimal objective value. We thus use a second definition: a solution is one that approximates the solution in the solution space, within a prescribed accuracy of ε.
According to the concept of ε-accuracy of Hochbaum and Shanthikumar (1990), a solution is said to be ε-accurate if it is at a distance of at most ε (in the L∞ norm) from an optimal solution. That is, a solution x is ε-accurate if there exists an optimal solution x* such that ‖x − x*‖∞ ≤ ε. ε is then said to be the accuracy required in the solution space. In other words, the solution is identical to the optimum in O(log 1/ε) decimal digits. Both definitions of a solution overlap for nonlinear problems on integers, in which case an accuracy of ε < 1 is sufficient.

The computation model here assumes the unit cost model, i.e. any arithmetic operation or comparison is counted as a single operation, even if the operands are real numbers. We use here however only operands with up to O(log 1/ε) significant digits.

A similar complexity model using ε-accuracy was used by Shub and Smale (1996); Renegar (1987) and others. These works, however, address algebraic problems, as opposed to the optimization problems addressed here.
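In code, the ε-accuracy test is a plain L∞ comparison. The sketch below is illustrative only (the helper name and the √2 instance are ours, not the paper's):

```python
# Minimal sketch of epsilon-accuracy in the solution space: a candidate x
# is epsilon-accurate if ||x - x*||_inf <= epsilon for some optimal x*.
# The helper name and the sqrt(2) example are illustrative assumptions.

def is_eps_accurate(x, x_star, eps):
    """True if x is within eps of x_star in the L-infinity norm."""
    return max(abs(a - b) for a, b in zip(x, x_star)) <= eps

# sqrt(2) is irrational, so no finite output is exact, but 1.4142 is
# eps-accurate for eps = 1e-4, i.e. correct in O(log 1/eps) decimal digits.
print(is_eps_accurate([1.4142], [2 ** 0.5], 1e-4))  # True
```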

1.2 Classes of nonlinear problems addressed and some difficult cases

A fundamental concept linking the solutions to continuous and integer problems is that of proximity. Classes of problems for which there is a “good” proximity, with relative closeness of the continuous and integer solutions, are solvable in polynomial time that depends on that distance. These classes include problems with a constraint matrix that has small subdeterminants. A prominent example of such optimization problems is the convex separable network flow problem. Other classes with a good proximity addressed here are the problems with polymatroidal constraints (the allocation problem discussed in Sect. 5) and the NP-hard nonlinear knapsack problem. For the nonlinear knapsack problem it is shown that approaches relying on proximity are effective in generating a fully polynomial approximation scheme for the nonlinear knapsack problem as well as other results equivalent to those that hold for the linear knapsack problem. That is, the nonlinearity of the objective function does not make the problem any harder.

Among nonlinear problems with a constraint matrix that has very large subdeterminants, we discuss here the inverse shortest paths problem, which has the same constraint matrix as the dual of the multicommodity flow problem. For that problem we employ a proximity theorem called projected proximity that allows obtaining polynomial time algorithms for this class of nonlinear problems.

Convex problems are discussed here with the classification: separable or nonseparable; quadratic or non-quadratic; integer or continuous. The nonconvex continuous version of the separable minimum cost network flow problem is NP-hard (Sahni 1974) even for the quadratic case. The corresponding concave minimization problem is in general NP-hard (see, e.g., Guisewite and Pardalos 1990). An excellent unifying presentation of some polynomial instances of the concave separable cost minimization flow problem is given in Erickson et al. (1987).
There the problem is proved to be polynomial when the arcs are uncapacitated and the number of demand nodes is fixed. The problem is also proved to have polynomial time algorithms for certain classes of planar graphs.

The nonseparable nonlinear optimization problem is in general hard. Even with the assumption of convexity, a quadratic nonseparable problem is NP-hard (see Sect. 11). In spite of these negative results, there are a number of subclasses of practical interest that are solvable in polynomial time, which we point out.

As a general rule, and as we show later, a convex separable nonquadratic integer flow problem is easier to solve than the respective continuous one, using a proximity result. The exception to this rule is the quadratic convex problems, for which it is typically easier to obtain continuous solutions rather than integer ones. In that case an integer optimum is derived from the continuous solution instead of the other way around.

1.3 Polynomial algorithms and issues of strong polynomiality

All polynomial time algorithms presented here, and indeed all algorithms known, that solve nonlinear nonquadratic problems and run in polynomial time, do not have a strongly polynomial complexity. That is, the running time depends on the magnitude of some “number” in the data which is typically the range of the interval in which the variables are bounded. For example, the most efficient algorithms known to date for the convex integer separable flow problem are based on the concept of scaling. These include an algorithm by Minoux (1986), an algorithm by Ahuja et al. (1993) and an algorithm of “proximity-scaling” type by Hochbaum and Shanthikumar (1990) (presented in Sect. 3). The complexity of the proximity-scaling algorithm for the integer problem is O(log(B/m) · (m + n)(m + n log n)), where B is the largest capacity or supply in the network and n, m are the number of nodes and arcs respectively. For the continuous problem the ε-accurate solution is derived in O(log(B/ε) · (m + n)(m + n log n)) steps. These three algorithms are polynomial but not strongly polynomial, as they depend, via the quantity B, on the magnitude of the numbers appearing in the problem instance, as well as on n and m. A naturally arising question is whether it is possible to devise an algorithm the complexity of which is independent of B and dependent only on n and m, that is, a strongly polynomial algorithm. This question was answered in the negative with an impossibility result for strongly polynomial algorithms in Hochbaum (1994) (Sect. 2).

Strong polynomiality has emerged as an important issue since the first polynomial time algorithm, the ellipsoid method, was devised for solving linear programming problems. The ellipsoid method, as well as all other polynomial algorithms known for linear programming, runs in polynomial but not strongly polynomial time. That is, the running time depends on the data coefficients rather than only on the number of variables and constraints.
Consequently, solving linear programming problems (and other problems that do not possess strongly polynomial algorithms) with different degrees of accuracy in the cost coefficients results in different running times. So the actual number of arithmetic operations grows as the accuracy of the data, and hence the length of the numbers in the input, increases. Such behavior of an algorithm is undesirable as it requires careful monitoring of the size of the numbers appearing in the data describing the problem instance, and thus limits the efficient applicability of the algorithm.

Although it is not known whether linear programming can be solved in strongly polynomial time, Tardos (1986) established that “combinatorial” linear programming problems, those with a constraint matrix having small coefficients, are solvable in strongly polynomial time. Thus, minimum (linear) cost network flow problems are solvable in strongly polynomial time (Tardos 1985) since the coefficients of their constraint matrix are either 0, 1 or −1. In contrast, nonlinear and non-quadratic optimization problems with linear constraints were proved impossible to solve in strongly polynomial time in a complexity model of the arithmetic operations, comparisons, and the rounding operation (Hochbaum 1994). So while convex separable minimization is solved in polynomial time on totally unimodular matrices (Hochbaum and Shanthikumar 1990), linear optimization on such constraint matrices runs in strongly polynomial time.

This negative result is not applicable to the quadratic case, and thus it may be possible to solve constrained quadratic optimization problems in strongly polynomial time. Yet, few quadratic optimization problems have been shown to be solvable in strongly polynomial time. For instance, it is not known how to solve the minimum quadratic cost network flow problem in strongly polynomial time.
A number of special cases of the minimum quadratic cost network flow problem that are solvable in strongly polynomial time are reviewed in Sect. 10.

1.4 Overview

We begin with the impossibility result and lower bound on the complexity of nonlinear problems in Sect. 2. The lower bound provided applies in both the comparison model and in the algebraic tree model. Section 3 describes the proximity-scaling algorithm for convex separable optimization problems with constraint matrices that have bounded subdeterminants. We focus on the interpretation of the technique as a form of piecewise linear approximation of nonlinear functions that uses specific scaling so as to guarantee polynomial complexity. A specific implementation of the approach to convex network flow is given in Sect. 4. In Sect. 5 we describe the use of a proximity-scaling approach to the general allocation problem and its special cases. Here the proximity theorem used is stronger than the one for general problems on linear constraints. The algorithms generated are shown to be best possible for most classes of the allocation problem. The use of proximity to reduce the nonlinear knapsack problem to an allocation problem is described in Sect. 6. This leads to a polynomial time algorithm for the nonlinear continuous knapsack, which in turn makes a fully polynomial time approximation scheme available for the nonlinear knapsack problem. The use of proximity-scaling for the convex dual of minimum cost network flow is sketched next in Sect. 7. The proximity-scaling approach is concluded in Sect. 8 where a “projected proximity” theorem is shown to be applicable to the convex dual of the multi-commodity flow problem, with application to the inverse shortest paths problem.

The next leading technique we describe is based on “threshold theorems” in Sect. 9. It is shown there how this technique is used for the convex cost closure and the convex s-excess problem with one application, among many others, to the image segmentation problem. Classes of quadratic problems known to be solved in strongly polynomial time are delineated in Sect. 10. Various classes of nonseparable cases that are solved efficiently, and a collection of relevant techniques, are given in Sect. 11. Section 12 contains some concluding remarks and lists open problems.

Notation used in this paper includes bold letters for denoting vectors, and e to denote the vector (1, 1, . . . , 1). The vector ej has 0s in all positions except position j, which is equal to 1. When discussing a network the standard notation G = (V, A) is used, with the number of nodes |V| denoted by n and the number of arcs |A| denoted by m. In the discussion on nonlinear programming the number of variables is denoted by n and the number of constraints (excluding nonnegativity) by m.

2 The impossibility of strongly polynomial algorithms for convex separable network flow problems

An impossibility result on the existence of a strongly polynomial algorithm for nonlinear problems was proved in Hochbaum (1994) and is reviewed here. This result applies to convex separable minimization, and even to constraint matrices that are very simple, such as network flow constraints or even a single constraint bounding the sum of the values of the variables. This lower bound holds in the comparison model for all nonlinear problems. In a more general model—the algebraic-tree model that permits all the arithmetic operations—strongly polynomial algorithms are provably impossible for nonlinear and nonquadratic problems. That leaves open the possibility of strongly polynomial algorithms for quadratic convex separable minimization problems.

The problem for which the lower bound is given is the simple resource allocation problem. The simple resource allocation problem (denoted by the acronym SRA) is identical to a single source transportation problem in maximization form:

(SRA)  max { Σ_{i=1}^n fi(xi) | Σ_{i=1}^n xi = B, x ≥ 0 }.

The generic presentation of this problem is as a concave maximization problem (with an obvious translation to the minimization/convex case). We first present a comparison model lower bound followed by an algebraic tree model lower bound.

2.1 A comparison model lower bound

A comparison computation model allows only the operations of comparisons and branchings. The lower bound proof establishes that no algorithm exists that solves SRA in less than log2 B comparisons. To show that, we rely on a result of information theory according to which there is no algorithm that finds a value in a monotone nonincreasing n-array that is the first to be smaller than some specified constant in less than log2 n time. Consider first a lower bound result for SRA in two variables, SRA(2):

(SRA(2)) max f1(x1) + cx2

x1 + x2 = B,

x1, x2 ≥ 0, integer.

Let the function f1(x1) be given as an array of B increments at the B integer points. Namely, if f1(i) = ai the array of increments is {a0, a1 − a0, a2 − a1, . . . , aB − aB−1}. Since the function is concave, the entries in the array are monotone nonincreasing, ai+1 − ai ≤ ai − ai−1.

The optimal solution to this problem is x1 = j and x2 = B − j, where j is the largest index such that aj − aj−1 ≥ c. Since the array of the differences between consecutive entries of the array is monotone nonincreasing, determining the index j in the array of differences can be done using binary search in log2 B comparisons. The information theoretic lower bound is also log2 B comparisons. This is because the comparison tree has B leaves, so the path of comparisons leading from the root to a leaf could be as long as log2 B (see Knuth 1973).

Suppose the problem could be solved independently of B. Then a given monotone nonincreasing array and a value c have a corresponding function f1 such that the solution to SRA(2) is independent of B. Consequently, the required entry in the vector could be found independently of B, which is a contradiction to the comparison tree lower bound. A similar proof works for the problem with a single variable if the constraint is an inequality constraint.

The same arguments can be extended to prove that in the comparison model the allocation problem on n + 1 variables has complexity Ω(n log2 (B/n)). Let the problem be defined for c > 0:

(SRA(n + 1))  max Σ_{j=1}^n fj(xj) + c · xn+1

s.t. Σ_{j=1}^{n+1} xj = B,

xj ≥ 0, integer, j = 1, . . . , n + 1.

Let the functions fj be concave and monotone increasing in the interval [0, ⌊B/n⌋], and zero in [⌊B/n⌋, B]. Solving SRA(n + 1) is then equivalent to determining, in n arrays of length ⌊B/n⌋ each, the last entry of value ≥ c. Since the arrays are independent, the information theory lower bound is Ω(n log ⌊B/n⌋). Similarly, for the case of an inequality constraint the same lower bound applies for the problem on n variables, since xn+1 can simply be viewed as the slack and c = 0.

This comparison model lower bound holds also for the quadratic case. It is therefore impossible to solve the quadratic problems in strongly polynomial time using only comparisons. Indeed the floor operation is essential for the quadratic integer problem, and without it there is no hope of solving the integer version in strongly polynomial time. This follows from an observation by Tamir (1993), who demonstrated this via the following quadratic allocation problem:

min (1/2)x1^2 + ((a − 1)/2)x2^2,

s.t. x1 + x2 = b,

x1, x2 ≥ 0, integer.

The optimal value of x2 is ⌊b/a⌋. Therefore the floor operation can be executed via a routine that solves a quadratic allocation problem. We demonstrate in the next section an impossibility result of strongly polynomial algorithms for non-quadratic problems which implies that the floor operation, if helpful in devising strongly polynomial algorithms, is limited in its power to quadratic problems only.
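The binary search underlying the SRA(2) argument above is easy to make concrete. The helper below is a hypothetical sketch (not the paper's code): it takes the array of concave increments d[k] = f1(k) − f1(k−1) and finds the largest j with d[j] ≥ c in log2 B comparisons:

```python
# Binary search for the SRA(2) optimum x1 = j, x2 = B - j, where j is the
# largest index with f1(j) - f1(j-1) >= c. Concavity makes the increment
# array monotone nonincreasing, so log2(B) comparisons suffice.

def solve_sra2(d, c, B):
    """d[k-1] = f1(k) - f1(k-1) for k = 1..B, nonincreasing. Returns (x1, x2)."""
    lo, hi = 0, B                      # invariant: d[1..lo] >= c > d[hi+1..B]
    while lo < hi:
        mid = (lo + hi + 1) // 2
        if d[mid - 1] >= c:
            lo = mid
        else:
            hi = mid - 1
    return lo, B - lo

# Example with the concave f1(x) = 10x - x^2 on 0..6 and c = 3:
d = [10 * k - k * k - (10 * (k - 1) - (k - 1) ** 2) for k in range(1, 7)]
print(d)                    # [9, 7, 5, 3, 1, -1]
print(solve_sra2(d, 3, 6))  # (4, 2): d[4] = 3 >= 3 but d[5] = 1 < 3
```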

2.2 The algebraic-tree model lower bound

One might criticize the choice of the comparison model as being too restrictive. Indeed, the use of arithmetic operations may help reduce a problem's complexity. This is the case for the quadratic SRA, which is solvable in linear time, O(n) (Brucker 1984). The lower bound here demonstrates that such success is not possible for other nonlinear (non-quadratic) functions. The algebraic-tree computation model permits the arithmetic operations +, −, ×, ÷ as well as comparisons and branchings based on any of these operations. It is demonstrated that the nature of the lower bound is unchanged even if the floor operation is permitted as well.

We rely on Renegar's lower bound proof (Renegar 1987) in this arithmetic model of computation for finding ε-accurate roots of polynomials of fixed degree ≥ 2. In particular, the complexity of identifying an ε-accurate single real root in an interval [0, R] is Ω(log log (R/ε)), even if the polynomial is monotone in that interval. Let p1(x), . . . , pn(x) be n polynomials, each with a single root of the equation pi(x) = c in the interval [0, B/n], and each pi(x) a monotone decreasing function in this interval. Since the choice of these polynomials is arbitrary, the lower bound on finding the roots of these n polynomials is Ω(n log log (B/(nε))). Let fj(xj) = ∫_0^{xj} pj(x) dx. The fj's are then polynomials of degree 3. The problem

(P)  max Σ_{j=1}^n fj(xj · ε) + c · xn+1 · ε

s.t. Σ_{j=1}^{n+1} xj = B/ε,

xj ≥ 0 integer, j = 1, . . . , n + 1,

has an optimal solution x such that y = ε · x is the (nε)-accurate vector of roots solving the system

p1(y1) = c,
p2(y2) = c,
. . .
pn(yn) = c.

This follows directly from the Kuhn–Tucker conditions of optimality and the proximity theorem to be discussed in Sect. 3.2, that an optimal integer solution x* to the scaled problem with a scaling constant s and the optimal solution to the continuous problem y* satisfy ‖x* − y*‖∞ ≤ ns (Theorem 1). Hence, a lower bound for the complexity of solving (P) is Ω(n log log (B/(nε))). For ε = 1, we get the desired lower bound for the integer problem.

Mansour et al. (1991) proved a lower bound on finding ε-accurate square roots that allows also the floor, ⌊·⌋, operation. In our notation this lower bound is Ω(√(log log (B/ε))). Hence, even with this additional operation the problem cannot be solved in strongly polynomial time. Again, the quadratic objective is an exception: indeed, algorithms for solving the quadratic objective SRA problem rely on solving the continuous problem first, then rounding down, using the floor operation, and proceeding to compute the resulting integer vector to feasibility and optimality using fewer than n greedy steps. See for instance Ibaraki and Katoh

(1988) for such an algorithm. Since the lower bound result applies also in the presence of the floor operation, it follows that the “ease” of solving the quadratic case is indeed due to the quadratic objective and not to this, perhaps powerful, operation.
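Tamir's observation from Sect. 2.1 can be checked numerically. The brute-force helper below is purely illustrative (ours, not the paper's construction), and the instances are chosen with b/a well away from a half-integer so the integer optimum coincides with the floor:

```python
# Illustrative check of the floor-via-quadratic-allocation idea: minimize
# (1/2)x1^2 + ((a - 1)/2)x2^2 subject to x1 + x2 = b over nonnegative
# integers, by brute force (small b only; the helper name is hypothetical).

def quad_alloc_x2(a, b):
    """x2 at the integer optimum of the quadratic allocation problem."""
    return min(range(b + 1),
               key=lambda x2: 0.5 * (b - x2) ** 2 + 0.5 * (a - 1) * x2 ** 2)

# The continuous optimum is x2 = b/a; for these instances the integer
# optimum reads off the floor b // a:
print(quad_alloc_x2(3, 10), 10 // 3)  # 3 3
print(quad_alloc_x2(5, 12), 12 // 5)  # 2 2
```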

3 A polynomial proximity-scaling algorithm

The proximity-scaling algorithm makes use of the fact that the convex piecewise linear minimization problem on a “small” number of pieces is solvable in polynomial time (Dantzig 1963 and Lemma 1). The proximity-scaling algorithm is based on approximating the convex functions by piecewise linear functions on a uniform grid with a small number of breakpoints. The proximity theorems state that the solution to such scaled problems is close to the optimal solution in the solution space, thus allowing the length of the interval in which each variable lies to be updated by a constant factor. A logarithmic number of calls to solving the scaled piecewise linear problem then leads to an optimal solution in integers, or within ε-accuracy.

While the proximity-scaling procedure of Hochbaum and Shanthikumar (1990) provides a polynomial time algorithm for any convex separable optimization problem on totally unimodular constraints (or for problems with constraint matrices that have bounded subdeterminants), it is shown next that a specialized implementation, taking into account the network structure and the equality balance constraints, is more efficient than the general purpose algorithm.
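The overall scheme can be sketched on the simple resource allocation problem of Sect. 2. The sketch below is ours, not the paper's algorithm: it uses an illustrative proximity bound (after solving at scale s, each variable's lower bound is reset s below its scaled value) so each phase allocates only O(n) increments and there are O(log(B/n)) phases:

```python
# Proximity-scaling sketch for max sum f_i(x_i), sum x_i = B, x_i >= 0
# integer, with each f_i concave. At scale s a greedy allocates increments
# of size s; the proximity bound lets the next phase restart from x_i - s
# rather than from 0. Illustrative simplification, not the paper's algorithm.
import heapq

def sra_proximity_scaling(f, n, B):
    s = max(1, B // (2 * n))
    low = [0] * n                            # proximity-derived lower bounds
    while True:
        x = low[:]
        heap = [(-(f[i](x[i] + s) - f[i](x[i])), i) for i in range(n)]
        heapq.heapify(heap)
        for _ in range((B - sum(x)) // s):   # allocate remaining size-s units
            _, i = heapq.heappop(heap)
            x[i] += s
            heapq.heappush(heap, (-(f[i](x[i] + s) - f[i](x[i])), i))
        if s == 1:
            return x                         # grid of size 1: exact optimum
        low = [max(0, x[i] - s) for i in range(n)]  # shrink intervals, rescale
        s //= 2

# Three concave quadratics, B = 30 units to split:
f = [lambda x, a=a: a * x - 0.5 * x * x for a in (20.0, 16.0, 12.0)]
print(sra_proximity_scaling(f, 3, 30))  # [14, 10, 6]
```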

3.1 The scaled piecewise linear approximation

The idea of (piecewise) linearizing a nonlinear function in order to obtain solutions has been well known. In a 1959 book Dennis (1959) writes regarding quadratic cost networks:

The electrical model for network flow problems can be extended to include flow branches for which the total cost contains terms depending on the square of the individual branch flows. . . . It appears that the algorithms presented in this chapter could be generalized. . . . These methods however are combinatorial in character and could require prohibitive calculation time, even on relatively simple networks. Certainly the simplicity and elegance of the diode-source algorithms would be absent. It would seem that the most practical means of attacking flow problems with quadratic costs would be to approximate the cost curve with piece-wise linear curve and substitute an appropriate number of linear cost branches connected in parallel.

Whereas it is clear that solving the problem on a piecewise linear approximation yields a feasible solution, the quality of such a solution and its closeness to an optimal solution were not evaluated until the work of Hochbaum and Shanthikumar (1990). We consider a convex separable minimization problem

min{F(x) | T x = b, ℓ ≤ x ≤ u}.

The scaling process is illustrated for the convex network flow problem. For a network G = (V, A) the variables are x = {xij}_{(i,j)∈A} and F(x) = Σ_{(i,j)∈A} fij(xij). For a scaling constant s, the piecewise linearized objective function is F^s(x) = Σ_{(i,j)∈A} f^s_ij(xij), so that for each (i, j) ∈ A, f^s_ij(xij) = fij(xij) if xij is an integer multiple of s. Let f^s_ij(xij) be defined so that it is linear in its argument between successive integral multiples of s.

Fig. 1 Piecewise linear approximation

Thus each function f^s_ij(xij) is a piecewise linear approximation of fij as depicted in Fig. 1. Formally, f^s_ij(ℓij + ks) = fij(ℓij) + Σ_{p=1}^k Δ^p_ij, where Δ^p_ij = fij(ℓij + ps) − fij(ℓij + (p − 1)s). The convex integer network flow (INF) problem is:

(INF)  min F(x)  s.t. T x = b, 0 ≤ x ≤ u, x integer,

and its continuous (real) relaxation (RNF):

(RNF) min F(x) s.t. T x = b, 0 ≤ x ≤ u.

T is the n × m incidence matrix of the network G = (V, A), b is a demand–supply n-vector, and u the capacity upper bounds vector on the arcs. When separable, the objective function is F(x) = Σ_{(i,j)∈A} fij(xij).

The (continuous) problem at the s-scaling phase, for any scaling constant s ∈ R+, is the scaled problem (RNF-s) obtained by setting x = sy:

(RNF-s)  min F^s(sy)
s.t. T y = b/s,
0 ≤ y ≤ u/s.

Because of the equality constraints, the integer version (INF-s) (with the requirement that y is integer) has a feasible solution only if b/s is integer. Therefore, we choose an equivalent formulation of the scaled problem with inequality constraints. The scaled integer problem is then:

(INF-s)  min F^s(sy)
s.t. T y ≥ ⌊b/s⌋,
−T y ≥ −⌈b/s⌉,
0 ≤ y ≤ u/s,
y integer.

Although a feasible solution to (INF-s) is not necessarily feasible for the original problem, the amount of unsatisfied supply (demand) is bounded by (⌈b/s⌉ − ⌊b/s⌋) · s e = Σ_{i=1}^n (⌈bi/s⌉ − ⌊bi/s⌋) · s ≤ ns units.¹

The set of feasible solutions {x | T x = b, 0 ≤ x ≤ u} is bounded, a priori, in a box of length B = min{‖u‖∞, ‖b‖1} in each dimension, as the flow on each edge cannot exceed capacity, or the total sum of demands. Denoting Nij = (uij − ℓij)/s, let each variable xij be substituted by a sum of Nij variables, each bounded between 0 and 1:

xij = s(⌊ℓij/s⌋ + Σ_{k=1}^{Nij} z^(k)_ij),  0 ≤ z^(k)_ij ≤ 1 for k = 1, . . . , Nij.

In the network the analogous substitution is to replace each arc (i, j) by Nij arcs of capacity 1 each. The modified objective function for the linear programming formulation of both (LNF-s) and (INF-s) is thus,

min Σ_{(i,j)∈A} f^s_ij(s⌊ℓij/s⌋) + Σ_{(i,j)∈A} Σ_{k=1}^{Nij} Δ^k_ij z^(k)_ij.

Due to the convexity of the f^s_ij, the sequence of increments Δ^k_ij for k = 1,...,N_ij is monotone nondecreasing. This property allows the problem to be solved as a linear program without enforcing the integrality requirement that z^(k+1)_ij > 0 only if z^(k)_ij = 1, as stated in Lemma 1 below. Let the column of T corresponding to arc (i, j) be denoted by a_ij. In the formulation each such column is duplicated N_ij times, and in the network each arc is multiplied N_ij times, where each duplicated arc has capacity 1. Let T^N be the matrix T in which each column is duplicated N_ij times. The constraint set is then,

T^N z ≥ b′, −T^N z ≥ −b′

¹It is interesting to note that Edmonds and Karp (1972) used such an idea of capacity scaling for the maximum flow problem, which can be formulated as a minimum cost flow problem with b = 0. That network flow problem readily provides feasible integer solutions, as the right hand sides are 0 and thus integer. For the minimum cost flow problem, however, feasibility is a concern, which is why we use inequalities in the formulation.

Here b′_p = ⌊b_p/s⌋ − Σ_{(i,j)} (a_ij)_p ⌊ℓ_ij/s⌋, which is the net supply in units of s. The equivalent linear programming formulation of the (LNF-s) problem is then (omitting the constant from the objective function),

(LNF-s) min Σ_{(i,j)∈A} Σ_{k=1}^{N_ij} Δ^k_ij z^(k)_ij
s.t. T^N z ≥ b′,
−T^N z ≥ −b′,
0 ≤ z^(k)_ij ≤ 1, k = 1,...,N_ij, (i, j) ∈ A.

Since the early 1960s it has been well known that such linear programs, with monotone increments corresponding to convex functions, solve the original piecewise linear convex optimization problem.

Lemma 1 (Dantzig 1963) Let ẑ be an optimal solution to (LNF-s). If f_ij is convex for each (i, j) ∈ A, then x̂ defined by x̂_ij = s(⌊ℓ_ij/s⌋ + Σ_{k=1}^{N_ij} ẑ^(k)_ij), for all (i, j), is an optimal solution to (RNF-s).

Due to the total unimodularity of the constraint matrix, any optimal solution to the linear program (LNF-s) is also an optimal solution to (INF-s). It follows from this discussion that solving the integer problem amounts to solving (INF-1), and solving the continuous problem is equivalent to solving (INF-ε). The complexity, however, depends on the number of segments in the piecewise linear approximation, which is exponential. The proximity results in the next section lead to a polynomial algorithm that uses a piecewise linear approximation with only a polynomial number of segments.

There have been several studies of how optimizing piecewise linear functions depends on the number of pieces. Sun et al. (1993) observed empirically that solving a transportation problem with a piecewise linear objective function using the simplex method is not sensitive to the number of segments in that function. Hochbaum and Seshadri (1993) observed a similar result for an implementation of the interior point method. The use of the proximity results of the next section, however, makes it possible to guarantee the polynomiality of each piecewise linear optimization and of the overall procedure.
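The role of the monotone increments is easy to check concretely. A minimal sketch (using an illustrative convex cost f(x) = x², not from the paper): it computes the increments Δ^k of the piecewise linear approximation on a grid of width s and verifies that they are nondecreasing, which is the property Lemma 1 relies on.

```python
# Piecewise linear approximation of a convex function on a grid of width s:
# the increments Delta^k = f(l + k*s) - f(l + (k-1)*s) are nondecreasing,
# so an LP picks the z^(k) variables in order without integrality constraints.

def increments(f, lo, hi, s):
    """Return the increments of f over [lo, hi] on a grid of width s."""
    n_pieces = (hi - lo) // s
    return [f(lo + k * s) - f(lo + (k - 1) * s) for k in range(1, n_pieces + 1)]

def pw_value(f, lo, s, deltas, k):
    """Value of the piecewise linear approximation at the breakpoint lo + k*s."""
    return f(lo) + sum(deltas[:k])

f = lambda x: x * x          # a convex cost on one arc (illustrative)
d = increments(f, 0, 8, 2)   # grid points 0, 2, 4, 6, 8
assert d == [4, 12, 20, 28]
assert all(d[i] <= d[i + 1] for i in range(len(d) - 1))  # convexity => monotone
assert pw_value(f, 0, 2, d, 3) == f(6)   # exact at breakpoints
```

The last assertion illustrates why the approximation is exact on the s-grid: summing the first k increments reproduces f at the kth breakpoint.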

3.2 The proximity theorem

A proximity theorem is a statement on the distance, in the L∞ norm, between the solution to the scaled problem and the optimal solution to the problem. It is equivalently also a statement on the distance between the optimal solution to the scaled problem with scaling unit s and the optimal solution to the scaled problem with scaling unit s/2. (Note that 2 can be replaced by any other constant.) To see that the first implies the latter, note that the distance between the two scaled problems' solutions is at most the sum of their distances from the optimum. On the other hand, if the two scaled problems' solutions are close, then their distance to an optimal solution of the problem with s = 1 (or ε) is at most the sum of the distances between the sequence of scaled problems' solutions.

The essence of the proximity-scaling approach is to solve the scaled problem for a scaling unit that is large enough so that the number of binary variables, or pieces, in the piecewise linear approximation is polynomially small. The proximity theorem applies to any convex separable minimization problem of the form,

min{F(x) | Ax = b, 0 ≤ x ≤ u}.

The constraints can always be written as equality constraints and the upper bounds of u can be either explicit or implicit. Let Δ denote the largest subdeterminant of the matrix A and n the number of variables.

Theorem 1 (Hochbaum and Shanthikumar 1990) Let x^s be an optimal solution to the problem in the s scaling phase and x^{s/2} an optimal solution in the s/2 scaling phase. Then

‖x^s − x^{s/2}‖∞ ≤ nΔs.

A special case of this theorem was proved by Granot and Skorin-Kapov (1990) when F() is a convex separable quadratic function. With Theorem 1 and a judicious choice of the scaling unit, the optimal solution in the s scaling phase bounds an interval in which each variable x_i of the optimal solution for the s/2 scaling phase lies. The size of this interval is half the size of the previous interval, thus shrinking the range in the next scaling phase by a factor of 2. For the convex network flow problem the constraint matrix is totally unimodular and thus Δ = 1.
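In symbols, the halving argument is the following (a sketch, writing α = nΔ for the proximity factor):

```latex
x^{s/2}_i \;\in\; \bigl[\,x^{s}_i - \alpha s,\; x^{s}_i + \alpha s\,\bigr],
\qquad \text{an interval of length } 2\alpha s \;=\; 4\alpha \cdot \tfrac{s}{2}.
```

So each phase again has at most 4α pieces of the new width s/2, and after O(log(U/(4α))) halvings the grid width reaches 1 (or ε).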

3.3 A formal description of the algorithm

The proximity-scaling algorithm can be employed whenever there is a valid proximity theorem. For convex network flow the proximity theorem is ‖x^s − x^{s/2}‖∞ ≤ ms. We call α the proximity factor if ‖x^s − x^{s/2}‖∞ ≤ αs. The algorithm is implemented as follows. The scaling unit is selected initially to be s = ⌈U/(4α)⌉ for U = max_{(i,j)∈A} {u_ij − ℓ_ij}. The interval [ℓ_ij, u_ij] for variable x_ij is thus replaced by up to 4α intervals of length s each.

Proximity-scaling algorithm:
Step 0: Let s = ⌈U/(4α)⌉.
Step 1: Solve (LNF-s) or (INF-s) with an optimal solution x^s. If s = 1 output the solution and stop.
Step 2: Set ℓ_ij ← max{ℓ_ij, x^s_ij − αs} and u_ij ← min{u_ij, x^s_ij + αs}, for (i, j) ∈ A.
Step 3: s ← ⌈s/2⌉. Go to step 1.
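The loop above can be sketched generically. In the sketch below, `solve_scaled` is a hypothetical black box standing in for the (LNF-s)/(INF-s) solver and `alpha` is the proximity factor; the toy phase solver used in the illustration merely rounds the minimizer of a separable quadratic to the current grid, so the example only demonstrates how the bounds shrink from phase to phase.

```python
import math

def proximity_scaling(solve_scaled, lo, hi, alpha):
    """Generic proximity-scaling driver: shrink the box [lo, hi] until s = 1.

    solve_scaled(lo, hi, s) must return an optimal solution x^s of the
    s-scaled problem restricted to the box [lo, hi] (a black box here).
    """
    U = max(hi[j] - lo[j] for j in lo)
    s = max(1, math.ceil(U / (4 * alpha)))
    while True:
        x = solve_scaled(lo, hi, s)            # Step 1
        if s == 1:
            return x
        for j in lo:                           # Step 2: proximity => new bounds
            lo[j] = max(lo[j], x[j] - alpha * s)
            hi[j] = min(hi[j], x[j] + alpha * s)
        s = math.ceil(s / 2)                   # Step 3

# Illustration: minimize sum_j (x_j - t_j)^2 over box bounds; each phase is
# solved by rounding the unconstrained minimizer t_j to the s-grid in the box.
targets = {0: 37, 1: 5}
def solve_scaled(lo, hi, s):
    x = {}
    for j in lo:
        g = lo[j] + s * round((targets[j] - lo[j]) / s)  # nearest grid point
        x[j] = min(max(g, lo[j]), hi[j])
    return x

sol = proximity_scaling(solve_scaled, {0: 0, 1: 0}, {0: 100, 1: 100}, alpha=1)
assert sol == {0: 37, 1: 5}
```

With α = 1 the driver runs phases s = 25, 13, 7, 4, 2, 1 for U = 100, i.e. O(log U) phases, each over a constant number of grid pieces per variable.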

4 Proximity-scaling for convex network flow problem

In order to apply the proximity-scaling approach we need to show how to solve the scaled problem (INF-s). The scaled network flow problem, or any piecewise linear convex cost network flow problem, can be stated as a linear cost network flow problem. To do so, one replaces each arc carrying a piecewise linear cost by multiple arcs, each with the cost of one of the pieces (or segments). In our problem, each arc is replaced by 4m unit capacity arcs, creating a multigraph. Obviously, any polynomial time algorithm for the linear problem is adaptable to solving the piecewise linear problem where the number of arcs includes the multiple arcs. Therefore the complexity depends on the number of grid segments, or alternatively on the scaling constant.

Algorithms dealing with flows on multigraphs have been studied. One such algorithm for the network flow problem with multiple arcs is given in Ahuja et al. (1984). Pinto and Shamir (1994) presented an algorithm that is more efficient than the straightforward approach, yet still depends on the total number of grid segments. Fourer (1988) proposed a practical method for solving a convex piecewise linear program relying on the simplex method, but no theoretical complexity guarantees are established.

One particularly efficient way of solving the scaled problem in step 1 is an adaptation of the successive shortest paths method (due to Jewell 1958; Iri 1960; Busacker and Gowen 1961, with some improvements by Edmonds and Karp 1972). We remark later on other potential approaches for solving the scaled problem. The adaptation we use differs from the original successive shortest paths method (referred to as SSPM) in several aspects.
1. There may be no augmenting path between a node with an excess and any node with a deficit. Unlike in SSPM, this does not lead to termination of the algorithm (due to infeasibility).
Instead the algorithm proceeds until there are no feasible paths between any excess node and any deficit node. (Definitions of excess and deficit nodes are given below.)
2. The attempt is only to satisfy excesses and deficits that exceed the value s, even though there could be additional excess and deficit nodes.
3. The shortest paths calculation is performed in a multigraph, and thus requires the maintenance of the set of arcs between each pair of nodes sorted by increasing costs. Notice that the sorting of the reduced costs is identical to the sorting of the original costs.
4. The augmentation is always of 1 unit, as it uses the minimum cost arc (which is, like all others, of capacity 1) between each pair of nodes on the path.

For a given solution x we define the excess at j as ē_j = ⌊b_j/s⌋ − T_j x, for T_j the jth row of T. Negative excess is referred to as deficit. G(x) is the residual graph with respect to x. r_ijq is the residual capacity on the qth arc between i and j, which can be 0 or 1. π is the vector of dual costs, also known as node potentials. The reduced cost of the qth arc between i and j with cost c_ijq = Δ^q_ij is c^π_ijq = c_ijq − π_i + π_j.

The procedure Scaled Successive Shortest Paths works with a solution that satisfies the capacity constraints, and a dual feasible solution π. The procedure is called for a scaling unit s once the problem with scaling unit 2s has been solved with a solution x^2s, a dual feasible solution π, and updated upper and lower bounds vectors L(s), U(s). The procedure for s is initialized with the solution x = max{2·x^2s − se, L(s)}. This guarantees that all the reduced costs of residual arcs on pieces of size s are nonnegative and that the dual solution π is feasible. This follows from convexity: if x^2s_ijq = 2s and c_ijq is the cost on the respective interval of size 2s that is now split into two intervals of size s, the first of cost c_ijq₁ ≤ c_ijq and the second of cost c_ijq₂ ≥ c_ijq, then c_ijq₁ − π_i + π_j ≤ c_ijq − π_i + π_j = 0 as required. But c_ijq₂ ≥ c_ijq, and therefore the residual arc of the reverse q₂th arc between i and j may have a negative cost. To avoid that, we subtract s from each entry of x^2s; then the forward direction of this arc is residual and c_ijq₂ − π_i + π_j ≥ c_ijq − π_i + π_j = 0 as required. In the first call to the procedure the input has π = 0 and x = 0e.

Procedure Scaled Successive Shortest Paths:

Input: L(s), U(s), x = max{2·x^2s − se, L(s)}, s, π.
Step 0: E_s = {j ∈ V | ē_j ≥ 1}, D_s = {j ∈ V | ē_j ≤ −1}.
Step 1: If E_s = ∅ or D_s = ∅, output x and stop.
Else, select k ∈ E_s. Find the shortest paths from k to all nodes in G(x) using the reduced costs c^π_ijq. Let d_j be the shortest path distance from k to j.
If no path exists between k and any node of D_s, set E_s ← E_s − {k} and repeat.
Step 2: Else, let P be a shortest path between k and one ℓ ∈ D_s. Update x by augmenting 1 unit along the arcs of P.
Step 3: Update: G(x), π ← π − d, ē_k ← ē_k − 1, ē_ℓ ← ē_ℓ + 1.
If ē_k < 1, E_s ← E_s − {k}. If ē_ℓ > −1, D_s ← D_s − {ℓ}. Go to step 1.
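The core of the procedure — Dijkstra on reduced costs, followed by the update π ← π − d and a unit augmentation — can be sketched compactly. The code below is a plain successive shortest paths routine on unit-capacity parallel arcs, not the paper's full scaled variant: it omits the L(s), U(s) initialization and simply stops when some excess cannot reach a deficit.

```python
import heapq

def successive_shortest_paths(n, arcs, excess):
    """Min-cost flow by successive shortest paths with node potentials.

    arcs: list of (u, v, cost) unit-capacity arcs -- a multigraph whose
    parallel arcs encode the pieces of a piecewise linear convex cost.
    excess[j] > 0 is supply at j, excess[j] < 0 is deficit.
    Returns the total cost of the flow that was routed.
    """
    INF = float('inf')
    # residual arcs as mutable lists: [head, cost, capacity, index_of_reverse]
    graph = [[] for _ in range(n)]
    for u, v, c in arcs:
        graph[u].append([v, c, 1, len(graph[v])])
        graph[v].append([u, -c, 0, len(graph[u]) - 1])
    pi = [0] * n                         # node potentials (dual solution)
    total = 0
    while True:
        sources = [j for j in range(n) if excess[j] > 0]
        sinks = [j for j in range(n) if excess[j] < 0]
        if not sources or not sinks:
            return total
        k = sources[0]
        # Dijkstra from k on reduced costs c - pi[u] + pi[v] (all nonnegative)
        dist, prev = [INF] * n, [None] * n
        dist[k] = 0
        heap = [(0, k)]
        while heap:
            d, u = heapq.heappop(heap)
            if d > dist[u]:
                continue
            for arc in graph[u]:
                v, c, cap, _ = arc
                if cap > 0 and d + c - pi[u] + pi[v] < dist[v]:
                    dist[v] = d + c - pi[u] + pi[v]
                    prev[v] = (u, arc)
                    heapq.heappush(heap, (dist[v], v))
        reachable = [t for t in sinks if dist[t] < INF]
        if not reachable:
            return total  # (the paper instead drops k and tries other excesses)
        t = min(reachable, key=lambda j: dist[j])
        pi = [p - dv if dv < INF else p for p, dv in zip(pi, dist)]  # pi <- pi - d
        v = t             # augment one unit along the shortest path
        while v != k:
            u, arc = prev[v]
            arc[2] -= 1                      # consume the forward residual arc
            graph[arc[0]][arc[3]][2] += 1    # open its reverse residual arc
            total += arc[1]
            v = u
        excess[k] -= 1
        excess[t] += 1

# Two parallel 0->1 arcs model convex pieces of cost 1 then 3; routing two
# units uses the cheaper piece first.
cost = successive_shortest_paths(
    3, [(0, 1, 1), (0, 1, 3), (1, 2, 2), (1, 2, 2)], [2, 0, -2])
assert cost == 1 + 3 + 2 + 2
```

Because the reduced costs stay nonnegative after each π ← π − d update, every shortest path computation can use Dijkstra, exactly as in the complexity analysis below.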

The SSPM algorithm works with a solution that is capacity feasible, i.e. all the capacity upper and lower bounds are satisfied. The solution at each iteration is dual feasible. This holds for the initial solution, as discussed above, and the update π ← π − d, with d the vector of shortest paths distances, maintains the nonnegativity of the reduced costs.

In the graph processed, each arc is duplicated O(m) times. These copies of the arc all have unit capacity and they are sorted in nondecreasing costs. The single source shortest paths problem can be solved in the same running time as if there were single arcs between each pair of nodes, since among all the duplicated arcs only the one of lowest cost needs to be considered, and maintaining the arcs sorted is straightforward. The running time required to solve the single source shortest paths problem using Dijkstra's algorithm is O(m + n log n), where |V| = n and |A| = m.

At each iteration, either the number of nodes with excess is reduced or the flow is augmented. The number of calls to the shortest paths procedure therefore does not exceed the total excess (or total deficit) in the network. To evaluate the total excess at each iteration, consider the following. For a given problem there are initially ½‖b‖₁ = ½Σ_{i=1}^n |b_i| units of excess. Each supply value b_i is effectively rounded down to ⌊b_i/s⌋s, whereas each demand value (which is a negative number) is rounded down in absolute value, i.e. to ⌈b_i/s⌉s. Once all these demands and supplies ē_j are satisfied, there are up to n unit multiples of s yet unsatisfied – one for each node. Since each scaling iteration is initialized with a vector as low as 2·x^2s − se, this can add up to m units of excess. Also, because capacity upper bounds in (INF-s) are effectively rounded down, it may be impossible to find a path of capacity 1 in the residual network from an excess node to a deficit node.
Each arc can prevent at most one unit of excess from being canceled against deficit. Hence, applying the scaled successive shortest paths algorithm at an iteration results in a solution satisfying all but O(n + m) unit multiples of s of supply and demand. Therefore, starting the iteration for scaling unit s with the initial solution x = max{2·x^2s − se, L(s)}, x is capacity-feasible and dual feasible, while at most O(m + n) units of excess need to be processed. The overall complexity of the scaled successive shortest paths algorithm is therefore O((m + n)(m + n log n)). Since there are log(B/m) calls to this algorithm, the running time of the proximity-scaling algorithm for the integer convex problem is O(log(B/m)·(m + n)(m + n log n)), and for the ε-accurate solution it is O(log(B/ε)·(m + n)(m + n log n)).

Other polynomial algorithms As noted above, any method that solves the scaled problem can be applied within the proximity-scaling algorithm. Therefore any polynomial time algorithm for the minimum cost flow problem on the multigraph can be used to generate a polynomial time algorithm for the convex problem.

Minoux (1984, 1986) was the first to discover a polynomial algorithm for the integer convex flow problem. Minoux adapted the out-of-kilter method with scaling, so that at each iteration there is a factor reduction in the sum of the kilter numbers. The reported running time in Minoux (1986) is O(log ‖u‖∞ · mn²). Ahuja et al. (1993) introduced another algorithm for the integer convex network flow problem using a capacity scaling algorithm. They again use the solution at one scaling step as the initial solution for the next scaling step. The running time they report, O(log ‖b‖∞ · m(m + n log n)), is the same as that of the proximity-scaling algorithm. Interestingly, the complexity of both these algorithms is the same as that of the capacity scaling algorithm applied to the linear network flow problem.

All these polynomial algorithms have running times that depend on log₂ B (recall that B = min{‖u‖∞, ‖b‖₁}), which is essentially the length of the right hand sides. Since, as observed for the capacity scaling algorithm, one can achieve algorithms for the convex case with the same running time as for the linear case, it seems conceivable that the strongly polynomial algorithms for linear network flow problems could also be adapted to the convex case. This, however, is impossible, as proved in the next section.

An algorithm of a different type, by Karzanov and McCormick (1997), for convex cost network flow, is based on minimum mean cycle canceling and has running time O(mn log n log(nC)), where C is the largest cost coefficient. Note that in our complexity model the value of C is the largest cost increment of the function over a unit interval, and it is not available with the input, explicitly or implicitly. Therefore a running time which is a function of C cannot be viewed as a function of the input length. Even if the functions are analytic and provided as such, it takes in general a nontrivial amount of running time to evaluate a bound on the value of C by finding the minimum of each of the n convex functions over the relevant interval. Moreover, the exact value of C cannot even be evaluated in polynomial time. For specific objective functions, however, where the value of C can be bounded, this algorithm can be faster. This is the case, for instance, for the problem of matrix scaling, Rote and Zachariasen (2007).

With the results in the next section, the algorithms with the same running time as the proximity-scaling algorithm are close to being optimal (of smallest complexity possible). This statement holds in the sense that in order to derive more efficient algorithms for the convex case, there must be more efficient algorithms for the linear case of the type that depends on the right hand sides in their complexity.
We believe that any such improvement would be extremely challenging.

5 Proximity-scaling for the general allocation problem

The resource allocation problem and its variants are reviewed in detail in a comprehensive book by Ibaraki and Katoh (1988). The proximity-scaling procedure by Hochbaum (1994) described here for the resource allocation problem's variants has the lowest complexity among all algorithms for these problems to date. The resource allocation problems are all characterized by being solvable by a greedy algorithm in pseudopolynomial time, as described below.

Consider the simple resource allocation problem SRA,

(SRA) max Σ_{j=1}^n f_j(x_j)
s.t. Σ_j x_j ≥ B,
x_j ≥ 0, integer, j = 1,...,n.

The objective function is concave separable, so the f_j() are all concave. This allocation problem is of algorithmic interest in that it is solvable optimally by a greedy algorithm. The constraint is satisfied with equality if all functions f_j() are monotone nonincreasing. In fact, if any of these functions has its maximum at x_j = ℓ_j > 0, then we can always replace the lower bound of 0 for x_j by ℓ_j (or the last value where the function's increment is still nonnegative). If Σ_j ℓ_j ≥ B then we have found the optimal solution x_j = ℓ_j. Therefore we will assume, without loss of generality, that the functions are in the range where they are monotone nonincreasing and the constraint is an equality constraint Σ_j x_j = B.

An important concept used by the greedy algorithm is that of an increment. Let Δ_j(x_j) = f_j(x_j + 1) − f_j(x_j) be the increment of the function f_j() at x_j. The greedy algorithm picks one largest increment at a time until B − Σ_j ℓ_j increments have been selected. The complexity of this algorithm is of course not polynomial, but rather depends on the parameter B, and is thus pseudopolynomial.

The most general case of the allocation problem involves separable concave maximization over polymatroidal constraints: given a submodular rank function r : 2^E → R, for E = {1,...,n}, i.e. r(∅) = 0 and for all A, B ⊆ E,

r(A) + r(B) ≥ r(A ∪ B) + r(A ∩ B).

The polymatroid defined by the rank function r is the polytope {x | Σ_{j∈A} x_j ≤ r(A), A ⊆ E}. We call the system of inequalities {Σ_{j∈A} x_j ≤ r(A), A ⊆ E} the polymatroidal constraints. The general allocation problem, GAP, is

(GAP) max Σ_{j=1}^n f_j(x_j)
s.t. Σ_j x_j = B,
Σ_{j∈A} x_j ≤ r(A), A ⊂ E,
x_j ≥ ℓ_j, integer, j = 1,...,n.

The problem GAP is also solvable by a greedy algorithm:

Procedure greedy:
Input: {ℓ_j}_{j=1}^n, r(), E.
Step 0: x_j = ℓ_j, j = 1,...,n; B ← B − Σ_j ℓ_j.
Step 1: Find i such that Δ_i(x_i) = max_{j∈E} Δ_j(x_j).
Step 2: {feasibility check} If x + e_i is infeasible then E ← E \ {i}; else, x_i ← x_i + 1 and B ← B − 1.

Step 3: If B = 0, output x and stop. If E =∅, output “no feasible solution” and stop. Go to step 1.
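The greedy procedure above is straightforward to implement with a priority queue on the increments; with a heap the next largest increment is found in O(log n). A minimal sketch (the concave utilities and the budget-only feasibility oracle are illustrative, not from the paper):

```python
import heapq

def greedy_allocation(f, lo, B, feasible):
    """Greedy for the allocation problem: repeatedly raise the variable with
    the largest increment Delta_j(x_j) = f_j(x_j + 1) - f_j(x_j), dropping a
    variable from E once raising it would leave the feasible region.

    f: list of concave functions; lo: lower bounds; feasible(x): oracle.
    Returns x, or None if no feasible completion exists.
    """
    n = len(f)
    x = list(lo)
    remaining = B - sum(lo)
    # max-heap on increments via negation; one entry per variable
    heap = [(-(f[j](x[j] + 1) - f[j](x[j])), j) for j in range(n)]
    heapq.heapify(heap)
    while remaining > 0:
        if not heap:
            return None              # E = empty: no feasible solution
        neg_inc, j = heapq.heappop(heap)
        x[j] += 1
        if not feasible(x):          # Step 2: feasibility check
            x[j] -= 1
            continue                 # j is dropped from E
        remaining -= 1
        heapq.heappush(heap, (-(f[j](x[j] + 1) - f[j](x[j])), j))
    return x

# SRA-style instance: concave utilities, equality budget B = 6.
fs = [lambda v: 4 * v - v * v, lambda v: 3 * v - v * v, lambda v: 6 * v - v * v]
x = greedy_allocation(fs, [0, 0, 0], 6, feasible=lambda x: sum(x) <= 6)
assert x == [2, 1, 3]
```

For GAP proper, `feasible` would be a polymatroid membership oracle; only that call changes, which is why the complexity statements below isolate the oracle cost F.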

Obviously this algorithm is pseudopolynomial. Consider now a scaled form of the greedy, defined on the problem scaled in units of length s. Let the solution delivered by greedy(s) be denoted by x^s. The procedure here benefits from a correction of an error in step 2 of the procedure in Hochbaum (1994) that was noted and proved by Moriguchi and Shioura (2004). (The error was to set the value of δ_i equal to 1 if an increment of 1 is feasible but an increment of s is infeasible.)

Procedure greedy(s):
Step 0: δ = 0, x_j = ℓ_j, j = 1,...,n; B ← B − Σ_j ℓ_j.
Step 1: Find i such that Δ_i(x_i) = max_{j∈E} Δ_j(x_j).
Step 2: {feasibility check} If x + e_i is infeasible then E ← E \ {i}, and δ_i = s. Else, if x + se_i is infeasible then E ← E \ {i}, and let δ_i be the tightest slack to infeasibility, i.e. the largest δ for which x + δe_i is still feasible.

We now have the proximity theorem for GAP:

Theorem 2 (Hochbaum 1994) If there is a feasible solution to GAP then there exists an optimal solution x* such that x* > x^s − δ ≥ x^s − se.

Based on this proximity theorem we have the following proximity-scaling algorithm solving GAP:

Procedure GAP:
Step 0: Let s = ⌈B/2n⌉.
Step 1: If s = 1, call greedy; output "x* = x is an optimal solution" and stop. Else, continue.
Step 2: Call greedy(s), and let the output be x^s.
Set ℓ_j ← max{x^s_j − δ_j, ℓ_j} for j = 1,...,n.
Set s ← ⌈s/2⌉. Go to step 1.
end
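For the SRA special case, where the only constraint is the budget and so the slack δ never comes into play, the whole scheme fits in a few lines. The sketch below makes simplifying assumptions (initial s = ⌊B/2n⌋ instead of the ceiling, and budget-only feasibility); `greedy_s` is a stand-in for the paper's greedy(s), and the final check compares the scaled run against running the plain greedy from scratch.

```python
def greedy_s(f, lo, B, s):
    """Greedy in steps of s for SRA: while the budget allows, raise by s the
    variable with the largest increment f_j(x_j + s) - f_j(x_j)."""
    n = len(f)
    x = list(lo)
    left = B - sum(lo)
    while left >= s:
        j = max(range(n), key=lambda i: f[i](x[i] + s) - f[i](x[i]))
        x[j] += s
        left -= s
    return x

def sra_proximity_scaling(f, B):
    """Solve SRA (max sum f_j(x_j), sum x_j = B, x >= 0 integer) by scaling:
    run greedy in steps of s, restart from x^s - s*e (proximity), halve s."""
    n = len(f)
    lo = [0] * n
    s = max(1, B // (2 * n))
    while True:
        x = greedy_s(f, lo, B, s)
        if s == 1:
            return x
        lo = [max(xj - s, 0) for xj in x]   # proximity: x* > x^s - s*e
        s = max(1, s // 2)

fs = [lambda v: 20 * v - v * v, lambda v: 16 * v - v * v,
      lambda v: 24 * v - v * v]
assert sra_proximity_scaling(fs, 24) == greedy_s(fs, [0, 0, 0], 24, 1)
```

Each phase performs O(n) steps of size s instead of the B steps of the plain greedy, which is the source of the log(B/n) factor in the complexity below.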

This algorithm is valid and its complexity is O(n(log n + F) log(B/n)), where F is the complexity of determining δ_i – the tightest slack to infeasibility. Furthermore, this proximity-scaling algorithm leads to the fastest algorithms known for all special cases of the general allocation problem (Hochbaum 1994). The complexity expressions of the algorithm for the different cases are:
1. For the simple resource allocation problem SRA, O(n log(B/n)). This matches the complexity shown earlier by Frederickson and Johnson (1982) using a different technique.
2. For the generalized upper bounds resource allocation problem, GUB, O(n log(B/n)).
3. For the nested problem, O(n log n log(B/n)).

4. For the tree constrained problem, O(n log n log(B/n)).
The complexity bounds for SRA and GUB are also shown to be best possible in the comparison model.

6 The nonlinear knapsack problem

The nonlinear knapsack problem (NLK) is a generalization of the well known integer knapsack problem, which maximizes a linear objective function representing the utilities associated with choosing items (the number of units of item j is represented by the variable x_j) subject to a "packing" constraint: max{Σ_{j=1}^n p_j x_j | Σ_{j=1}^n a_j x_j ≤ B, u_j ≥ x_j ≥ 0, integer, for j = 1,...,n}. In its general form the nonlinear knapsack problem has a separable concave objective and a separable convex packing constraint:

(NLK) max Σ_{j=1}^n f_j(x_j)
subject to Σ_{j=1}^n g_j(x_j) ≤ B,

0 ≤ x_j ≤ u_j, integer, j = 1,...,n.

The functions f_j are assumed concave and nondecreasing, and the functions g_j are assumed convex and nondecreasing. Without loss of generality B and the u_j are integers. The results sketched here were proved in Hochbaum (1995), based on a proximity theorem and an analogy of NLK to SRA. The continuous problem is shown to be solvable with an ε-accurate solution in time O(n log(B/ε)). This running time is impossible to improve, as it equals the running time for solving the continuous SRA problem, which is a simple special case of the nonlinear knapsack problem.

A piecewise linear approximation of the functions f_j and g_j is used to convert the nonlinear knapsack problem (NLK) into a 0/1 knapsack problem. The piecewise linear approximation on the integer grid for the objective of NLK is achieved by replacing each variable x_j by the sum of binary variables Σ_{i=1}^{u_j} x_ij, and letting p_ij = f_j(i) − f_j(i − 1) and a_ij = g_j(i) − g_j(i − 1):

(PLK) max Σ_{j=1}^n Σ_{i=1}^{u_j} p_ij x_ij
subject to Σ_{j=1}^n Σ_{i=1}^{u_j} a_ij x_ij ≤ B,

x_ij ∈ {0, 1}, i = 1,...,u_j, j = 1,...,n.

It is easy to see that the concavity of f_j and the convexity of g_j guarantee that x_ij > 0 only if x_{i−1,j} = 1. It follows that when the a_ij and p_ij are integers, techniques that are used for the 0/1 knapsack problem are applicable here as well.

The problem PLK is a 0/1 knapsack problem max{Σ_{j=1}^N p_j x_j | Σ_{j=1}^N a_j x_j ≤ B, 1 ≥ x_j ≥ 0 integer, j = 1,...,N}. The complexity of solving the 0/1 knapsack problem with a well known dynamic programming algorithm is O(N · min{B, P*}), for P* denoting the optimal solution value. For this dynamic programming to work, it is necessary that both the objective function and the constraint coefficients are integral. Otherwise, the dynamic programming algorithm runs in O(NB) operations if only the constraint coefficients (the weights) are integral, and in O(NP*) time if only the objective function coefficients are integral. Thus, if the functions g_j map integers to integers, or the functions f_j map integers to integers, then NLK is solvable in O(B Σ_{j=1}^n u_j) steps, or O(P* Σ_{j=1}^n u_j) steps, respectively. This is the same complexity as that of the corresponding linear knapsack problem, but unlike the term n, the term Σ_{j=1}^n u_j is not polynomial.
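The dynamic program referred to above is the textbook O(NB) knapsack recursion. A minimal sketch (the functions f and g are illustrative, not from the paper), applied to the binary pieces p_ij, a_ij of PLK:

```python
def knapsack_01(profits, weights, B):
    """O(N*B) dynamic program for the 0/1 knapsack with integer weights:
    dp[b] = best profit achievable with total weight at most b."""
    dp = [0] * (B + 1)
    for p, w in zip(profits, weights):
        for b in range(B, w - 1, -1):   # descending: each item used once
            dp[b] = max(dp[b], dp[b - w] + p)
    return dp[B]

# Pieces of one NLK variable with f(x) = 5x - x^2 (concave) and g(x) = x^2
# (convex) on 0 <= x <= 3: p_i = f(i) - f(i-1), a_i = g(i) - g(i-1).
f = lambda x: 5 * x - x * x
g = lambda x: x * x
profits = [f(i) - f(i - 1) for i in range(1, 4)]   # [4, 2, 0]
weights = [g(i) - g(i - 1) for i in range(1, 4)]   # [1, 3, 5]
assert knapsack_01(profits, weights, B=4) == 6     # pieces 1 and 2, i.e. x = 2
```

Note how the optimal selection takes the pieces contiguously from the first one, exactly the property guaranteed above by the concavity of f_j and the convexity of g_j.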

6.1 Solving the continuous nonlinear knapsack problem

The key idea is to use a representation of a scaled piecewise linear NLK problem as an SRA problem in order to generate a polynomial time algorithm.

(NLK-allocation) max Σ_{j=1}^n Σ_{i=1}^{u_j} (p_ij/a_ij) y_ij
subject to Σ_{j=1}^n Σ_{i=1}^{u_j} y_ij ≤ B,

y_ij integer, i = 1,...,u_j, j = 1,...,n.

Let the vector y* be the optimal solution to (NLK-allocation), and let ȳ_ij = (1/a_ij)·y*_ij for all i, j. The allocation proximity theorem (see Sect. 5) implies that x* ≥ x^s − s·e. But since x^s also satisfies x^s · e = B, then ‖x^s − x*‖∞ ≤ ns, and thus ‖x* − ȳ‖∞ ≤ n max_ij (1/a_ij). Consequently, there is a proximity between the optimal solution to NLK and the optimal solution to NLK-allocation, and it is sufficient to solve the latter.

In order to obtain ε-accuracy we modify the transformation of variables to x_ij = y_ij/s_ij, where s_ij = ⌈n/(ε a_ij)⌉, and duplicate each entry (i, j) s_ij times. Although this increases the size of the arrays, it does not cause an increase in the running time required to solve the allocation problem (NLK-allocation), as that depends only on the number of arrays and the right hand side value. The right hand side is also scaled so that all coefficients are integer: B̄ = ⌈nB/ε⌉. Consequently, the running time is O(n log(B̄/n)) = O(n log(B/ε)). The lower bound for the allocation problem implies that this complexity is impossible to improve.

6.2 Fully polynomial approximation scheme for nonlinear knapsack

The fully polynomial time approximation scheme (FPTAS) for NLK builds on the link of the problem to the allocation problem. We sketch it briefly here; for the full details the reader is referred to Hochbaum (1995). The scheme mimics that of Lawler (1979), which uses the dynamic programming algorithm that solves the problem in O(nP*). Lawler's approximation scheme's complexity is not polynomial, as it includes the factor Σ_{j=1}^n u_j in the running time.

The objective function coefficients are scaled, thus reducing the running time of the algorithm to depend on the new scaled value of the optimal solution. In addition, for a carefully chosen scaling value, the objective function of the scaled problem is close to that of the original problem. Basically, this procedure implements efficiently the steps of the linear knapsack problem's FPTAS for NLK:
1. Find the value and the set of elements corresponding to P₀ = max{max_j p_j, Σ_{j=1}^{j̄} p_j}, for j̄ the largest index such that, when the variables in the 0/1 knapsack are arranged in nonincreasing ratio order, Σ_{j=1}^{j̄} a_j ≤ B.
2. Find the "large" items that are candidates for inclusion in the optimal solution. "Large" items are those with profit coefficients p_j ≥ ½P₀. This is done using the SRA algorithm.
3. Solve the scaled problem for the "large" items, using dynamic programming.
4. Find the largest ratio "small" items that can be packed in the remaining capacity of the knapsack.
The union of the set of items—the large ones found in step 3 and the small ones found in step 4—forms the approximate solution to the problem. We skip the proof that the approximation factor provided is indeed ε. The running time of this ε-approximation scheme is Õ((1/ε²)(n + 1/ε²)). (The Õ notation indicates the omission of polylog terms.)

7 Convex dual of network flow

The dual of the minimum cost network flow problem on a graph G = (V, A) is characterized by constraints of the type x_i − x_j ≤ c_ij + z_ij, for z_ij nonnegative and each arc (i, j) ∈ A. The objective function is of the type min Σ_{j∈V} f_j(x_j) + Σ_{(i,j)∈A} g_ij(z_ij). The problem has numerous applications, including the dial-a-ride transit problem and the time-cost trade-off in project management. For these and additional applications the reader is referred to Ahuja et al. (2003).

This problem has been addressed with a proximity-scaling algorithm in Ahuja et al. (2004). The interesting feature of that algorithm is that the scaled piecewise linear version of the problem is a minimum cut problem on an appropriately defined graph. The proximity-scaling algorithm calls log U times for a minimum cut procedure, where U ≤ n max_{(i,j)∈A} |c_ij| is the largest interval for a variable x_j. The minimum cut procedure is applied to a graph whose size is the square of the size of the original graph. This algorithm's complexity is worse than that of another algorithm by Ahuja et al. (2003), which uses successive shortest paths to solve the problem and has complexity O(mn log(n²/m) log U). We do not describe this algorithm here, as the technique is specialized and does not appear to have implications for other convex optimization problems.

8 Inverse shortest paths and the use of a projected proximity theorem in a proximity-scaling setup

In the inverse shortest paths problem there is a given graph with arc weights and given "shortest paths" distances from a source to a collection of nodes. The goal is to modify the given arc weights as little as possible so that the prescribed "shortest paths" routes are indeed the shortest paths. The cost of deviation of the arc weights from their given values is a convex function of the amount of the deviation. Previously known polynomial time algorithms for the inverse shortest paths problem were given only for the case of the L1 norm and for a single path (see e.g. Ahuja and Orlin 2001b).

Inverse shortest paths problems have applications in the pricing of communication networks, where in order to be competitive the prices offered for linking services should be kept low, and in deducing the imputed value of transportation links. Obtaining geophysical information from the detection of the traversal of seismic waves is the most common application.

Some of the limitations of solving this class of problems to date have to do with the characterization of shortest paths and the problem formulation. Burton and Toint (1992) described a formulation with an exponential number of constraints, enumerating all paths in the graph and restricting their length to be no less than the adjusted length of the route which is to be shortest. Hochbaum (2002) devised compact alternative formulations that lead to better and more efficient solution techniques that are not restricted to linear or quadratic penalty functions.

We briefly review here the problems and techniques for the single source shortest paths with prescribed paths problem and the p sources shortest paths with prescribed paths problem. Another interesting variant of the problem, not reviewed here for lack of space, is the "correlated costs" shortest paths variant previously discussed by Burton and Toint (1994). For additional details see Hochbaum (2002).

8.1 The single source multisink problem with prescribed shortest paths distances

In this problem we have a source node 1 and a set of destinations V̂ ⊂ V for which the shortest paths routes, in the form of a shortest paths tree, are known. Formally, we are given a graph G = (V, A) with a source node 1, estimated edge distances c_ij for (i, j) ∈ A, and the observed shortest paths tree T ⊂ A. The inverse shortest paths problem (ISP) is to modify the edge distances so that the shortest paths with respect to the modified distances are as prescribed by the tree T. Let the penalty functions for modifying the edge distance on edge (i, j) from c_ij to x_ij be the convex functions f_ij(x_ij − c_ij), and let the variables t_j be the shortest paths labels from node 1 to node j ∈ V. Let D = n max_{(i,j)∈A} |c_ij|. If the edge distances are all positive then the bound constraints are modified to 0 ≤ t_i ≤ D.

(ISP1) Min Σ_{(i,j)∈A} f_ij(x_ij − c_ij)

subject to t_j − t_i ≤ x_ij, ∀(i, j) ∈ A \ T,

t_j − t_i = x_ij, ∀(i, j) ∈ T,

t_1 = 0,

−D ≤ t_i ≤ D, ∀i ∈ V.

The problem ISP1 is a convex dual of the minimum cost network flow problem. The running time of the algorithm of Ahuja et al. (2003) for a graph of n nodes and m arcs is O(mn log(n²/m) log(nD)).

8.2 The k paths problem

In terms of complexity the problem on multiple source and destination pairs is more involved than that of the single source problem. The problem is defined on an input of k paths with multiple sources u_1,...,u_k and multiple sinks (or destinations) v_1,...,v_k, P_q = [u_q,...,v_q] for q = 1,...,k. Also given are the prior distance estimates c_ij on each arc. Let t_i^(q) be variables denoting the shortest paths labels from source u_q to node i. The problem is to determine the values of the modified arc distances x_ij that are optimal for the problem:

(ISP2) min Σ_{(i,j)∈A} f_ij(x_ij − c_ij)

subject to t_j^(q) − t_i^(q) ≤ x_ij, q = 1,...,k, ∀(i, j) ∈ A,

t_j^(q) − t_i^(q) = x_ij, q = 1,...,k, ∀(i, j) ∈ P_q,

t_{u_q}^(q) = 0, q = 1,...,k.

The constraint matrix of ISP2 is not totally unimodular. This problem is the convex dual of the multicommodity flow (multicut) problem. The multicut problem is NP-hard to solve in integers, even for a linear objective function, so this seemingly eliminates the possibility of using combinatorial techniques for solving the problem. Nevertheless it is possible to solve the problem as a linear problem. The idea is to use the proximity result in Theorem 3 below with the proximity-scaling algorithm in order to reduce this convex problem to a linear programming counterpart.

Let the scaled problem, called (s-ISP), be defined on the variables x_j^[s] = x_j/s and z_ij^[s] = z_ij/s. Let the functions w_j^[s]() and f_ij^[s]() be piecewise linear convex functions that coincide with the convex functions w_j() and f_ij() at breakpoints that are s units apart. Adding the implied bounds on the values of the distances, the scaled problem is

(s-ISP2) min Σ_{(i,j)∈A} f_ij^[s](s·x_ij − c_ij)

subject to t_j^(q) − t_i^(q) ≤ x_ij, q = 1,...,k, ∀(i, j) ∈ A,

t_j^(q) − t_i^(q) = x_ij, q = 1,...,k, ∀(i, j) ∈ P_q,

t_{u_q}^(q) = 0, q = 1,...,k,

−D/s ≤ t_i^(q) ≤ D/s, ∀i ∈ V, q = 1,...,k.

For x a scaled optimal solution to s-ISP let x^s = sx be the solution vector which is feasible for ISP. A corollary of Theorem 1 is that for a problem in N variables with a largest subdeterminant Δ the distance between the optimal solution x* to the problem LP and the optimal solution x^s to the scaled problem LP-s is

‖x* − x^s‖_∞ ≤ 2NsΔ.

In our case the number of variables is O(m) but the size of the largest subdeterminant can be exponentially large in the size of the matrix and the size of the coefficients α_k for the multiple paths problem. The proximity result of Theorem 3 addresses this issue by restricting the proximity to a portion of the variables only. Namely, the proximity is of the form ‖t* − t^s‖_∞ ≤ ns. More precisely, let the solution vector be the vector (t, x) where t ∈ R^n and x ∈ R^m. We let the problem 1-ISP be the problem s-ISP where the scaling unit is 1.

Theorem 3 (Hochbaum 2002) (i) For each optimal solution (t, x) for ISP, there exists an optimal solution (s*, x*) for 1-ISP such that ‖t − s*‖_∞ ≤ n.

(ii) For each optimal solution (s, x^s) for 1-ISP, there exists an optimal solution (t*, x*) for ISP such that ‖t* − s‖_∞ ≤ n.

The piecewise linear problem s-ISP2 can be solved by linear programming with each variable replaced by 2D/s variables, one for each segment of length s. The number of variables in each linear programming problem (s-ISP2) is then four times the number of variables in the problem of the previous scaling iteration.

Procedure inverse paths:
Step 0: Let s = U/4.
Step 1: Solve (s-ISP2), with an optimal solution x^s. If s = 1 output the solution and stop.
Step 2: Set ℓ_j ← max{ℓ_j, x_j^s − s} and u_j ← min{u_j, x_j^s + s}, for j = 1,...,n.
Step 3: s ← s/2. Go to step 1.

Procedure inverse paths thus executes O(log D) calls to a linear programming solver for (s-ISP2).
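The proximity-scaling idea can be illustrated in one dimension. The sketch below is an illustrative toy, not the multi-variable linear program of (s-ISP2): it finds an integer minimizer of a convex function over [0, U] by solving on a grid of spacing s, shrinking the search window via proximity, and halving s, for O(log U) coarse solves in total.

```python
def proximity_scaling_min(f, U):
    """Integer minimizer of a convex function f on [0, U] by proximity
    scaling: solve coarsely on a grid of spacing s, keep a window of
    width 2s around the coarse minimizer (the proximity step), halve s.
    Performs O(log U) coarse solves."""
    lo, hi = 0, U
    s = max(U // 4, 1)
    while True:
        grid = list(range(lo, hi + 1, s))
        if grid[-1] != hi:
            grid.append(hi)
        x = min(grid, key=f)                     # coarse solve at breakpoints
        lo, hi = max(lo, x - s), min(hi, x + s)  # proximity window
        if s == 1:
            return x
        s = max(s // 2, 1)
```

For a convex function the true minimizer lies within one grid spacing of the best grid point, which is exactly the proximity guarantee the window update relies on.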

9 Threshold theorem based algorithm for convex closure and convex s-excess

A threshold theorem is a particularly strong form of proximity. In a threshold theorem we replace the convex objective by a linear objective where each coefficient is the derivative (or subgradient) of the respective function at some point α. When a threshold theorem holds, we can conclude from the optimal solution to this linear problem that the value of some of the variables at the optimum is greater than α whereas the others are smaller than or equal to α. We illustrate this concept for two problems, the convex cost closure problem and the convex s-excess problem. These problems are characterized by constraints of the form xi ≥ xj and xi − xj ≤ zij , respectively.

9.1 The convex cost closure problem

A common problem in statistical estimation is that observations do not satisfy preset ranking order requirements. The challenge is to find an adjustment of the observations that fits the ranking order constraints and minimizes the total deviation penalty. Many aspects of this problem as well as numerous applications are studied in (Barlow et al. 1972). The deviation penalty is a convex function of the fitted values. This application motivated the introduction of the convex cost closure problem in Hochbaum and Queyranne (2003). The convex cost closure problem (CCC) is defined formally on a directed graph G = (V, A) with convex functions f_j() associated with each node j ∈ V. The formulation of the convex cost closure problem is then:

(CCC) min Σ_{j∈V} f_j(x_j)

subject to xi − xj ≥ 0, ∀(i, j) ∈ A,

ℓ_j ≤ x_j ≤ u_j, integer, j ∈ V.

This problem generalizes the (linear) closure problem which is (CCC) with binary variables, that is ℓ_j = 0 and u_j = 1. The closure problem is known to be equivalent to solving a minimum s,t-cut problem in a related graph. This was first noted explicitly by Picard (1976).

The threshold theorem by Hochbaum and Queyranne (2003) reduces the convex problem to its binary counterpart – the minimum closure problem. To sketch the main idea of the theorem we first note that one can extend all the functions f_i() so that they are convex in the range [ℓ, u] for ℓ = min_i ℓ_i, u = max_i u_i. Let α be a scalar and w_i be the derivative or subgradient of f_i at α, w_i = f_i'(α) = f_i(α + 1) − f_i(α). Let G_α = (V, A) be a closure graph with node weights w_i. The threshold theorem states:

Theorem 4 (Hochbaum and Queyranne 2003) Let the optimal closure in G_α be S_α. Then the optimal values x_j* of the variables for the convex problem satisfy x_j* > α if j ∈ S_α, and x_j* ≤ α otherwise.

By repeated applications of the minimum closure algorithm on the graph G_α for a range of values of α in [ℓ, u] we obtain a partition of the set of variables and of the interval [ℓ, u] into up to n subsets and subintervals where each subinterval contains the optimal value of one subset of variables. It is further shown in Hochbaum and Queyranne (2003) that this partition can be achieved with a parametric minimum cut procedure where α is the parameter. The procedure used to solve the parametric minimum cut problem is a generalization of a procedure devised by Gallo et al. (1989) for linear functions of the parameter, which is based on the push-relabel algorithm of Goldberg and Tarjan (1988). The generalization for any monotone functions is described in Hochbaum (2003) and in Hochbaum (1998) for both the push-relabel algorithm and the pseudoflow algorithm. The algorithm requires at each iteration finding the integer minima of the convex functions which is accomplished with binary search in O(n log U) steps. The run time of the procedure solving the convex closure problem is shown to be O(mn log(n²/m) + n log U), which is the sum of the complexities of a (single) minimum s,t cut procedure and the minimization of n convex functions in bounded intervals of length up to U. The convex cost closure problem generalizes the minimum cut problem (when the functions are linear), and it is at least as hard as the minimization of n convex functions over bounded intervals (when there are no constraints other than upper/lower bounds). Hence the run time cannot be improved unless the respective run times of the minimum cut problem and minimizing convex functions can be improved.
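Theorem 4 can be verified by brute force on a toy instance. In the sketch below the instance, the orientation of the closure condition (a set is closed if it contains every predecessor of its members) and the sign convention for the node weights are illustrative assumptions — one common way to realize the construction, not necessarily the exact one used by Hochbaum and Queyranne.

```python
from itertools import product

# Toy convex cost closure instance: V = {0, 1}, arc (0, 1) encodes x_0 >= x_1,
# f_0(x) = (x - 1)^2, f_1(x) = (x - 3)^2, integer bounds [0, 4].
arcs = [(0, 1)]
fs = [lambda x: (x - 1) ** 2, lambda x: (x - 3) ** 2]
lo, hi = 0, 4

# Brute-force the convex problem over the feasible integer lattice.
feas = [x for x in product(range(lo, hi + 1), repeat=2)
        if all(x[i] >= x[j] for (i, j) in arcs)]
xstar = min(feas, key=lambda x: sum(f(v) for f, v in zip(fs, x)))

def min_closure(w):
    """Minimum-weight closed set: j in S and (i, j) an arc implies i in S."""
    best, best_set = 0, set()
    for mask in range(1, 2 ** len(w)):
        S = {i for i in range(len(w)) if mask >> i & 1}
        if all(i in S for (i, j) in arcs if j in S):
            val = sum(w[i] for i in S)
            if val < best:
                best, best_set = val, S
    return best_set

for alpha in range(lo, hi):
    # Node weights are the discrete derivatives f_i(alpha + 1) - f_i(alpha).
    S = min_closure([f(alpha + 1) - f(alpha) for f in fs])
    # Threshold property: x*_j > alpha exactly when j is in S_alpha.
    assert all((xstar[j] > alpha) == (j in S) for j in range(2))
```

On this instance the constrained optimum is x* = (2, 2), and for every α the minimum closure in G_α is exactly the set of variables whose optimal value exceeds α.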

9.2 The minimum s-excess problem

The s-excess problem is a variant of the maximum/minimum closure problem with a relaxation of the closure requirement: Nodes that are successors of other nodes in S (i.e. that have arcs originating from a node of S to these nodes) may be excluded from the set, but at a penalty that is equal to the capacity of those arcs. In a closure graph these arcs are of infinite capacity. For the s-excess problem the arcs have finite capacities representing the penalties for violating the closure requirement.

The minimum s-excess problem is defined on a directed graph G = (V, A), with node weights (positive or negative) w_i for all i ∈ V, and nonnegative arc weights u_ij for all (i, j) ∈ A. The objective is to find a subset of nodes S ⊆ V such that Σ_{i∈S} w_i + Σ_{i∈S, j∈S̄} u_ij is minimum. (The maximum s-excess problem is to maximize Σ_{i∈S} w_i − Σ_{i∈S, j∈S̄} u_ij.)

A generalized form of Picard's theorem showing that the closure problem is equivalent to the minimum cut problem has been proved for the s-excess problem in Hochbaum (1998). The idea there was to construct a graph as for the closure problem except that the arc capacities not adjacent to source and sink for (i, j) in A are the respective weights u_ij. The sink set of a minimum cut in the graph created was shown to be the minimum s-excess set. The interested reader is referred to Hochbaum (1998) for details.

The s-excess problem has appeared in several forms in the literature. One is the boolean quadratic minimization problem with all the quadratic terms having positive coefficients. This can be shown to be a restatement of the s-excess problem. Another, more closely related, is the feasibility condition of Gale (1957) for a network with supplies and demands, or Hoffman's (Hoffman 1960) for a network with lower and upper bounds.
Verifying feasibility is equivalent to ensuring that the maximum s-excess is zero, in a graph with node weights equal to the respective supplies and demands with opposite signs – if the s-excess is positive, then there is no feasible flow satisfying the supply and demand balance requirements. This problem appeared also under the names maximum blocking cut or maximum surplus cut in Radzik (1993).
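The definition can be checked by direct enumeration on a toy instance; the node weights and the single arc capacity below are arbitrary illustrative values.

```python
from itertools import combinations

# Toy minimum s-excess instance: nodes {0, 1}, w_0 = -3, w_1 = 2,
# and a single arc (0, 1) of capacity u_01 = 1.
nodes = [0, 1]
w = {0: -3, 1: 2}
cap = {(0, 1): 1}

def s_excess(S):
    """Sum of node weights in S plus capacities of arcs leaving S."""
    return (sum(w[i] for i in S)
            + sum(c for (i, j), c in cap.items() if i in S and j not in S))

subsets = [set(c) for r in range(len(nodes) + 1)
           for c in combinations(nodes, r)]
best = min(subsets, key=s_excess)
# S = {0} pays the arc penalty u_01 but keeps the weight -3 node: value -2.
assert best == {0} and s_excess(best) == -2
```

Enumerating all four subsets gives values 0, −2, 2, −1, so the minimum s-excess set is {0}: it excludes the successor node 1 and pays the capacity of arc (0, 1) as the penalty.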

9.3 The convex s-excess problem

The convex s-excess problem is a generalization of the s-excess problem with node weights f_j() that are convex functions.

(Convex s-excess) min Σ_{j∈V} f_j(x_j) + Σ_{(i,j)∈A} w_ij z_ij

subject to xi − xj ≤ zij , for (i, j) ∈ A,

u_j ≥ x_j ≥ ℓ_j, j = 1,...,n,

zij ≥ 0,(i,j)∈ A.

This problem was studied in the context of the application of image segmentation by Hochbaum (2001). In the problem of image segmentation a transmitted image is degraded by noise. The assumption is that a "correct" image tends to have areas of uniform color. The goal is to reset the values of the colors of the pixels so as to minimize the penalty for the deviation from the observed colors, and furthermore, so that the discontinuity in terms of separation of colors between adjacent pixels is as small as possible. Thus the aim is to modify the given color values as little as possible while penalizing changes in color between neighboring pixels. The penalty function has two components: the deviation cost that accounts for modifying the color assignment of each pixel, and the separation cost that penalizes pairwise discontinuities in color assignment for each pair of neighboring pixels. Representing the image segmentation problem as a graph problem we let the pixels be nodes in a graph and the pairwise neighborhood relation be indicated by edges between neighboring pixels. Each pairwise adjacency relation {i, j} is replaced by a pair of two opposing arcs (i, j) and (j, i) each carrying a capacity representing the penalty function for the case that the color of j is greater than the color of i and vice versa. The set of directed arcs representing the adjacency (or neighborhood) relation is denoted by A. We denote the set of neighbors of i, or those nodes that have a pairwise relation with i, by N(i). Thus the problem is defined on a graph G = (V, A). Each node j has the observed color value g_j associated with it. The problem is to assign an integer value x_j, selected from a spectrum of K colors, to each node j so as to minimize the penalty function. Let the K color shades be a set of ordered values L = {q_1, q_2,...,q_K}. Denote the assignment of a color q_p to pixel j by setting the variable x_j = p. Each pixel j is permitted to be assigned any color in a specified range {q_ℓj,...,q_uj}. For G() the deviation cost function and F() the separation cost function the problem is,

min_{u_i ≥ x_i ≥ ℓ_i} Σ_{i∈V} G_i(g_i, x_i) + Σ_{i∈V} Σ_{j∈N(i)} F_ij(x_i − x_j).

This formulation is equivalent to the following problem, referred to as (IS) (acronym for Image Segmentation):

(IS) min Σ_{j∈V} G_j(g_j, x_j) + Σ_{(i,j)∈A} F_ij(z_ij)

subject to xi − xj ≤ zij , for (i, j) ∈ A,

u_j ≥ x_j ≥ ℓ_j, j = 1,...,n,

zij ≥ 0,(i,j)∈ A.

The case where the functions F_ij() are concave is easily shown to be NP-hard. To see that consider a reduction from maximum cut: Given an undirected graph G = (V, E) find a cut so that the number of edges across the cut is maximum. Let the values of x_i be 0 or 1, and the objective be min Σ_{(i,j)∈E} −|x_i − x_j|. This is equivalent to max Σ_{(i,j)∈E} |x_i − x_j|. The partition of V into V_0 = {i ∈ V | x_i = 0} and V_1 = {i ∈ V | x_i = 1} is therefore a maximum cut. If the functions F_ij() are convex and G_j() are convex then the problem becomes an instance of the dual of minimum cost network flow which is solved in polynomial time as sketched in Sect. 7. Even if G_j() are nonconvex, but F_ij() are convex, the problem is solved in time that is polynomial in U = max_j {u_j − ℓ_j} as shown in Ahuja et al. (2004). The constraints of (IS) have several interesting properties. Firstly, the coefficients of the constraints form a totally unimodular matrix. Secondly, the set of constraints are those of the linear programming dual of the minimum cost network flow. For the dual of the minimum cost network flow problem, a generic constraint is of the type

xi − xj ≤ cij + zij .

A threshold theorem for the convex s-excess problem generalizing the one in Hochbaum and Queyranne (2003) was proved in Hochbaum (2001). The essence of the theorem is to reduce the convex s-excess problem to the s-excess problem on binary variables which is equivalent to the ordinary minimum s,t-cut problem (Hochbaum 1998). We construct for any α a graph G_α where the weight of node j is the scalar w_j = f_j(α + 1) − f_j(α) – the subgradient or derivative of f_j at α. The minimum s-excess problem defined on that graph with the objective function min Σ_{j∈V} w_j x_j + Σ_{(i,j)∈A} u_ij z_ij is solved as a minimum cut problem. If there are multiple optimal solutions we pick the one where the s-excess set is maximal (i.e. not contained in any other optimal set) and thus unique. The uniqueness follows from the properties of the minimum cut.

Theorem 5 (Hochbaum 2001) Let S* be the maximal minimum s-excess set in the graph G_α. Then there is an optimal solution x* to the corresponding convex s-excess problem satisfying x_i* ≥ α if i ∈ S* and x_i* < α if i ∈ S̄*.

Let the (IS) problem involve n pixels (variables) and m adjacency relations (arcs). Let T(n,m) be the complexity of solving the minimum s,t cut problem on a graph with n nodes and m arcs. The algorithm based on the threshold theorem solves the problem for

G() convex functions and F() linear functions in time O(T(n,m) + n log U). Since the (IS) problem generalizes both the minimum cut problem and the finding of minima of n convex functions this time complexity is the best time complexity achievable. Any improvement in the run time of algorithms to identify the integer minima of convex functions or to find a minimum (parametric) cut would immediately translate into improvements of the run time of this algorithm.
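The n log U term in these bounds comes from minimizing each of the n convex functions over an integer interval by binary search on the sign of the discrete derivative f(x+1) − f(x); a minimal sketch:

```python
def argmin_convex_int(f, lo, hi):
    """Integer minimizer of a convex function f on [lo, hi] using
    O(log(hi - lo)) evaluations, by binary search on the sign of the
    discrete derivative f(x + 1) - f(x)."""
    while lo < hi:
        mid = (lo + hi) // 2
        if f(mid + 1) - f(mid) < 0:
            lo = mid + 1   # f still decreasing: minimizer lies right of mid
        else:
            hi = mid       # f nondecreasing: minimizer is at mid or left of it
    return lo
```

Convexity guarantees the discrete derivative changes sign at most once, so the binary search is valid; this is exactly the O(log U) per-function subroutine referenced in the complexity bound.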

10 Classes of quadratic problems solvable in strongly polynomial time

As noted earlier, the quadratic problem takes a special place among nonlinear optimization problems over linear constraints. This is because the optimality conditions are linear, and the solution to a system of linear inequalities is of polynomial length in the size of the coefficients. So for quadratic problems an optimal continuous solution is of length that is a polynomial function of the length of the input. In addition, the proof of impossibility for strongly polynomial algorithms using the algebraic tree computation model is not applicable to the quadratic case. Still, recall that for the comparison computation model the proof is valid and it is impossible to derive strongly polynomial algorithms using only comparisons. Only a few quadratic optimization problems are known to be solvable in strongly polynomial time. For instance, it is not known how to solve the minimum quadratic convex cost network flow problem in strongly polynomial time. The few results described here add to the limited repertoire of quadratic problems solved in strongly polynomial time.

10.1 Quadratic network flow problems

The feature that is common to all the techniques that have been used to derive strongly polynomial algorithms for quadratic separable problems is the use of a parametric search in order to solve the continuous problem. Then proximity is used to derive an integer solution. Several results pertaining to the problem of minimizing a concave objective use parametric search to establish polynomiality and strong polynomiality when the network contains only a fixed number of nonlinear arc costs (Värbrand et al. 1995, 1996) or when the network has some special properties (e.g. the production-transportation problem with a transportation matrix with the Monge property, Hochbaum and Hong 1996). We review here some classes of quadratic convex network flow problems that can be solved in strongly polynomial time.

Simple quadratic allocation in linear time Brucker (1984) described a linear time algorithm for the quadratic continuous SRA. Our adaptation of the algorithm, and its application to the integer case, is described next. The algorithm for the quadratic resource allocation problem, (QRA), is based on a search for an optimal value of the multiplier δ. The continuous QRA is formulated as follows:

(QRA) min Σ_{i=1}^n a_i x_i + (1/2) b_i x_i²

s.t. Σ_{i=1}^n x_i = d,

x_i ≥ 0, i = 1,...,n, where d is positive (what earlier was denoted by B) and each b_i is positive.

Fig. 2 The quadratic resource allocation QRA

The convexity of the objective function guarantees that a solution satisfying the Kuhn–Tucker conditions is also optimal. In particular, we seek a non-negative solution x* and a value δ* such that:

Σ_{i=1}^n x_i* = d, and

x_i* > 0 implies that a_i + b_i x_i* = δ*.

The situation is illustrated in Fig. 2. The value set for δ determines associated values for x_i. For any value δ, the associated solution x is:

x_i = 0 for i such that a_i > δ,

x_i = (δ − a_i)/b_i for i such that a_i ≤ δ.

Finding the optimal solution to QRA is equivalent to finding a value δ* such that the associated solution satisfies d̂ = Σ_{i=1}^n x_i equal to d. If d̂ < d then δ* is greater than δ; if d̂ > d, then δ* is less than δ. For any δ, the value of d̂ is dependent on the coefficients in the set {i | a_i ≤ δ}. Consequently, d̂(δ) is a monotone, piecewise linear function having breakpoint values a_i, i = 1,...,n. Its monotonicity allows for a binary search for the optimal value, δ*, satisfying d̂(δ) = d. Since the value of d is finite there is a finite optimal δ* for every instance of QRA. The algorithm we propose for finding δ* chooses "guesses" (from among the breakpoint values, a_i), until it finds two consecutive breakpoints which contain δ* in the interval between them. In this range, d̂ = Σ_i x_i is a linear function in δ. The problem is then solved by finding the particular value of δ for which d̂ = d (i.e., by solving the linear equation in one variable).

In the algorithm the parameters A and B maintain partial sums necessary to evaluate Σ_{i=1}^n x_i without computing the sum at every iteration from scratch.

Procedure QRA:
Step 0: Let L = {a_1,...,a_n}, I = {1,...,n}, A = Σ_{i=1}^n a_i/b_i, B = Σ_{i=1}^n 1/b_i.
Step 1: Set δ ← a_m, the median value from the set L. Let Â = A − Σ_{i∈I+} a_i/b_i and B̂ = B − Σ_{i∈I+} 1/b_i, where I+ = {i ∈ I | a_i > δ}.
Step 2: Let d̂ = B̂δ − Â. If d̂ = d then output "δ = δ*" and stop. If d̂ > d then δ > δ*. If d̂ < d then δ < δ*.
Step 3: If δ > δ* then set I ← {i ∈ I | a_i < δ}, L ← {a_i | i ∈ I}, A ← Â − a_m/b_m, B ← B̂ − 1/b_m. Else, δ < δ*, and set I ← {i ∈ I | a_i ≥ δ}, L ← {a_i | i ∈ I}.
Step 4: If |L| ≥ 2, go to Step 1. Else δ* = (d + A)/B.

The algorithm outputs a value δ*. The optimal solution x* is then readily available, and can be determined in linear time:

x_i* = (δ* − a_i)/b_i for i such that a_i ≤ δ*, and x_i* = 0 otherwise.

Theorem 6 (Cosares and Hochbaum 1994) Procedure QRA finds δ* and x* in O(n) time.

Proof For any guess δ, the values of Â and B̂ are set to assure that d̂ = Σ x_i is set to the appropriate value (i.e. x_i = 0 when a_i > δ). The element a_i is removed from L if either it is known to be greater than δ* or if it is less than an established lower bound for δ*. When L contains only one element, say a_i, then we can conclude that δ* is between a_i and a_j, the next largest of the a's. Furthermore, since d̂ is a linear function of δ in this range (i.e. d̂ = B̂δ − Â), δ* and x* are determined as in Step 4. The O(n) complexity of the algorithm follows from the fact that each of Steps 1, 2 and 3 can be performed in a number of arithmetic operations that is linear in the cardinality of the set L, including the selection of the median value, Blum et al. (1972). Since the number of elements in the set is initially n and is cut in half after each pass, the total work is linear in (n + n/2 + n/4 + ···) ≤ 2n, so the complexity of the algorithm is O(n). □

When the problem is to be solved in integers we apply the proximity theorem for the general allocation problem, Theorem 2. From the optimal continuous solution x* we create a lower bound vector to the optimal integer solution, x* − e. Since Σ_j x_j* = d, there are only n more units to add, which can be determined in as many iterations of the greedy algorithm, each taking a constant time. The running time is therefore linear for the integer version of the problem. A similar, though slightly more complex, algorithm works to solve QRA where each variable has an upper bound. A linear time procedure is described in Hochbaum and Hong (1995).
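A simplified rendering of the δ-search might look as follows. It sorts the breakpoints instead of using median selection, so it runs in O(n log n) rather than the O(n) of Procedure QRA, but it computes the same δ* and x*:

```python
def qra(a, b, d):
    """Solve  min sum a_i x_i + 0.5 b_i x_i^2  s.t.  sum x_i = d, x >= 0,
    with all b_i > 0 and d > 0, via the Kuhn-Tucker characterization
    x_i = max(0, (delta - a_i) / b_i).  Sorting replaces the linear-time
    median selection of Procedure QRA, hence O(n log n) here."""
    n = len(a)
    order = sorted(range(n), key=lambda i: a[i])
    A = B = 0.0          # partial sums over the active set {i : a_i <= delta}
    for pos, i in enumerate(order):
        A += a[i] / b[i]
        B += 1.0 / b[i]
        # With the first pos+1 coefficients active, dhat(delta) = B*delta - A,
        # so dhat(delta) = d is solved by the linear equation below.
        delta = (d + A) / B
        nxt = a[order[pos + 1]] if pos + 1 < n else float("inf")
        if a[i] <= delta <= nxt:      # delta falls inside this linear segment
            return delta, [max(0.0, (delta - a[j]) / b[j]) for j in range(n)]
```

For a = (0, 1, 2), b = (1, 1, 1) and d = 3 this yields δ* = 2 and x* = (2, 1, 0), which satisfies the Kuhn–Tucker conditions: every positive x_i* has a_i + b_i x_i* = 2.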

Quadratic allocation flow problem The network allocation problem is a special case of (GAP) as proved in Federgruen and Groenevelt (1986a, 1986b). As such it is solvable in pseudo-polynomial time by the greedy algorithm, and by a polynomial time algorithm with the proximity-scaling procedure described in Sect. 5. The problem is defined on a network with a single source and a set of sinks.

All known special cases of the network allocation problem are solved very efficiently and in strongly polynomial time. These problems include the network allocation problem, the tree allocation problem, the nested allocation problem and the generalized upper bounds allocation problem. In Hochbaum and Hong (1995) we describe algorithms for these problems with respective complexities O(mn log(n²/m)), O(n log n), O(n log n) and O(n). These algorithms are all based on an efficient search for the optimal Lagrange multipliers and in that sense generalize the procedure quadratic-allocation.

We define the network allocation problem on a directed network G = (V, A) with a single source node s ∈ V and T ⊆ V a set of sinks. Let B > 0 be the total supply of the source, and let C_uv be the capacity limit on each arc (u, v). Let the vector of the flow be φ = (φ_uv : (u, v) ∈ A).

(Net-alloc) min Σ_{j∈T} a_j x_j + (1/2) b_j x_j²

(i) Σ_{(v,u)∈A} φ_vu − Σ_{(u,v)∈A} φ_uv = 0, v ∈ V − T − {s},

(ii) Σ_{(s,u)∈A} φ_su − Σ_{(u,s)∈A} φ_us ≤ B,

(iii) Σ_{(u,j)∈A} φ_uj − Σ_{(j,u)∈A} φ_ju = x_j, j ∈ T,

(iv) 0 ≤ φuv ≤ Cuv,(u,v)∈ A,

0 ≤ xj ≤ uj , integer,j∈ T.

The total sum of flow leaving the source B cannot exceed the minimum cut in the network. Also, as long as each variable is bounded in an interval where the derivative is negative and the sum of upper bounds is at least as large as B, the amount of flow in the network will be equal to B. So subject to such preprocessing the problem can be stated either with an equality or an inequality constraint on the source, (ii).

Net-alloc is not the same problem as a quadratic cost flow problem. In the latter problem there is an underlying network with a quadratic cost associated with the flow along each arc. In Net-alloc there is also an underlying network, but costs (quadratic) are associated only with the net flow at each sink. For this purpose we add a new dummy sink, t, and send all flows from the set of sinks T to that node. The costs are then only associated with arcs adjacent to node t. This graph is described in Fig. 3, where only the dashed lined arcs, that connect the sink to the 'variable' nodes, have costs associated with them.

Net-alloc is solvable, as described in Hochbaum and Hong (1995), in strongly polynomial time, O(mn log(n²/m)). The general idea of the algorithm is to establish the equivalence of the problem to a lexicographic flow problem. That latter problem is then posed as a parametric flow problem. That parametric flow problem has arc capacities which are each piecewise linear with a single breakpoint. We then generate all the breakpoints of the function using an algorithm that extends the algorithm by Gallo et al. (1989) for parametric flow problems. (Their algorithm is applicable when each arc capacity is linear.) As such, this algorithm, like the others in this section, is based on the concept of parametric search.

Fig. 3 The quadratic network allocation problem

Quadratic transportation with fixed number of suppliers Using a known transformation, any minimum cost network flow problem can be formulated as a transportation problem (see e.g. Ahuja et al. 1993). It is therefore sufficient in the search for efficient algorithms for the quadratic separable network flow problem to focus on the quadratic separable transportation problem. The quadratic transportation problem (QTP) is defined on a bipartite network, with k supply nodes and n demand nodes. The cost of transporting flow from a supply node to a demand node is a convex quadratic function of the flow quantity. The formulation of the continuous problem is as follows:

(QTP) min Σ_{i=1}^k Σ_{j=1}^n a_ij x_ij + (1/2) b_ij x_ij²

s.t. Σ_j x_ij = s_i, i = 1,...,k,

Σ_i x_ij = d_j, j = 1,...,n,

x_ij ≥ 0, i = 1,...,k, j = 1,...,n,

where b_ij > 0, s_i > 0, and d_j > 0 are rational numbers and Σ_i s_i = Σ_j d_j. While it is not known whether the QTP is solvable in strongly polynomial time, Cosares and Hochbaum (1994) gave a strongly polynomial algorithm for the case when the number of supply nodes k is fixed. That algorithm exploits the relationship between the transportation problem and the allocation problem. The continuous allocation problem can be solved by identifying a Lagrange multiplier associated with the single constraint. In the quadratic case this can be done in linear time. The algorithm for the QTP entails relaxing and aggregating supply constraints, and then searching for optimal values for the Lagrange multipliers. For the case of two supply nodes, k = 2, the algorithm is linear. For greater values of k, the algorithm has running time of O(n^{k+1}). A result by Megiddo and Tamir (1993), which invokes an alternative searching method, yields a linear running time of the algorithm for fixed k (where the constant coefficient is exponential in k).

Quadratic separable flow on series-parallel graphs Tamir (1993) devised a strongly polynomial algorithm for minimum convex quadratic separable cost flow when the underlying network is series-parallel, in the presence of a single source-sink pair. The cost of the flow is viewed as a parametric function of the available supply. The algorithm exploits the series-parallel structure of the network in order to construct an optimal continuous solution. A series-parallel graph is constructed recursively from two smaller series-parallel graphs using two types of composition operations: 1. series, where one graph identifies its source with the sink of the other, or 2. parallel, where the two graphs identify their source nodes and sink nodes as one. The value of the cost function for the series composition is the sum of the cost functions for each one of the graphs. For parallel composition, the combination is a solution to an optimization function for all possible partitions of the flow into the two parallel graphs. This optimization function is in fact a SRA and some of its properties are used to derive a solution in O(m²) time. An integer solution is then determined from the continuous optimal solution using the same approach as described above. This is therefore another example where an optimal continuous solution is easier to determine than an integer one.

10.2 Quadratic knapsack problem

In the quadratic knapsack problem, the functions f_j are quadratic concave and g_j are linear. The optimal continuous solution in this case is of polynomial length in the size of the input. Thus there is an accuracy ε of polynomial length so that if a solution is optimal and ε-accurate, then the solution is also the exact optimal continuous solution. The quadratic continuous knapsack problem is known to be solvable in linear time, Brucker (1984). An alternative algorithm for solving the continuous quadratic knapsack problem is to reduce the problem to a QRA. For the specified accuracy we duplicate each entry 1/(ε a_ij) times, where ε is chosen so that any solution that is ε-accurate is also optimal. The resulting quadratic allocation problem is solved using the linear time algorithm in Cosares and Hochbaum (1994) (there is one supply node and therefore k = 1).

10.3 The quadratic CCC and s-excess problems

In the quadratic case our algorithm described in Sect. 9 is implemented to run in strongly polynomial time. This is easily achieved since the derivative functions are linear—a case shown in Gallo et al. (1989) to be solvable in O(mn log(n²/m)). Thus the overall run time of the algorithm is dominated by the complexity of the minimum cut, O(mn log(n²/m)).

11 The complexity of some nonseparable cases

Strong polynomiality of continuous nonseparable problems Even though separable convex quadratic problems may be solvable in strongly polynomial time, the question of strong polynomiality of nonseparable quadratic continuous optimization problems is open. While it is possible that the nonseparable optimization problem is solvable in strongly polynomial time, establishing that is at least as hard as the question of strong polynomiality of linear programming (this insight is due to I. Adler). To see this, observe that the problem of feasibility of linear programming, {Ax = b, x ≥ 0} = ∅?, is equivalent to the following quadratic nonseparable convex problem subject only to nonnegativity constraints:

min (Ax − b)^T (Ax − b)

subject to x ≥ 0.

Since this question is not resolved, we do not investigate it here, as it should be treated in the framework of the strong polynomiality of linear programming.
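The equivalence is easy to check numerically. The sketch below minimizes (Ax − b)^T(Ax − b) over x ≥ 0 by projected gradient descent; the matrix, iteration count and step size are illustrative choices, and any convex QP method would do. A near-zero optimal value certifies feasibility of {Ax = b, x ≥ 0}.

```python
# Feasibility of {x : Ax = b, x >= 0} via the nonseparable convex program
# min (Ax - b)^T (Ax - b) s.t. x >= 0, solved by projected gradient descent.
A = [[1.0, 1.0],
     [1.0, -1.0]]
b = [2.0, 0.0]

def matvec(M, v):
    return [sum(M[i][j] * v[j] for j in range(len(v))) for i in range(len(M))]

def grad(x):
    """Gradient of ||Ax - b||^2, namely 2 A^T (Ax - b)."""
    r = [ri - bi for ri, bi in zip(matvec(A, x), b)]
    At = [[A[i][j] for i in range(len(A))] for j in range(len(A[0]))]
    return [2.0 * g for g in matvec(At, r)]

x = [0.0, 0.0]
step = 0.25   # 1/L for L = 2 * lambda_max(A^T A) = 4 on this instance
for _ in range(500):
    # Gradient step followed by projection onto the nonnegative orthant.
    x = [max(0.0, xi - step * gi) for xi, gi in zip(x, grad(x))]

residual = sum((ri - bi) ** 2 for ri, bi in zip(matvec(A, x), b))
# residual near 0 means Ax = b is attainable with x >= 0; here x -> (1, 1)
```

On this instance the system x_1 + x_2 = 2, x_1 − x_2 = 0 has the nonnegative solution (1, 1), so the quadratic program drives the residual to zero, certifying feasibility.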

The polynomiality of continuous convex nonseparable flow problems Nonseparable con- vex continuous problems, as well as nonseparable quadratic convex continuous problems are solvable in polynomial time as follows. A solution approximating the optimal objective value to the convex continuous problem is obtainable in polynomial time, provided that the of the objective functions are available and that the value of the optimal solution is bounded in a certain interval. Such work, based on the Ellipsoid method, is described by Ne- mirovsky and Yudin (1983). In the quadratic case, exact solutions are possible. Indeed, the polynomial solvability of continuous convex problems over linear constraints was established as a byproduct of the ellipsoid algorithm for linear programming (see Kozlov et al. 1979). The best running time reported to date is by Monteiro and Adler Monteiro and Adler (1989), O(m3L),whereL represents the total length of the input coeffi- cients and m the number of variables. Similar results were also given by Kapoor and Vaidya (1986). Note that these running times are not strongly polynomial.

The NP-completeness of integer quadratic nonseparable problems The case of integer problems that are nonseparable, even if convex, is harder. Nonseparable quadratic integer problems are NP-hard. To see this, consider the following known reduction from the independent set problem. The maximization of the weight of an independent set in a graph is formulated as follows: Given a graph G = (V, E) with nonnegative weights W_v for each v ∈ V, find a subset of vertices U ⊆ V such that for any i, j ∈ U, {i, j} ∉ E, and such that the total weight W(U) = Σ_{v∈U} W_v is maximum. The weighted independent set problem can be posed as the quadratic maximization problem:

max Σ_{v∈V} W_v x_v − W(V) · Σ_{{u,v}∈E} x_u · x_v

x_v ∈ {0, 1}.

Let x* be the optimal solution. The maximum weight independent set is then {v | x*_v = 1}. Note that the reduction also applies to the unweighted case. So even in the absence of the flow balance constraints, the integer problem is NP-hard. The objective function in this case is not necessarily concave. The question is then whether the complexity of the problem is not merely a result of the indefiniteness of the quadratic matrix. The answer is negative, as we demonstrate now. Consider the quadratic minimization problem,

min x^T Qx − d^T x
s.t. x ∈ {0, 1}^n.
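The reduction above can be verified by brute force on a toy graph: the penalty weight W(V) makes any edge inside the chosen set unprofitable, so the quadratic program's maximizer is exactly a maximum-weight independent set. The instance below is an illustrative example.

```python
# Brute-force check that  max sum_v W_v x_v - W(V) * sum_{(u,v) in E} x_u x_v
# over x in {0,1}^n is attained at a maximum-weight independent set.

from itertools import product

def best_quadratic(weights, edges):
    n = len(weights)
    WV = sum(weights)  # W(V), the penalty coefficient
    best_val, best_x = None, None
    for x in product([0, 1], repeat=n):
        val = sum(w * xi for w, xi in zip(weights, x)) \
              - WV * sum(x[u] * x[v] for u, v in edges)
        if best_val is None or val > best_val:
            best_val, best_x = val, x
    return best_val, best_x

# Path graph 0-1-2 with weights 3, 4, 3: the best independent set is {0, 2}.
val, x = best_quadratic([3, 4, 3], [(0, 1), (1, 2)])
print(val, x)  # 6 (1, 0, 1): vertices 0 and 2, weight 3 + 3, beat vertex 1 alone
```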

Baldick (1991) proved that this problem with Q having nonnegative off-diagonal elements is NP-hard. Any convex quadratic minimization is therefore also NP-hard. The proof is by reduction from the set splitting problem (see Garey and Johnson 1979) as follows. Let E be a collection of 2- and 3-element subsets E_1, ..., E_N of {1, ..., n} with N ≤ (1/6)(n + 1)n(n − 1). The set splitting problem is to find a set Z ⊂ {1, ..., n} such that no E_i is contained in Z nor disjoint from it. In that case Z is said to satisfy the splitting property. Deciding whether such a Z exists is NP-complete, Garey and Johnson (1979). Consider the following quadratic function,

f(x) = Σ_{i=1}^{N} ( Σ_{j∈E_i} (2x_j − 1) )^2 − Σ_{i=1}^{N} |E_i|^2.

f(x) is of the form x^T Qx − d^T x with Q positive definite with positive off-diagonal elements of magnitude bounded in N. For k_3 the number of 3-element subsets in E, f(x) ≤ k_3 − Σ_{i=1}^{N} |E_i|^2 if and only if Z = {j | x_j = 1} satisfies the splitting property. In particular, it is NP-hard to find the minimum of f. Hence the nonseparability is the factor that makes this problem hard.
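The threshold criterion in the reduction can be checked exhaustively on a small instance: each 2-element set contributes 0 exactly when split (4 otherwise), each 3-element set contributes 1 exactly when split (9 otherwise), so f(x) reaches k_3 − Σ|E_i|^2 precisely at splitting assignments. The set family below is an illustrative toy instance.

```python
# Verify, on a toy set-splitting instance, that
#   f(x) <= k3 - sum |Ei|^2   iff   Z = {j : x_j = 1} splits every Ei.

from itertools import product

def f(x, sets):
    return sum(sum(2 * x[j] - 1 for j in Ei) ** 2 for Ei in sets) \
           - sum(len(Ei) ** 2 for Ei in sets)

def splits(x, sets):
    # Z splits Ei if Ei is neither contained in Z nor disjoint from Z.
    return all(0 < sum(x[j] for j in Ei) < len(Ei) for Ei in sets)

sets = [{0, 1}, {1, 2, 3}, {0, 2}]          # 2- and 3-element subsets of {0..3}
k3 = sum(1 for Ei in sets if len(Ei) == 3)  # number of 3-element subsets
threshold = k3 - sum(len(Ei) ** 2 for Ei in sets)

for x in product([0, 1], repeat=4):
    assert (f(x, sets) <= threshold) == splits(x, sets)
print("criterion verified on all 16 assignments")
```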

Polynomial cases of nonseparable integer quadratic problems—the “separating scheme” We illustrate one general purpose technique for nonseparable problems that we call a “separating scheme”. The technique relies on converting the objective function into a separable function (e.g. by diagonalizing the matrix Q in the quadratic case). This implies a transformation of variables which affects the constraints. If the new constraints form a totally unimodular matrix, then the proximity-scaling algorithm of Hochbaum and Shanthikumar (1990) for separable convex optimization over totally unimodular constraints can be employed to obtain an optimal integer solution. This proximity-scaling algorithm solves, at each iteration, the scaled problem in integers using linear programming. Consider the nonseparable problem:

min F(x)
s.t. Ax = b, 0 ≤ x ≤ u, x ∈ Z^n.

Suppose there exists an invertible n × n matrix U such that, in the transformed variables y = Ux, the objective F(U^{-1}y) is a separable function of y. Then the problem restated in these variables is:

min F(U^{-1}y)
s.t. AU^{-1}y = b, 0 ≤ U^{-1}y ≤ u, U^{-1}y integer.

Now, if the matrix U is totally unimodular, then the integrality requirement is preserved. If the new matrix of constraints AU^{-1} is totally unimodular, then the nonseparable flow problem is solvable in polynomial time and in integers using the proximity-scaling algorithm.
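The change of variables behind the scheme can be illustrated on a tiny quadratic. The sketch below uses the congruence form of the substitution, x = Uy, under which separability of x^T Qx amounts to U^T Q U being diagonal; the matrices chosen are an illustrative instance, not drawn from the text.

```python
# Separating-scheme sketch: with x = Uy, F(x) = x^T Q x becomes
# y^T (U^T Q U) y, which is separable when U^T Q U is diagonal.

def mat_mul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def transpose(A):
    return [list(col) for col in zip(*A)]

Q = [[1, -1], [-1, 2]]  # F(x) = x1^2 - 2 x1 x2 + 2 x2^2 = (x1 - x2)^2 + x2^2
U = [[1, 1], [0, 1]]    # unimodular: integer entries, determinant 1

D = mat_mul(transpose(U), mat_mul(Q, U))
print(D)  # [[1, 0], [0, 1]] -- diagonal, so F(Uy) = y1^2 + y2^2 is separable

# Spot-check the identity F(Uy) = sum_i D_ii * y_i^2 on integer points.
def F(x):
    return x[0] * x[0] - 2 * x[0] * x[1] + 2 * x[1] * x[1]

for y in [(0, 0), (1, 2), (-3, 5), (4, -1)]:
    x = (U[0][0] * y[0] + U[0][1] * y[1], U[1][0] * y[0] + U[1][1] * y[1])
    assert F(x) == D[0][0] * y[0] ** 2 + D[1][1] * y[1] ** 2
```

Since U is unimodular with integer inverse, y is integral exactly when x is, which is the integrality-preservation point made above.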

Consider now the separating scheme for quadratic objective functions. The formulation of a quadratic nonseparable problem on box constraints is as follows:

max Σ_{i=1}^{n} d_i x_i + Σ_{i=1}^{n} Σ_{j=1}^{n} q_{ij} · x_i · x_j

ℓ_i ≤ x_i ≤ u_i,

x_i integer.

The idea of making the problem separable so that the resulting constraint matrix is totally unimodular is translated here to finding a totally unimodular matrix U, so that for the matrix Q = (q_{ij}), U^{-1}QU is a diagonal matrix. Baldick and Wu (1990) used this approach for a problem of electric distribution systems where only box constraints are present. Baldick (1991) has further identified several classes of matrices Q for which a “diagonalizing” scheme with a totally unimodular matrix exists. The two classes are:
(1) Diagonally dominant matrices: q_{ii} ≥ Σ_{j≠i} |q_{ij}|.
(2) Matrices with forest structure: these are matrices with a partial order on the positive coefficients inducing a forest.

For both these classes, with A empty, there are polynomial algorithms. Also, if the constraint matrix AU^{-1} is totally unimodular, then integer solutions can still be obtained in polynomial time. Continuous solutions can be obtained in polynomial time if the largest subdeterminant of the constraint matrix is bounded by a polynomial (Hochbaum and Shanthikumar 1990).
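Membership in class (1) is straightforward to test directly from the definition q_ii ≥ Σ_{j≠i} |q_ij|; a minimal hypothetical helper:

```python
# Test diagonal dominance: q_ii >= sum over j != i of |q_ij|, for every row i.

def diagonally_dominant(Q):
    return all(Q[i][i] >= sum(abs(Q[i][j]) for j in range(len(Q)) if j != i)
               for i in range(len(Q)))

print(diagonally_dominant([[3, -1, 1], [0, 2, -2], [1, 1, 4]]))  # True
print(diagonally_dominant([[1, 2], [2, 1]]))                     # False
```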

Miscellaneous polynomial cases Barahona (1986) proved that quadratic nonseparable 0-1 optimization is polynomially solvable if the quadratic matrix Q has a series-parallel graph structure. That is, there exists a series-parallel graph G = (V, E) with q_{ij} ≠ 0 if and only if (i, j) ∈ E. The algorithm involves transforming the problem into a maximum cut problem which is in turn solved recursively using the fact that the underlying graph is series-parallel.

A class of nonseparable problems over box constraints is solvable in strongly polynomial time if in the objective min x^T Qx − d^T x all elements of Q are nonpositive. This type of problem is solvable by transforming it into the selection problem and hence a minimum cut problem on a bipartite network. This transformation, however, is not a separating scheme as it is a nonlinear transformation. Hochbaum (1989) has generalized this class to include all “bipartite polynomials”. This generalized class is identified by a property of the multivariate polynomial objective function called the bipartition property. This property can be easily described on a graph for quadratic objective functions: G = (V, E) with V = V1 ∪ V2 so that q_{ij} > 0 only if both i, j ∈ V1 or i, j ∈ V2. This property was discovered independently, for quadratic objective functions, by Hansen and Simeone (1986). In Hansen and Simeone (1986), an objective function with this property is called a unate function. With this property a modified reduction still works to transform the problem into a minimum cut problem which is then solved in polynomial time.

In Hochbaum et al. (1992) a “high multiplicity” minimum weighted tardiness scheduling problem was discussed. This problem was formulated as a quadratic transportation problem with a nonseparable objective function. This problem is unique among the problems discussed in this section in that the set of constraints is not empty.
In that problem, the right hand sides (supplies and demands) and the linear coefficients in the objective function are large, so the aim was to find an algorithm for the integer case whose running time is independent of these numbers. Such an algorithm was found by solving a related continuous problem (not a relaxation), the solution of which could be rounded using a simple procedure to derive an optimal integer solution.

All problems presented in this section are special classes. There is still a need to discover the extent of the polynomial solvability of nonseparable network flow problems, although this cannot be expected to be as unified as in the separable case.

12 Conclusions and open problems

We survey in this paper a collection of results pertaining to nonlinear optimization problems. Several classes of nonlinear problems, such as concave separable or convex nonseparable problems, are NP-hard, and the emphasis is on developing algorithms for polynomial subclasses. For the convex separable flow problem there are polynomial algorithms and even lower bounds indicating the impossibility of strongly polynomial algorithms for the nonquadratic instances. This work leaves a number of questions unanswered. The major ones among these are:

1. For convex quadratic separable problems, either prove a lower bound establishing the impossibility of strongly polynomial algorithms, or identify a strongly polynomial algorithm. We conjecture that the latter is possible.

2. For the well solvable case of convex separable network flow, improve the capacity scaling algorithm to an algorithm that depends on the double logarithm of B rather than on the logarithm of B. This may involve techniques borrowed from those used to find roots of a system of polynomials.

3. Delineate and generalize the largest possible subclasses of nonseparable convex cases that are solvable in polynomial time. In particular, there has been little research involving such problems with a nonempty set of flow balance constraints. As a result there is little insight into the behavior of network flow optimal solutions in the presence of such nonseparable costs.

4. Tighten proximity theorems or find threshold theorems for classes of problems other than the ones reported here, with the resulting improved algorithms for solving the problems.

Acknowledgement This research has been supported in part by NSF award No. DMI-0620677.

References

Ahuja, R. K., & Orlin, J. B. (2001b). Inverse optimization. Operations Research, 49, 771–783.
Ahuja, R. K., Batra, J. L., & Gupta, S. K. (1984). A parametric algorithm for the convex cost network flow and related problems. European Journal of Operational Research, 16, 222–235.
Ahuja, R. K., Hochbaum, D. S., & Orlin, J. B. (2003). Solving the convex cost integer dual network flow problem. Management Science, 49, 950–964.
Ahuja, R. K., Hochbaum, D. S., & Orlin, J. B. (2004). A cut based algorithm for the nonlinear dual of the minimum cost network flow problem. Algorithmica, 39, 189–208.
Ahuja, R. K., Magnanti, T. L., & Orlin, J. B. (1993). Network flows: Theory, algorithms and applications. New Jersey: Prentice Hall.
Baldick, R. (1991). A unification of polynomially solvable cases of integer ‘non-separable’ quadratic optimization. Lawrence Berkeley Laboratory manuscript.
Baldick, R., & Wu, F. F. (1990). Efficient integer optimization algorithms for optimal coordination of capacitors and regulators. IEEE Transactions on Power Systems, 5, 805–812.

Barahona, F. (1986). A solvable case of quadratic 0-1 programming. Discrete Applied Mathematics, 13, 23–28.
Barlow, R. E., Bartholomew, D. J., Bremner, J. M., & Brunk, H. D. (1972). Statistical inference under order restrictions. New York: Wiley.
Blum, M., Floyd, R. W., Pratt, V. R., Rivest, R. L., & Tarjan, R. E. (1972). Time bounds for selection. Journal of Computer Systems Science, 7, 448–461.
Brucker, P. (1984). An O(n) algorithm for quadratic knapsack problems. Operations Research Letters, 3, 163–166.
Burton, D., & Toint, Ph. L. (1992). On an instance of the inverse shortest paths problem. Mathematical Programming, 53, 45–61.
Burton, D., & Toint, Ph. L. (1994). On the use of an inverse shortest paths algorithm for recovering linearly correlated costs. Mathematical Programming, 63, 1–22.
Busacker, R. G., & Gowen, P. J. (1961). A procedure for determining minimal-cost network flow patterns. Operational Research Office, Johns Hopkins University, Baltimore, MD.
Cosares, S., & Hochbaum, D. S. (1994). A strongly polynomial algorithm for the quadratic transportation problem with fixed number of suppliers. Mathematics of Operations Research, 19, 94–111.
Dantzig, G. B. (1963). Linear programming and extensions. New Jersey: Princeton University Press.
Dennis, J. B. (1959). Mathematical programming and electrical networks. In Technology press research monographs (pp. 74–75). New York: Technology Press and Wiley.
Edmonds, J., & Karp, R. M. (1972). Theoretical improvements in algorithmic efficiency for network flow problems. Journal of ACM, 19, 248–264.
Erickson, R. E., Monma, C. L., & Veinott, A. F. (1987). Send-and-split method for minimum-concave-cost network flows. Mathematics of Operations Research, 12, 634–664.
Federgruen, A., & Groenevelt, H. (1986a). The greedy procedure for resource allocation problems: Necessary and sufficient conditions for optimality. Operations Research, 34, 909–918.
Federgruen, A., & Groenevelt, H. (1986b). Optimal flows in networks with multiple sources and sinks, with applications to oil and gas lease investment programs. Operations Research, 34, 218–225.
Frederickson, G. N., & Johnson, D. B. (1982). The complexity of selection and rankings in X + Y and matrices with sorted columns. Journal of Computing System Science, 24, 197–208.
Fourer, R. (1988). A simplex algorithm for piecewise-linear programming: Finiteness, feasibility and degeneracy. Mathematical Programming, 41, 281–316.
Gale, D. (1957). A theorem of flows in networks. Pacific Journal of Mathematics, 7, 1073–1082.
Gallo, G., Grigoriadis, M. D., & Tarjan, R. E. (1989). A fast parametric maximum flow algorithm and applications. SIAM Journal of Computing, 18, 30–55.
Garey, M., & Johnson, D. (1979). Computers and intractability, a guide to the theory of NP-completeness. New York: Freeman.
Goldberg, A. V., & Tarjan, R. E. (1988). A new approach to the maximum flow problem. Journal of the ACM, 35, 921–940.
Granot, F., & Skorin-Kapov, J. (1990). Some proximity and sensitivity results in quadratic integer programming. Mathematical Programming, 47, 259–268.
Guisewite, G., & Pardalos, P. M. (1990). Minimum concave cost network flow problems: Applications, complexity, and algorithms. Annals of Operations Research, 25, 75–100.
Hansen, P., & Simeone, B. (1986). Unimodular functions. Discrete Applied Mathematics, 14, 269–281.
Hochbaum, D. S. (1989). On a polynomial class of nonlinear optimization problems. Manuscript, U.C. Berkeley.
Hochbaum, D. S. (1993). Polynomial algorithms for convex network optimization. In D. Du, M. Pardalos (Eds.), Network optimization problems: algorithms, complexity and applications (pp. 63–92). Singapore: World Scientific.
Hochbaum, D. S. (1994). Lower and upper bounds for allocation problems. Mathematics of Operations Research, 19, 390–409.
Hochbaum, D. S. (1995). A nonlinear knapsack problem. Operations Research Letters, 17, 103–110.
Hochbaum, D. S. (1998). The pseudoflow algorithm for the maximum flow problem. Manuscript, UC Berkeley, revised 2003. Extended abstract in: Boyd & Rios-Mercado (Eds.), Lecture notes in computer science: Vol. 1412. The pseudoflow algorithm and the pseudoflow-based simplex for the maximum flow problem. Proceedings of IPCO98 (pp. 325–337), Bixby, June 1998. New York: Springer.
Hochbaum, D. S. (2001). An efficient algorithm for image segmentation, Markov random fields and related problems. Journal of the ACM, 48, 686–701.
Hochbaum, D. S. (2002). The inverse shortest paths problem. Manuscript, UC Berkeley.
Hochbaum, D. S. (2003). Efficient algorithms for the inverse spanning tree problem. Operations Research, 51, 785–797.

Hochbaum, D. S. (2005). Complexity and algorithms for convex network optimization and other nonlinear problems. 4OR, 3, 171–216.
Hochbaum, D. S., & Hong, S. P. (1995). About strongly polynomial time algorithms for quadratic optimization over submodular constraints. Mathematical Programming, 69, 269–309.
Hochbaum, D. S., & Hong, S. P. (1996). On the complexity of the production-transportation problem. SIAM Journal on Optimization, 6, 250–264.
Hochbaum, D. S., & Queyranne, M. (2003). The convex cost closure problem. SIAM Journal on Discrete Mathematics, 16, 192–207.
Hochbaum, D. S., & Seshadri, S. (1993). The empirical performance of a polynomial algorithm for constrained nonlinear optimization. Annals of Operations Research, 43, 229–248.
Hochbaum, D. S., & Shanthikumar, J. G. (1990). Convex separable optimization is not much harder than linear optimization. Journal of the ACM, 37, 843–862.
Hochbaum, D. S., Shamir, R., & Shanthikumar, J. G. (1992). A polynomial algorithm for an integer quadratic nonseparable transportation problem. Mathematical Programming, 55, 359–372.
Hoffman, A. J. (1960). Some recent applications of the theory of linear inequalities to extremal combinatorial analysis. In R. Bellman, M. Hall Jr. (Eds.), Proceedings of Symposia in Applied Mathematics: Vol. X. Combinatorial analysis (pp. 113–127). Providence: American Mathematical Society.
Ibaraki, T., & Katoh, N. (1988). Resource allocation problems: Algorithmic approaches. Boston: MIT Press.
Iri, M. (1960). A new method of solving transportation network problems. Journal of the Operations Research Society of Japan, 3, 27–87.
Jewell, W. S. (1958). Optimal flow through networks. Technical report No. 8, Operations Research Center, MIT, Cambridge.
Kapoor, S., & Vaidya, P. M. (1986). Fast algorithms for convex quadratic programming and multicommodity flows. In Proceedings of the 18th symposium on theory of computing (pp. 147–159).
Karzanov, A. V., & McCormick, S. T. (1997). Polynomial methods for separable convex optimization in unimodular linear spaces with applications. SIAM Journal on Computing, 26, 1245–1275.
Knuth, D. (1973). The art of computer programming: Vol. 3. Sorting and searching. Reading: Addison-Wesley.
Kozlov, M. K., Tarasov, S. P., & Khachian, L. G. (1979). Polynomial solvability of convex quadratic programming. Doklady Akad. Nauk SSSR, 5, 1051–1053 (Translated in Soviet Mathematics Doklady 20 (1979), 1108–1111).
Lawler, E. (1979). Fast approximation algorithms for knapsack problems. Mathematics of Operations Research, 4, 339–356.
Mansour, Y., Schieber, B., & Tiwari, P. (1991). Lower bounds for computations with the floor operation. SIAM Journal on Computing, 20, 315–327.
Megiddo, N., & Tamir, A. (1993). Linear time algorithms for some separable quadratic programming problems. Operations Research Letters, 13, 203–211.
Minoux, M. (1984). A polynomial algorithm for minimum quadratic cost flow problems. European Journal of Operational Research, 18, 377–387.
Minoux, M. (1986). Solving integer minimum cost flows with separable convex cost objective polynomially. Mathematical Programming Study, 26, 237–239.
Minoux, M. (1986). Mathematical programming, theory and algorithms. New York: Wiley, Chaps. 5, 6.
Monteiro, R. D. C., & Adler, I. (1989). Interior path following primal-dual algorithms. Part II: Convex quadratic programming. Mathematical Programming, 44, 43–66.
Moriguchi, S., & Shioura, A. (2004). On Hochbaum’s proximity-scaling algorithm for the general resource allocation problem. Mathematics of Operations Research, 29, 394–397.
Nemirovsky, A. S., & Yudin, D. B. (1983). Problem complexity and method efficiency in optimization. New York: Wiley.
Papadimitriou, C. H., & Steiglitz, K. (1982). Combinatorial optimization: algorithms and complexity. New Jersey: Prentice Hall.
Picard, J. C. (1976). Maximal closure of a graph and applications to combinatorial problems. Management Science, 22, 1268–1272.
Pinto, Y., & Shamir, R. (1994). Efficient algorithms for minimum-cost flow problems with piecewise-linear convex costs. Algorithmica, 11(3), 256–276.
Radzik, T. (1993). Parametric flows, weighted means of cuts, and fractional combinatorial optimization. In P.M. Pardalos (Ed.), Complexity in numerical optimization (pp. 351–386). Singapore: World Scientific.
Renegar, J. (1987). On the worst case arithmetic complexity of approximating zeros of polynomials. Journal of Complexity, 3, 90–113.
Rote, G., & Zachariasen, M. (2007, to appear). Matrix scaling by network flow. In Proceedings of SODA07.
Sahni, S. (1974). Computationally related problems. SIAM Journal on Computing, 3, 262–279.

Shub, M., & Smale, S. (1986). Computational complexity: On the geometry of polynomials and a theory of cost, II. SIAM Journal on Computing, 15, 145–161.
Sun, J., Tsai, K. -H., & Qi, L. (1993). A simplex method for network programs with convex separable piecewise linear costs and its application to stochastic transshipment problems. In D. Du & P. M. Pardalos (Eds.), Network optimization problems: Algorithms, complexity and applications (pp. 283–300). Singapore: World Scientific.
Tamir, A. (1993). A strongly polynomial algorithm for minimum convex separable quadratic cost flow problems on series-parallel networks. Mathematical Programming, 59, 117–132.
Tardos, E. (1985). A strongly polynomial minimum cost circulation algorithm. Combinatorica, 5, 247–255.
Tardos, E. (1986). A strongly polynomial algorithm to solve combinatorial linear programs. Operations Research, 34, 250–256.
Värbrand, P., Tuy, H., Ghannadan, S., & Migdalas, A. (1995). The minimum concave cost network flow problems with fixed number of sources and non-linear arc costs. Journal of Global Optimisation, 6, 135–151.
Värbrand, P., Tuy, H., Ghannadan, S., & Migdalas, A. (1996). A strongly polynomial algorithm for a concave production-transportation problem with a fixed number of non-linear variables. Mathematical Programming, 72, 229–258.